Skip to content
@NVIDIA

NVIDIA Corporation

Pinned Loading

  1. cuopt cuopt Public

    NVIDIA cuOpt is an open-source GPU-accelerated optimization engine delivering near real-time solutions for complex decision-making challenges.

    Cuda 189 25

  2. cuopt-examples cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook 327 45

  3. open-gpu-kernel-modules open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 15.9k 1.4k

  4. aistore aistore Public

    AIStore: scalable storage for AI applications

    Go 1.5k 211

  5. nvidia-container-toolkit nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go 3.3k 355

  6. GenerativeAIExamples GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook 3.2k 766

Repositories

Showing 10 of 580 repositories
  • TensorRT-LLM Public

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.

    NVIDIA/TensorRT-LLM’s past year of commit activity
    C++ 10,779 Apache-2.0 1,506 616 265 Updated Jun 19, 2025
  • KAI-Scheduler Public

    KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale

    NVIDIA/KAI-Scheduler’s past year of commit activity
    Go 651 Apache-2.0 71 18 (2 issues need help) 15 Updated Jun 19, 2025
  • cuda-quantum Public

    C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    NVIDIA/cuda-quantum’s past year of commit activity
    C++ 725 255 375 (17 issues need help) 73 Updated Jun 19, 2025
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    NVIDIA/Megatron-LM’s past year of commit activity
    Python 12,604 2,854 309 201 Updated Jun 19, 2025
  • compute-eval Public

    Evaluating Large Language Models for CUDA Code Generation ComputeEval is a framework designed to generate and evaluate CUDA code from Large Language Models.

    NVIDIA/compute-eval’s past year of commit activity
    Python 49 9 0 3 Updated Jun 19, 2025
  • NeMo-Skills Public

    A project to improve skills of large language models

    NVIDIA/NeMo-Skills’s past year of commit activity
    Python 426 Apache-2.0 73 14 6 Updated Jun 19, 2025
  • NeMo Public

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

    NVIDIA/NeMo’s past year of commit activity
    Python 14,856 Apache-2.0 2,942 59 101 Updated Jun 19, 2025
  • NeMo-Guardrails Public

    NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

    NVIDIA/NeMo-Guardrails’s past year of commit activity
    Python 4,810 489 117 (4 issues need help) 37 Updated Jun 19, 2025
  • Fuser Public

    A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

    NVIDIA/Fuser’s past year of commit activity
    C++ 336 61 252 (11 issues need help) 163 Updated Jun 19, 2025
  • cuda-q-academic Public

    This repo contains CUDA-Q Academic materials, including self-paced Jupyter notebook modules for building and optimizing hybrid quantum-classical algorithms using CUDA-Q.

    NVIDIA/cuda-q-academic’s past year of commit activity
    Jupyter Notebook 131 33 0 5 Updated Jun 19, 2025