Skip to content
Change the repository type filter

All

    Repositories list

    • cccl

      Public
      CUDA Core Compute Libraries
      C++
      3002.1k1.1k204Updated Dec 10, 2025Dec 10, 2025
    • TensorRT-LLM

      Public
      TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
      Python
      1.9k12k580463Updated Dec 10, 2025Dec 10, 2025
    • JAX-Toolbox

      Public
      JAX-Toolbox
      Python
      683678047Updated Dec 10, 2025Dec 10, 2025
    • KAI-Scheduler

      Public
      KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
      Go
      1169622234Updated Dec 10, 2025Dec 10, 2025
    • NVSentinel

      Public
      NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
      Go
      27112349Updated Dec 10, 2025Dec 10, 2025
    • Megatron-LM

      Public
      Ongoing research training transformer models at scale
      Python
      3.4k14k333237Updated Dec 10, 2025Dec 10, 2025
    • cuda-quantum

      Public
      C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
      C++
      30987140786Updated Dec 10, 2025Dec 10, 2025
    • spark-rapids-jni

      Public
      RAPIDS Accelerator JNI For Apache Spark
      Cuda
      7552835Updated Dec 10, 2025Dec 10, 2025
    • ACCV-Lab

      Public
      Accelerated Computer Vision Lab (ACCV-Lab) is a systematic collection of packages with the common goal to facilitate end-to-end efficient training in the ADAS domain, each package offering tools & best practices for a specific aspect/task in this domain.
      Python
      42310Updated Dec 10, 2025Dec 10, 2025
    • nvidia-container-toolkit

      Public
      Build and run containers leveraging NVIDIA GPUs
      Go
      4483.9k11819Updated Dec 10, 2025Dec 10, 2025
    • Model-Optimizer

      Public
      A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
      Python
      2111.6k6853Updated Dec 10, 2025Dec 10, 2025
    • DALI

      Public
      A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
      C++
      6515.6k22334Updated Dec 10, 2025Dec 10, 2025
    • Fuser

      Public
      A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
      C++
      71365207211Updated Dec 10, 2025Dec 10, 2025
    • cloudai

      Public
      CloudAI Benchmark Framework
      Python
      387626Updated Dec 10, 2025Dec 10, 2025
    • doca-platform

      Public
      DOCA Platform manages provisioning and service orchestration for Bluefield DPUs
      Go
      146200Updated Dec 10, 2025Dec 10, 2025
    • k8s-device-plugin

      Public
      NVIDIA device plugin for Kubernetes
      Go
      7593.6k7538Updated Dec 10, 2025Dec 10, 2025
    • OSMO

      Public
      The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML
      Python
      35767Updated Dec 10, 2025Dec 10, 2025
    • gpu-operator

      Public
      NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
      Go
      4242.4k9475Updated Dec 10, 2025Dec 10, 2025
    • NeMo-Agent-Toolkit

      Public
      The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
      Python
      4461.6k5429Updated Dec 10, 2025Dec 10, 2025
    • aerial-cuda-accelerated-ran

      Public
      An SDK (Software Development Kit) for building commercial-grade, AI-native, 3GPP, and O-RAN compliant 5G/6G gNB software on NVIDIA-accelerated computing platforms.
      C++
      71910Updated Dec 10, 2025Dec 10, 2025
    • aerial-framework

      Public
      A toolchain for generating high-performance, GPU-accelerated 5G/6G pipelines from Python and a modular, real-time runtime for executing the pipelines on NVIDIA Aerial™ RAN Computer platforms.
      C++
      2900Updated Dec 10, 2025Dec 10, 2025
    • tilus

      Public
      Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.
      Python
      1442080Updated Dec 10, 2025Dec 10, 2025
    • numba-cuda

      Public
      The CUDA target for Numba
      Python
      472239925Updated Dec 10, 2025Dec 10, 2025
    • accelerated-computing-hub

      Public
      NVIDIA curated collection of educational resources related to general purpose GPU programming.
      Jupyter Notebook
      169952142Updated Dec 10, 2025Dec 10, 2025
    • nv-ingest

      Public
      NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
      Python
      2792.8k10137Updated Dec 10, 2025Dec 10, 2025
    • libmctp

      Public
      C
      36800Updated Dec 10, 2025Dec 10, 2025
    • nsmd

      Public
      MCTP VDM-based Nvidia System Management API
      C++
      1410Updated Dec 10, 2025Dec 10, 2025
    • TileGym

      Public
      Helpful kernel tutorials and examples for tile-based GPU programming
      Python
      1532701Updated Dec 10, 2025Dec 10, 2025
    • cudaqx

      Public
      Accelerated libraries for quantum-classical computing built on CUDA-Q.
      C++
      37702712Updated Dec 10, 2025Dec 10, 2025
    • cuopt

      Public
      GPU accelerated decision optimization
      Cuda
      1016097522Updated Dec 10, 2025Dec 10, 2025