    Repositories list

    • client

      Public
      Triton Python, C++ and Java client libraries, and gRPC-generated client examples for Go, Java and Scala.
      Python
      Updated Oct 29, 2025
    • The Triton backend for PyTorch TorchScript models.
      C++
      Updated Oct 29, 2025
    • Python
      Updated Oct 29, 2025
    • server

      Public
      The Triton Inference Server provides an optimized cloud and edge inferencing solution.
      Python
      Updated Oct 29, 2025
    • Triton Model Analyzer is a CLI tool that helps you understand the compute and memory requirements of Triton Inference Server models.
      Python
      Updated Oct 29, 2025
    • The Triton TensorRT-LLM Backend
      Updated Oct 28, 2025
    • core

      Public
      The core library and APIs implementing the Triton Inference Server.
      C++
      Updated Oct 28, 2025
    • common

      Public
      Common source, scripts and utilities shared across all Triton repositories.
      C++
      Updated Oct 27, 2025
    • Triton backend that enables pre-processing, post-processing and other logic to be implemented in Python.
      C++
      Updated Oct 15, 2025
    • The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's Python API.
      C++
      Updated Oct 13, 2025
    • tutorials

      Public
      This repository contains tutorials and examples for Triton Inference Server.
      Python
      Updated Oct 10, 2025
    • Triton CLI is an open-source command-line interface that enables users to create, deploy, and profile models served by the Triton Inference Server.
      Python
      Updated Oct 10, 2025
    • Third-party source packages that are modified for use in Triton.
      C
      Updated Oct 10, 2025
    • The Triton backend for TensorRT.
      C++
      Updated Oct 10, 2025
    • Simple Triton backend used for testing.
      C++
      Updated Oct 10, 2025
    • An example Triton backend that demonstrates sending zero, one, or multiple responses for each request.
      C++
      Updated Oct 10, 2025
    • TRITONCACHE implementation of a Redis cache.
      C++
      Updated Oct 10, 2025
    • Python
      Updated Oct 10, 2025
    • OpenVINO backend for Triton.
      C++
      Updated Oct 10, 2025
    • The Triton backend for the ONNX Runtime.
      C++
      Updated Oct 10, 2025
    • Implementation of a local in-memory cache for Triton Inference Server's TRITONCACHE API.
      C++
      Updated Oct 10, 2025
    • Example Triton backend that demonstrates most of the Triton Backend API.
      C++
      Updated Oct 10, 2025
    • C++
      Updated Oct 10, 2025
    • The Triton repository agent that verifies model checksums.
      C++
      Updated Oct 10, 2025
    • backend

      Public
      Common source, scripts and utilities for creating Triton backends.
      C++
      Updated Oct 10, 2025
    • FIL backend for the Triton Inference Server.
      Jupyter Notebook
      Updated Oct 8, 2025
    • pytriton

      Public
      PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
      Python
      Updated Aug 13, 2025
    • The Triton backend for TensorFlow.
      C++
      Updated Jun 18, 2025
    • Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
      Python
      Updated Apr 22, 2025
    • .github

      Public
      Community health files for NVIDIA Triton.
      Updated Mar 27, 2025