Inferless
Popular repositories
- triton-co-pilot (Public): Generate glue code in seconds to simplify your NVIDIA Triton Inference Server deployments.
- whisper-large-v3 (Public template): State-of-the-art speech recognition model for English, delivering transcription accuracy across diverse audio scenarios. <metadata> gpu: T4 | collections: ["CTranslate2"] </metadata>
- qwq-32b-preview (Public template): A 32B experimental reasoning model for advanced text generation and robust instruction following. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
- deepseek-r1-distill-qwen-32b (Public template): A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
Repositories
- chatterbox (Public template): Chatterbox is a TTS model by Resemble AI featuring emotion exaggeration control, zero-shot voice cloning, alignment-informed real-time synthesis, and built-in PerTh neural watermarking for responsible, high-quality speech generation. <metadata> gpu: A10 | collections: ["HF_Transformers"] </metadata>
- qwen3-30b-a3b-instruct-2507 (Public template): 30.5B MoE language model from the Qwen team, tuned for broad instruction following, reasoning, multilingual tasks, and agentic tool use. <metadata> gpu: A100 | collections: ["HF_Transformers"] </metadata>
- flux-1-krea-dev (Public template): 12B model distilled from Krea 1, designed to deliver highly photorealistic results. <metadata> gpu: A100 | collections: ["HF_Transformers"] </metadata>
- code-debugging-agent (Public)
- dia-1.6b (Public)
- qwen-image (Public)
- pyannote-speaker-diarization-3.1 (Public template): A state-of-the-art model that segments and labels audio recordings by accurately distinguishing different speakers. <metadata> gpu: T4 | collections: ["HF Transformers"] </metadata>
- facebook-bart-cnn (Public template): A variant of the BART model designed specifically for natural language summarization. It was pre-trained on a large corpus of English text and later fine-tuned on the CNN/Daily Mail dataset. <metadata> gpu: T4 | collections: ["HF Transformers"] </metadata>