PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Gpt Oss Python Packages

Python packages with the GitHub topic gpt-oss. Sorted by relevance, with stars and monthly downloads.
sgl-project
sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

303.2M 28K 6K
vllm-project
vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

6.2M 81K 17K
unslothai
unsloth

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

2.2M 65K 6K
unslothai
unsloth-zoo

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

1.4M 65K 6K
sgl-project
sglang-kernel

SGLang is a high-performance serving framework for large language models and multimodal models.

330K 28K 6K
sgl-project
sgl-kernel

SGLang is a high-performance serving framework for large language models and multimodal models.

295K 28K 6K
lightseekorg
tokenspeed-mla

TokenSpeed is a speed-of-light LLM inference engine.

236K 1K 94
vllm-project
vllm-tpu

A high-throughput and memory-efficient inference and serving engine for LLMs

170K 81K 17K
modelscope
mcore-bridge

MCore-Bridge: Providing Megatron-Core model definitions for state-of-the-art large models and making Megatron training as simple as Transformers — with support for 300+ large language models and 200+ multimodal large models.

14K 62 13
NVIDIA
nemo-automodel

🚀 Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

9K 505 153
yichuan-w
leann

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

6K 11K 1K
lightseekorg
tokenspeed-smg

TokenSpeed is a speed-of-light LLM inference engine.

5K 1K 94
sgl-project
sglang-kt

SGLang is a high-performance serving framework for large language models and multimodal models.

4K 28K 6K
oumi-ai
oumi

Easily fine-tune, evaluate and deploy Gemma 4, Qwen3.5, Qwen3.6, gpt-oss, DeepSeek-R1, or any open source LLM / VLM!

3K 9K 765
InternLM
xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

2K 5K 422
unslothai
unsloth-studio

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

1K 65K 6K
Perpetue237
agentsculptor

AgentSculptor: Refactor, restructure & modernize codebases with natural language — powered by GPT-OSS and vLLM.

761 11 0
vllm-project
ai-dynamo-vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

620 81K 17K
sgl-project
dblcsgen

SGLang is a high-performance serving framework for large language models and multimodal models.

571 28K 6K
vllm-project
vllm-acc

A high-throughput and memory-efficient inference and serving engine for LLMs

536 81K 17K
vllm-project
vllm-xft

A high-throughput and memory-efficient inference and serving engine for LLMs

510 81K 17K
GGUFloader
ggufloader

GGUF Loader with its Agentic Mode, and floating button, ai Models | Open Source & Offline. Mistral, Deepseek, llama, gemma, qwen

505 48 11
unslothai
indigo-print

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

496 65K 6K
vllm-project
wxy-test

A high-throughput and memory-efficient inference and serving engine for LLMs

407 2K 1K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery