PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Qwen Python Packages

Python packages with the GitHub topic qwen. Sorted by relevance, with stars and monthly downloads.
sgl-project
sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

303.2M 28K 6K
huggingface
transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

149.1M 161K 33K
vllm-project
vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

6.2M 81K 17K
unslothai
unsloth

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

2.2M 65K 6K
unslothai
unsloth-zoo

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

1.4M 65K 6K
sgl-project
sglang-kernel

SGLang is a high-performance serving framework for large language models and multimodal models.

330K 28K 6K
sgl-project
sgl-kernel

SGLang is a high-performance serving framework for large language models and multimodal models.

295K 28K 6K
lightseekorg
tokenspeed-mla

TokenSpeed is a speed-of-light LLM inference engine.

236K 1K 94
hiyouga
llamafactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

235K 71K 9K
vllm-project
vllm-tpu

A high-throughput and memory-efficient inference and serving engine for LLMs

170K 81K 17K
huggingface
pytorch-pretrained-bert

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

85K 161K 33K
hud-evals
hud-python

OSS RL environment + evals toolkit

66K 254 57
raullenchai
rapid-mlx

The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s cached TTFT, 100% tool calling. 17 tool parsers, prompt cache, reasoning separation, cloud routing. Drop-in OpenAI replacement. Works with Claude Code, Cursor, Aider.

48K 2K 287
huggingface
pytorch-transformers-pvt-nightly

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

43K 161K 33K
xorbitsai
xinference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

41K 9K 824
filipstrand
mflux

MLX native implementations of state-of-the-art generative image models

36K 2K 143
huggingface
pytorch-transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

32K 161K 33K
youssofal
mtplx

2.24x decode TPS increase On Qwen 3.6 27B @ temp 0.6 | Native MTP Speculative Decoding On Apple Silicon With No External Drafter.

16K 492 19
Tencent
angelslim

Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.

7K 1K 131
snowby666
poe-api-wrapper

👾 A Python API wrapper for Poe.com. With this, you will have free access to GPT-4, Claude, Llama, Gemini, Mistral and more! 🚀

7K 1K 142
ThreeFish-AI
coding-proxy

A High-Availability, Transparent, and Smart Multi-Vendor Proxy for Claude Code. Support Claude Plans, GitHub Copilot, Google Antigravity, ZAI/GLM, MiniMax, Qwen, Xiaomi, Kimi, Doubao...

6K 16 2
Shelpuk-AI-Technology-Consulting
kitty-bridge

Universal LLM bridge for AI agents. Use Claude Code with MiniMax, Codex with GLM, or Gemini CLI with OpenRouter — one command, any provider. Works with coding agents, OpenClaw, Hermes, and others.

6K 10 3
lightseekorg
tokenspeed-smg

TokenSpeed is a speed-of-light LLM inference engine.

5K 1K 94
TeamKillerX
akenoai

AkenoAi Python Wrapper For Plus+

5K 6 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery