PyRank

Qwen3 Python Packages

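Python packages whose GitHub repository carries the topic qwen3, sorted by relevance. Each entry gives the repository owner, the package name, a description, and a line of counts: monthly downloads first, then GitHub stars, then a third repository-level count (apparently forks).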
vllm-project
vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

6.2M 81K 17K
vllm-project
vllm-tpu

A high-throughput and memory-efficient inference and serving engine for LLMs

170K 81K 17K
modelscope
ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, Phi4, ...) (AAAI 2025).

141K 14K 1K
hud-evals
hud-python

OSS RL environment + evals toolkit

66K 254 57
n24q02m
qwen3-embed

Lightweight Qwen3 text embedding and reranking via ONNX Runtime and GGUF

10K 3 0
NVIDIA
nemo-automodel

🚀 PyTorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

9K 505 153
julep-ai
steadytext

Deterministic text generation and embeddings with zero configuration

1K 43 2
thc1006
taiwan-asr-toolkit

Production-grade Traditional Chinese / Taiwan Mandarin speech-to-text. Qwen3-ASR + MediaTek Breeze-ASR-25, hot-word injection, LLM polish, speaker diarization. RTF up to 1554x on RTX 5090, 56 TDD tests.

1K 2 0
vllm-project
ai-dynamo-vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

620 81K 17K
zilliztech
deepsearcher

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

602 8K 758
vllm-project
vllm-acc

A high-throughput and memory-efficient inference and serving engine for LLMs

536 81K 17K
vllm-project
vllm-xft

A high-throughput and memory-efficient inference and serving engine for LLMs

510 81K 17K
GGUFloader
ggufloader

GGUF Loader with its Agentic Mode, and floating button, ai Models | Open Source & Offline. Mistral, Deepseek, llama, gemma, qwen

505 48 11
FluffyAIcode
kakeyalattice

Discrete Kakeya cover for LLM KV cache: D4/E8 nested-lattice quantisation realising a Kakeya-style tube-cover over the direction sphere. 2.4x-2.8x compression at <1% perplexity loss on Qwen3, Llama-3, DeepSeek, GLM-4, Gemma. Drop-in transformers.DynamicCache. pip install kakeyalattice.

458 8 2
vllm-project
wxy-test

A high-throughput and memory-efficient inference and serving engine for LLMs

407 2K 1K
vllm-project
nextai-vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

406 81K 17K
Keyvanhardani
german-ocr

High-performance German document OCR - Local & Cloud with GPU/CPU support

389 107 6
vllm-project
vllm-consul

A high-throughput and memory-efficient inference and serving engine for LLMs

384 81K 17K
jimnoneill
obsidian-umbra

Turn any Obsidian vault into a Zettelkasten graph — locally, with a dozen years of notes in minutes. 4-phase pipeline: daily splitter (Qwen3-4B) → semantic backlinks (Potion-32M) → keyword linker → synonym clustering (GTE-large + HDBSCAN). Zero cloud.

367 3 0
vllm-project
vllm-musa

A high-throughput and memory-efficient inference and serving engine for LLMs

364 81K 17K
vllm-project
vllm-npu

A high-throughput and memory-efficient inference and serving engine for LLMs

354 81K 17K
ChaokunHong
metascreener

Open-source multi-LLM ensemble tool for systematic review workflows

303 1K 48
vllm-project
vllm-hust

A high-throughput and memory-efficient inference and serving engine for LLMs

296 81K 17K
vllm-project
vllm-emissary

A high-throughput and memory-efficient inference and serving engine for LLMs

258 81K 17K
Data from PyPI, GitHub, ClickHouse, and BigQuery