PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Kimi Python Packages

Python packages with the GitHub topic kimi. Sorted by relevance, with stars and monthly downloads.
vllm-project
vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

6.2M 81K 17K
lightseekorg
tokenspeed-mla

TokenSpeed is a speed-of-light LLM inference engine.

236K 1K 94
vllm-project
vllm-tpu

A high-throughput and memory-efficient inference and serving engine for LLMs

170K 81K 17K
ThreeFish-AI
coding-proxy

A High-Availability, Transparent, and Smart Multi-Vendor Proxy for Claude Code. Support Claude Plans, GitHub Copilot, Google Antigravity, ZAI/GLM, MiniMax, Qwen, Xiaomi, Kimi, Doubao...

6K 16 2
Shelpuk-AI-Technology-Consulting
kitty-bridge

Universal LLM bridge for AI agents. Use Claude Code with MiniMax, Codex with GLM, or Gemini CLI with OpenRouter — one command, any provider. Works with coding agents, OpenClaw, Hermes, and others.

6K 10 3
lightseekorg
tokenspeed-smg

TokenSpeed is a speed-of-light LLM inference engine.

5K 1K 94
TUNC-AI
tunc-clm

Append-only handoff format for multi-session AI threads. Linear write cost at depth, full audit lineage. Any AI family.

2K 1 2
vllm-project
ai-dynamo-vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

620 81K 17K
LLMPages
llm-onesdk

OneSDK is a Python library that provides a unified interface for interacting with various Large Language Model (LLM) providers.

551 2 0
vllm-project
vllm-acc

A high-throughput and memory-efficient inference and serving engine for LLMs

536 81K 17K
vllm-project
vllm-xft

A high-throughput and memory-efficient inference and serving engine for LLMs

510 81K 17K
KevRojo
ia-web-parser

Turn any AI web-chat provider into a tool-capable streaming API. Harvest once, chat forever — with persistent Playwright browser profiles and automatic cookie management.

428 2 0
vllm-project
wxy-test

A high-throughput and memory-efficient inference and serving engine for LLMs

407 2K 1K
vllm-project
nextai-vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

406 81K 17K
floomhq
floom-sdk

Turn Python functions into web apps. Type hints become UI, API, and shareable links.

391 1 1
vllm-project
vllm-consul

A high-throughput and memory-efficient inference and serving engine for LLMs

384 81K 17K
vllm-project
vllm-musa

A high-throughput and memory-efficient inference and serving engine for LLMs

364 81K 17K
vllm-project
vllm-npu

A high-throughput and memory-efficient inference and serving engine for LLMs

354 81K 17K
Amanbig
devorch

A terminal-native, multi-provider intelligent assistant that plans, executes, and tracks developer tasks, not just answers prompts, similar to Claude Code and Gemini CLI.

315 4 0
vllm-project
vllm-hust

A high-throughput and memory-efficient inference and serving engine for LLMs

296 81K 17K
vllm-project
vllm-emissary

A high-throughput and memory-efficient inference and serving engine for LLMs

258 81K 17K
vllm-project
vllm-online

A high-throughput and memory-efficient inference and serving engine for LLMs

234 81K 17K
vllm-project
vllm-usf

A high-throughput and memory-efficient inference and serving engine for LLMs

233 81K 17K
shibing624
chatpilot

ChatPilot: Chat Agent Web UI,实现Chat对话前端,支持Google搜索、文件网址对话(RAG)、代码解释器功能,复现了Kimi Chat(文件,拖进来;网址,发出来)。

220 600 60
    • Data from PyPI, GitHub, ClickHouse, and BigQuery