PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Openai Api Python Packages

Python packages with the GitHub topic openai-api. Sorted by relevance, with stars and monthly downloads.
ShishirPatil
bfcl-eval

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

285K 13K 1K
xtekky
g4f

The official gpt4free repository | various collection of powerful language models | opus 4.6 gpt 5.3 kimi 2.5 deepseek v3.2 gemini 3

166K 66K 14K
raullenchai
rapid-mlx

The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s cached TTFT, 100% tool calling. 17 tool parsers, prompt cache, reasoning separation, cloud routing. Drop-in OpenAI replacement. Works with Claude Code, Cursor, Aider.

48K 2K 287
xorbitsai
xinference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

41K 9K 824
jjang-ai
vmlx

vMLX - JANGTQ Uber Compressed MLX Models - L2 Disk Cache (survives restart) + L1 Paged (super fast ttft) + Hybrid SSM Scheduler + Cont Batching + etc!

35K 510 62
BrainBlend-AI
atomic-agents

Building AI agents, atomically

24K 6K 511
sipsalabs
ultracompress

Lossless 5-bit transformer compression. 22 architectures shipped (0.6B-405B incl. dense + MoE + SSM), 14 PPL-verified. Hermes-3-405B 1.0066x, Mistral-7B 1.00548x, Mixtral-8x7B 1.00368x. SHA-256-verifiable bit-identical reconstruction. OpenAI-compatible API at api.sipsalabs.com. pip install ultracompress

10K 12 0
codelion
optillm

Optimizing inference proxy for LLMs

7K 4K 319
rohitgarg19
opencode-llmstack

Cursor-Auto / Claude-tier-style serving for local GGUF models on Mac (M4 Max, 64 GB). FastAPI router fronts llama-swap + llama.cpp, classifying each request into a coder, planner, or uncensored-planner tier. OpenAI-compatible API, opencode integration, per-project subshell, one `llmstack` console-script.

6K 0 0
Elijas
token-throttle

Multi-resource rate limiting for LLM APIs. Reserve tokens before you call, refund what you don't use, stay under the limit across workers.

6K 18 2
Oaklight
llm-rosetta

Production-ready LLM API translation layer for Python — bidirectional conversion between OpenAI, Anthropic & Google formats via hub-and-spoke IR. Optional API gateway. Streaming & non-streaming. Zero core deps. Contributions welcome!

6K 21 1
microsoft
lida

Automatic Generation of Visualizations and Infographics using Large Language Models

6K 3K 379
TeamKillerX
ryzenth

Ryzenth is a flexible Multi-API SDK with built-in support for API key management and database integration.

5K 3 1
orkunkinay
openai-cost-calculator

Calculate exact USD cost of each OpenAI API call — no guesswork.

5K 14 6
Aquiles-ai
aquiles-image

A high-performance, memory-efficient inference server for diffusion models, compatible with the OpenAI client

4K 20 0
nazdridoy
ngpt

🤖 nGPT - A lightning-fast CLI tool that brings any OpenAI-compatible LLM (OpenAI, Ollama, Groq, Claude, Gemini) directly to your terminal. Generate code, craft git commits, execute shell commands, rewrite text, and chat interactively, all with seamless provider switching and real-time streaming.

4K 46 4
madroidmaq
mlx-omni-server

MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seamless integration with existing OpenAI SDK clients while leveraging the power of local ML inference.

3K 716 87
Sxvxgee
unlimitedgpt

An unofficial Python wrapper for OpenAI's ChatGPT API

3K 419 43
berkayildi
mcp-content-pipeline

MCP server for YouTube video analysis and X feed digests

3K 0 0
project-david-ai
projectdavid-platform

Deployment orchestrator for the Project David / Entities platform

3K 1 0
sethbang
adaptive-rate-limiter

Provider-agnostic adaptive rate limiting for AI/ML APIs with intelligent scheduling, streaming support, and distributed backends

3K 1 0
Ascyt
ezgpt

Python library for easier GPT usage than the default openai library.

2K 3 1
maemreyo
omnivoice-server

OpenAI-compatible HTTP server for OmniVoice text-to-speech

2K 49 17
The-Cloud-Clockwork
agentibridge

MCP server that indexes Claude Code CLI transcripts and exposes them via 16 tools — search, semantic search, dispatch, memory, and more

2K 1 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery