PyRank

CPU Inference Python Packages

Python packages with the GitHub topic cpu-inference. Sorted by relevance, with stars and monthly downloads.
MekayelAnik
vllm-cpu

Wheels & Docker images for running vLLM on CPU-only systems, optimized for different CPU instruction sets

30K 6 0
FoxNoseTech
diarize

Speaker diarization for Python — "who spoke when?" CPU-only, no API keys, Apache 2.0. ~10.8% DER on VoxConverse, 8x faster than real-time.

3K 67 7
brontoguana
krasis

Krasis is a hybrid LLM runtime focused on running larger models efficiently on consumer-grade, VRAM-limited hardware

3K 452 26
MekayelAnik
vllm-cpu-avx512vnni

Wheels & Docker images for running vLLM on CPU-only systems, optimized for different CPU instruction sets

2K 6 0
MekayelAnik
vllm-cpu-avx512bf16

Wheels & Docker images for running vLLM on CPU-only systems, optimized for different CPU instruction sets

2K 6 0
MekayelAnik
vllm-cpu-amxbf16

Wheels & Docker images for running vLLM on CPU-only systems, optimized for different CPU instruction sets

2K 6 0
MekayelAnik
vllm-cpu-avx512

Wheels & Docker images for running vLLM on CPU-only systems, optimized for different CPU instruction sets

2K 6 0
HaseebKhalid1507
velocirag

Lightning-fast RAG for AI agents. ONNX-powered, 4-layer fusion, MCP server. No PyTorch.

924 6 1
laelhalawani
gguf-llama

Wrapper for simplified use of Llama2 GGUF quantized models.

774 7 1
codito
arey

Simple large language model playground app

343 6 0
Data from PyPI, GitHub, ClickHouse, and BigQuery