PyRank

LLM Infra Python Packages

Python packages with the GitHub topic llm-infra. Sorted by relevance, with stars and monthly downloads.
thu-ml/sageattention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves a speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

160K 3K 416
    • Data from PyPI, GitHub, ClickHouse, and BigQuery