PyRank

Flash Attention Python Packages

Python packages with the GitHub topic flash-attention. Sorted by relevance, with stars and monthly downloads.
NVIDIA
nvidia-cudnn-frontend

cudnn_frontend provides a C++ wrapper for the cuDNN backend API, along with samples showing how to use it

3.5M 723 153
xlite-dev
ffpa-attn

🤖 FFPA: Extends FlashAttention-2 (forward and backward) via Split-D for large head dimensions, delivering a 1.5× to 3× speedup over SDPA. 🎉

6K 293 17
HKUSTDial
flash-sparse-attn

Trainable fast and memory-efficient sparse attention

595 681 56
davidkny22
easywheels

Smart GPU wheel installer. Auto-detects CUDA, GPU, torch, and Python.

501 0 0
kyegomez
flashmha

A simple PyTorch implementation of Flash MultiHead Attention

441 22 4
DAMO-NLP-SG
inf-cl

[CVPR 2025 Highlight] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A highly memory-efficient CLIP training scheme.

314 286 12
ot-triton-lab
flash-sinkhorn

Sinkhorn optimal transport kernels in PyTorch + Triton (squared Euclidean, no cost matrix materialization).

302 194 20
Mapika
gpkg

GPU package manager — find prebuilt CUDA wheels, build missing ones

257 0 0
erfanzar
jax-flash-attn2

A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/Pallas/JAX).

165 34 1
egaoharu-kensei
flash-attention-triton

Cross-platform FlashAttention-2 Triton implementation for Turing+ GPUs with custom configuration mode

165 26 0
SmallDoges
flash-dmattn

Trainable fast and memory-efficient sparse attention

138 690 56
    • Data from PyPI, GitHub, ClickHouse, and BigQuery