PyRank

Attention Python Packages

Python packages tagged with the GitHub topic "attention", sorted by relevance and shown with stars and monthly downloads.
sgl-project
sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

303.2M 28K 6K
flashinfer-ai
flashinfer-python

FlashInfer: Kernel Library for LLM Serving

5M 6K 977
flashinfer-ai
flashinfer-cubin

FlashInfer: Kernel Library for LLM Serving

3.5M 6K 977
NVIDIA
nvidia-cudnn-frontend

cudnn_frontend provides a C++ wrapper for the cuDNN backend API and samples on how to use it

3.5M 723 153
sgl-project
sglang-kernel

SGLang is a high-performance serving framework for large language models and multimodal models.

330K 28K 6K
sgl-project
sgl-kernel

SGLang is a high-performance serving framework for large language models and multimodal models.

295K 28K 6K
thu-ml
sageattention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

160K 3K 416
lucidrains
dreamer4

Implementation of Danijar's latest iteration for his Dreamer line of work

12K 186 18
CyberZHG
keras-transformer

Transformer implemented in Keras

9K 368 94
lucidrains
performer-pytorch

An implementation of Performer, a linear attention-based transformer, in Pytorch

9K 1K 150
CyberZHG
keras-position-wise-feed-forward

Feed forward layer implemented in Keras

7K 8 5
leondgarse
keras-cv-attention-models

Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenext,efficientformer,efficientnet,eva,fasternet,fastervit,fastvit,flexivit,gcvit,ghostnet,gpvit,hornet,hiera,iformer,inceptionnext,lcnet,levit,maxvit,mobilevit,moganet,nat,nfnets,pvt,swin,tinynet,tinyvit,uniformer,volo,vanillanet,yolor,yolov7,yolov8,yolox,gpt2,llama2, alias kecam

7K 625 97
ai4co
rl4co

A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)

6K 872 147
densechen
activations

AReLU: Attention-based-Rectified-Linear-Unit

5K 62 8
sgl-project
sglang-kt

SGLang is a high-performance serving framework for large language models and multimodal models.

4K 28K 6K
lucidrains
transfusion-pytorch

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

4K 1K 72
kyegomez
qwen

My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't released model code yet sooo...

3K 13 3
lucidrains
native-sparse-attention-pytorch

Native Sparse Attention

3K 805 52
labmlai
labml-nn

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

2K 67K 7K
rhoadesScholar
rotary-spatial-embeddings

PyTorch implementation of Rotary Spatial Embeddings

2K 7 1
ddbourgin
numpy-ml

Machine learning, in numpy

2K 16K 4K
lucidrains
spear-tts-pytorch

Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch

2K 277 20
sovit-123
vision-transformers

Vision Transformers for image classification, image segmentation, and object detection.

2K 68 9
lucidrains
h-transformer-1d

Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning

2K 166 22
Data from PyPI, GitHub, ClickHouse, and BigQuery