PyRank

GPU Python Packages

Python packages with the GitHub topic gpu. Sorted by relevance, with stars and monthly downloads.
pytorch
torch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

87.9M 100K 28K
NVIDIA
nvidia-nccl-cu12

Optimized primitives for collective multi-GPU communication

48.6M 5K 1K
catboost
catboost

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

6.4M 9K 1K
apache
apache-tvm-ffi

Open ABI and FFI for Machine Learning Systems

6.1M 396 78
flashinfer-ai
flashinfer-python

FlashInfer: Kernel Library for LLM Serving

5M 6K 977
ashvardanian
stringzilla

Up to 100x faster strings for C, C++, CUDA, Python, Rust, Swift, JS, & Go, leveraging NEON, AVX2, AVX-512, SVE, GPGPU, & SWAR to accelerate search, hashing, sorting, edit distances, sketches, and memory ops 🦖

4.9M 3K 125
NVIDIA
nvidia-cutlass-dsl

CUDA Templates and Python DSLs for High-Performance Linear Algebra

4.8M 10K 2K
wookayin
gpustat

📊 A simple command-line utility for querying and monitoring GPU status

4.6M 4K 286
NVIDIA
nvidia-cutlass-dsl-libs-base

CUDA Templates and Python DSLs for High-Performance Linear Algebra

4.2M 10K 2K
flashinfer-ai
flashinfer-cubin

FlashInfer: Kernel Library for LLM Serving

3.5M 6K 977
NVIDIA
nvidia-cudnn-frontend

cudnn_frontend provides a C++ wrapper for the cuDNN backend API and samples showing how to use it

3.5M 723 153
cupy
cupy-cuda12x

NumPy & SciPy for GPU

2.9M 11K 1K
meta-pytorch
torchrec

PyTorch domain library for recommendation systems

2.4M 3K 644
skypilot-org
skypilot

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).

1.8M 10K 1K
nvidia
cuda-tile

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

1.6M 2K 136
isl-org
open3d

Open3D: A Modern Library for 3D Data Processing

1.5M 14K 3K
NVIDIA
warp-lang

A Python framework for GPU-accelerated simulation, robotics, and machine learning.

1.5M 7K 509
deepspeedai
deepspeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

1.2M 42K 5K
runpod
runpod

🐍 | Python library for RunPod API and serverless worker SDK.

978K 297 115
pytorch
torch-model-archiver

Serve, optimize and scale PyTorch models in production

963K 4K 886
PennyLaneAI
pennylane-lightning

The Lightning plugin ecosystem provides fast quantum state-vector and tensor network simulators written in C++ for use with PennyLane.

761K 136 54
intel-analytics
ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.

740K 9K 1K
fastai
fastai

The fastai deep learning library

651K 28K 8K
Qiskit
qiskit-aer

Aer is a high performance simulator for quantum circuits that includes noise models

625K 665 433
Data from PyPI, GitHub, ClickHouse, and BigQuery