PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Rl Python Packages

Python packages with the GitHub topic rl. Sorted by relevance, with stars and monthly downloads.
pytorch
torchrl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

1.6M 3K 455
JudgmentLabs
judgeval

The Continuous-Improvement Stack for Agents. Our environment data and evals power agent improvement and monitoring.

479K 1K 93
Stable-Baselines-Team
sb3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

296K 716 240
neptune-ai
neptune

πŸ“˜ The experiment tracker for foundation model training

179K 623 75
thu-ml
tianshou

An elegant PyTorch deep reinforcement learning library.

134K 11K 1K
hud-evals
hud-python

OSS RL environment + evals toolkit

66K 254 57
neptune-ai
neptune-client

πŸ“˜ The experiment tracker for foundation model training

42K 623 75
pytorch
torchrl-nightly

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

40K 3K 455
google
dopamine-rl

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

35K 11K 1K
axon-rl
gem-llm

A Gym for Agentic LLMs

23K 487 33
google-research
rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

21K 872 49
jiauzhang
torchstudio

Deep Learning Experiment

7K 2 0
yamoling
multi-agent-rlenv

Strongly typed reinforcement learning environment framework

7K 1 1
instadeepai
flashbax

⚑ Flashbax: Accelerated Replay Buffers in JAX

6K 279 23
DLR-RM
rl-zoo3

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

6K 3K 598
epignatelli
navix

Accelerated minigrid environments with JAX

5K 170 21
inclusionAI
awex

A high-performance RL training-inference weight synchronization framework, designed to enable second-level parameter updates from training to inference in RL workflows

4K 150 17
pfeinsper
dsse

The Drone Swarm Search project provides an environment for SAR missions built on PettingZoo, where agents, represented by drones, are tasked with locating targets identified as shipwrecked individuals.

4K 72 14
abundant-ai
oddish

Run Harbor tasks in the cloud

3K 2 1
gbionics
amp-rsl-rl

πŸ” AMP-RSL-RL: Adversarial Motion Priors for robotic RL (PPO + motion imitation)

2K 312 25
lguibr
trianglengin

The core logic for the Triangle Puzzle game. Features a fast C++ backend, Pybind11 wrappers, and a Python API designed for AI/ML development and simulation.

2K 0 0
luccabb
moonfish

~2000 Elo Python Chess Engine that implements: Negamax, PeSTO’s Evaluation, Null Move, Quiescence Search, Lazy SMP.

2K 25 4
sintefneodroid
neodroid

Python interface for the Neodroid platform, an API for communicating with a Unity Game process for a feedback response loop

2K 8 5
inclusionAI
aenvironment

Standardized environment infrastructure for Agentic AI development.

1K 301 36
    • Data from PyPI, GitHub, ClickHouse, and BigQuery