PyRank

Reward Shaping Python Packages

Python packages with the GitHub topic reward-shaping. Sorted by relevance, with stars and monthly downloads.
lucidrains
host-pytorch

Implementation of Humanoid Standing Up, from the paper "Learning Humanoid Standing-up Control across Diverse Postures" out of Shanghai, in PyTorch

4K 45 5
audieleon
goodhart

Catch reward traps before training. Named after Goodhart's Law.

2K 0 0
haizelabs
verdict

Inference-time scaling for LLMs-as-a-judge.

2K 339 26
Digitalized-Energy-Systems
opfgym

Reinforcement Learning environments for learning the Optimal Power Flow

206 29 3
takato86
shaner

A package for shaping RL agents.

102 3 1
Data from PyPI, GitHub, ClickHouse, and BigQuery