PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Rlhf Python Packages

Python packages with the GitHub topic rlhf. Sorted by relevance, with stars and monthly downloads.
hiyouga
llamafactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

235K 71K 9K
transformerlab
transformerlab

The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.

126K 5K 511
THUDM
image-reward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

28K 2K 92
transformerlab
transformerlab-cli

The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.

13K 5K 511
recognai
rubrix

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

11K 5K 486
agentscope-ai
py-openjudge

OpenJudge: A Unified Framework for Holistic Evaluation and Quality Rewards

10K 605 50
agentscope-ai
trinity-rft

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

4K 629 67
Kiln-AI
kiln-ai

Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, and more.

3K 5K 366
argilla-io
argilla-server

Open-source tool for exploring, labeling, and monitoring data for NLP projects.

2K 5K 485
Goekdeniz-Guelmez
mlx-lm-lora

Train LLMs on Apple silicon with MLX and the Hugging Face Hub

2K 335 42
voidful
textrl

TextRL - reinforcement learning for text generation, built on HuggingFace TRL.

2K 564 61
log10-io
log10-io

Python client library for improving your LLM app accuracy

1K 96 13
Kiln-AI
kiln-server

Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, and more.

1K 5K 366
hiyouga
llmtuner

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

1K 71K 9K
dannylee1020
openpo

Build high quality synthetic datasets with AI feedback from 200+ LLMs

880 27 0
hiyouga
lazyllm-llamafactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

667 71K 9K
sail-sg
oat-llm

Online AlignmenT (OAT) for LLMs.

631 653 63
huggingface
alignment-handbook

Robust recipes to align language models with human and AI preferences

406 6K 489
CyberAgentAILab
aepo

Code of Annotation-Efficient Language Model Alignment via Diverse and Representative Response Texts (EMNLP Findings 2025)

397 10 1
liuxiaotong
knowlyr-datalabel

Serverless annotation framework with LLM pre-labeling, inter-annotator agreement analysis & offline HTML interface. CLI + MCP ready.

358 0 0
argilla-io
argilla-v1

Open-source tool for exploring, labeling, and monitoring data for NLP projects.

303 5K 485
xrsrke
instruct-goose

Implementation of Reinforcement Learning from Human Feedback (RLHF)

296 174 21
michaelellis003
lmxlab

Transformer language models on Apple Silicon with MLX

269 1 0
TUDB-Labs
mlora-cli

An Efficient "Factory" to Build Multiple LoRA Adapters

244 378 66
    • Data from PyPI, GitHub, ClickHouse, and BigQuery