PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Instruction Tuning Python Packages

Python packages with the GitHub topic instruction-tuning. Sorted by relevance, with stars and monthly downloads.
hiyouga
llamafactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

235K 71K 9K
bespokelabsai
bespokelabs-curator

Synthetic data curation for post-training and structured data extraction

46K 2K 141
ContextualAI
gritlm

Generative Representational Instruction Tuning

15K 691 50
datajuicer
py-data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

5K 6K 371
datadreamer-dev
datadreamer-dev

Prompt. Generate Synthetic Data. Train & Align Models.

1K 1K 59
snowmuffin
convmerge

Merge heterogeneous chat/text sources into a single LLM training format (JSONL)

1K 0 1
hiyouga
llmtuner

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

1K 71K 9K
sileod
tasksource

Datasets collection and preprocessings framework for NLP extreme multitask learning

1K 195 11
haotian-liu
llava-torch

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

954 25K 3K
hiyouga
lazyllm-llamafactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

667 71K 9K
vincentzed
decontaminate

`decon`, but with python API binding.

649 3 0
zhuang-li
scar-tool

[ACL 2025 main] SCAR: Data Selection via Style Consistency-Aware Response Ranking for Efficient Instruction-Tuning of Large Language Models

283 40 4
stef41
castwright

Generate synthetic instruction-tuning data from seed examples. Simple API, built-in quality filtering, multi-provider.

282 1 0
Luodian
otter-ai

Otter: A Multi-Modal Model with In-Context Instruction Tuning

243 3K 212
simplifine-llm
simplifine-alpha

🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨

237 96 4
mohammedaly22
vibeprompt

🦩VibePrompt. A lightweight Python package for adapting prompts by tone, style, and audience. Built on top of LangChain, VibePrompt supports multiple LLM providers and enables structured, customizable prompt transformations for developers, writers, and researchers.

229 9 0
hiyouga
llamafactory-songlab

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

222 71K 9K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery