PyRank

Data Labeling Python Packages

Python packages with the GitHub topic data-labeling. Sorted by relevance, with stars and monthly downloads.
HumanSignal
label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

107K 27K 4K
cleanlab
cleanlab

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

62K 11K 893
segments-ai
segments-ai

Segments.ai Python SDK

41K 27 10
shoumikchow
bbox-visualizer

Make drawing and labeling bounding boxes a piece of cake

39K 413 36
doccano
auto-labeling-pipeline

doccano auto labeling pipeline helps doccano to annotate a document automatically.

14K 45 18
doccano
doccano

Open source annotation tool for machine learning practitioners.

11K 11K 2K
alteryx
composeml

A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.

9K 511 50
Toloka
toloka-kit

Toloka-Kit is a Python library for working with Toloka API.

6K 212 36
cleanlab
cleanlab-studio

Client interface for all things Cleanlab Studio

4K 32 10
doccano
doccano-client

A simple client for doccano API.

2K 86 68
davidjurgens
potato-annotation

potato: the portable annotation tool

2K 381 70
MichaelAkridge-NOAA
coral-annotation-tool

CAT: Coral Annotation Tool. A file-based (JSON) Structure from Motion (SfM) orthomosaic annotation tool for coral reef research

1K 4 1
strickvl
panlabel

The universal annotation converter

1K 15 0
langformers
langformers

🚀 Unified NLP Pipelines for Language Models

1K 19 1
phurwicz
hover

🚤 Label data at scale. Fun and precision included.

706 330 19
cleanlab
cleanlab-cli

Command line interface for all things Cleanlab Studio

647 32 10
liuxiaotong
knowlyr-datalabel

Serverless annotation framework with LLM pre-labeling, inter-annotator agreement analysis & offline HTML interface. CLI + MCP ready.

360 0 0
smrfeld
dash-annotate-cv

A Python library for computer vision annotation tasks using Dash

360 11 2
cleanlab
example-package-elisno

The standard package for data-centric AI, machine learning with label errors, and automatically finding and fixing dataset issues in Python.

344 11K 893
heartexlabs
pyheartex

Deploying machine learning for Heartex or Label Studio

277 27 7
villagecomputing
superpipe-py

build unstructured to structured data transformation pipelines

243 108 2
villagecomputing
labelkit

Superpipe - optimized LLM pipelines for structured data

180 108 2
code-kern-ai
kern-refinery

The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.

171 1K 73
ksavkin
swiftlabel

Keyboard-first image classification tool for ML practitioners

167 6 0

Data from PyPI, GitHub, ClickHouse, and BigQuery