PyRank

Data Annotation Python Packages

Python packages with the GitHub topic data-annotation. Sorted by relevance, with stars and monthly downloads.
cleanlab
cleanlab

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

62K 11K 893
rsgoncalves
text2term

A tool for mapping free-text descriptions of entities to ontology terms

3K 20 6
strickvl
panlabel

The universal annotation converter

1K 15 0
pixano
pixano

Data-centric AI building blocks for computer vision applications

785 59 12
ufal
factgenie

Lightweight self-hosted span annotation tool

378 43 8
cleanlab
example-package-elisno

The standard package for data-centric AI, machine learning with label errors, and automatically finding and fixing dataset issues in Python.

344 11K 893
fastent
fastent

Custom models for named-entity recognition

314 6 2
liamtoran
flippers

`flippers` is a weak supervision library for creating high-quality labels using domain knowledge and heuristics.

249 4 1
explosion
jupyterlab-prodigy

🧬 A JupyterLab extension for annotating data with Prodigy

78 189 24
    • Data from PyPI, GitHub, ClickHouse, and BigQuery