PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Data Centric Ai Python Packages

Python packages with the GitHub topic data-centric-ai. Sorted by relevance, with stars and monthly downloads.
voxel51
fiftyone-db

Refine high-quality datasets and visual AI models

170K 11K 761
voxel51
fiftyone

Refine high-quality datasets and visual AI models

136K 11K 761
cleanlab
cleanlab

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

62K 11K 893
cleanlab
cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

10K 1K 82
cleanlab
cleanlab-studio

Client interface for all things Cleanlab Studio

4K 32 10
voxel51
fiftyone-db-ubuntu2204

Refine high-quality datasets and visual AI models

3K 11K 761
Hyper3Labs
hyperview

HyperView curates datasets and provides model introspection in hyperbolic and Euclidean geometries.

2K 17 3
Digital-Dermatology
selfclean

A holistic self-supervised data cleaning strategy to detect off-topic samples, near duplicates and label errors.

2K 37 2
voxel51
fiftyone-desktop

Refine high-quality datasets and visual AI models

1K 11K 761
cleanlab
cleanlab-cli

Command line interface for all things Cleanlab Studio

647 32 10
opendataval
opendataval

OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)

645 101 11
aai-institute
pydvl

pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation

495 146 10
voxel51
fiftyone-db-ubuntu2004

Refine high-quality datasets and visual AI models

482 11K 761
mdbloice
labeller

Quickly set up an image labelling web application for manually tagging images for machine learning tasks.

350 9 2
cleanlab
example-package-elisno

The standard package for data-centric AI, machine learning with label errors, and automatically finding and fixing dataset issues in Python.

344 11K 893
ear-team
bambird

BAM, unsupervised labelling function to extract and cluster similar animal vocalizations together

262 31 7
Docta-ai
docta-ai

Docta.ai

231 3K 256
JieyuZ2
ws-benchmark

a benchmark for weak supervision

222 227 34
voxel51
fiftyone-db-debian9

Refine high-quality datasets and visual AI models

189 11K 761
code-kern-ai
kern-refinery

The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.

171 1K 73
voxel51
fiftyone-db-rhel7

Refine high-quality datasets and visual AI models

144 11K 761
voxel51
fiftyone-db-ubuntu1604

Refine high-quality datasets and visual AI models

143 11K 761
code-kern-ai
refinery-python-sdk

Official Python SDK for Kern AI refinery.

125 20 3
code-kern-ai
kern-python-client

Official Python SDK for Kern AI refinery.

1 20 3
    • Data from PyPI, GitHub, ClickHouse, and BigQuery