PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Datasets Preparation Python Packages

Python packages with the GitHub topic datasets-preparation. Sorted by relevance, with stars and monthly downloads.
visual-layer
vl-datasets

Open, Clean Datasets for Computer Vision.

703 69 4
0ssamaak0
dlta-ai

DLTA-AI is the next generation of annotation tools, integrating the power of Computer Vision SOTA models to Labelme in a seamless expirence and intuitive workflow to make creating image datasets easier than ever before

396 360 40
hephaes-ai
hephaes

Turn robot data into reproducible training & eval datasets

311 3 0
nicolay-r
arekit-ss

Low Resource Context Relation Sampler for contexts with relations for fact-checking and fine-tuning your LLM models, powered by AREkit

251 4 0
nmicovic
katachi

Katachi is a Python framework for validating and processing hierarchical directory structures using YAML-based schemas. It ensures your folders and files follow expected shapes, naming rules, and relationships—before any processing begins. Use it to enforce structure, catch issues early, and keep your data pipelines reliable.

192 2 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery