PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Data Synthesis Python Packages

Python packages with the GitHub topic data-synthesis. Sorted by relevance, with stars and monthly downloads.
reactor-no8
neots

NeoTextSynthesizer is a high-performance OCR training data generator.

14K 1 0
vkit-x
vkit-nightly

Boosting Document Intelligence

3K 23 1
Open-DataFlow
open-dataflow

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

3K 4K 378
DIYer22
bpycv

Computer vision utils for Blender.

2K 501 60
Open-DataFlow
open-dataflow-adp

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

841 4K 378
sebhaan
tabpfgen

TabPFGen: Synthetic Tabular Data Generation with TabPFN

522 41 6
EtienneChollet
oct-vesselseg

A Label-Free and Data-Free Synthesis Engine and Training Framework for Vascular Segmentation of sOCT Data with PyTorch.

293 6 0
open-sciencelab
graphg

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

236 1K 82
ArenaGrenade
bpycv3d

Blender Python Package for extracting internal data from blender scenes for 3d related data generation purposes.

227 6 0
MatthewCYM
gense

Official implementaion of EMNLP 2022 paper "Generate, Discriminate, and Contrast: A Semi-Supervised Sentence Representation Learning Framework"

73 23 1
vkit-x
vkit

Boosting Document Intelligence

60 23 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery