PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Synthetic Python Packages

Python packages with the GitHub topic synthetic. Sorted by relevance, with stars and monthly downloads.
WenjieDu
pygrinder

PyGrinder: a Python toolkit for grinding data beans into the incomplete for real-world data simulation by introducing missing values with different missingness patterns, including MCAR (complete at random), MAR (at random), MNAR (not at random), sub sequence missing, and block missing

122K 66 6
DLR-RM
blenderproc

A procedural Blender pipeline for photorealistic training image generation

5K 4K 508
Baukebrenninkmeijer
table-evaluator

Evaluate real and synthetic datasets against each other

5K 92 28
Belval
trdg

A synthetic data generator for text recognition

3K 4K 1K
kontextox
datasety

CLI tool for dataset preparation: resize, align, caption, shuffle, synthetic, and mask generation.

2K 2 0
MattyB95
jabberjay

🦜 Synthetic Voice Detection

2K 10 1
rasinmuhammed
misata

High-performance open-source synthetic data engine. Uses LLMs for schema design and vectorized NumPy for deterministic, scalable generation.

1K 55 3
OllieBoyne
blendersynth

Synthetic Rendering for Blender

1K 96 10
ZumoLabs
zpy-zumo

Synthetic data for computer vision. An open source toolkit using Blender and Python.

1K 321 35
clovaai
synthtiger

Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021

1K 573 109
AmadeusITGroup
synthetic-face-masks

A Python library for generating synthetic face datasets with facial region masks between different face images. This tool is designed for creating training datasets for computer vision and machine learning applications.

1K 0 0
instana
synctl

CLI Tool for Synthetic Monitoring to Manage Synthetic Test and Locations Easily

954 7 1
eqasim-org
synpp

Synthetic population pipeline code for eqasim

920 21 17
meta-llama
synthetic-data-kit

Tool for generating high quality Synthetic datasets

883 2K 219
tellae
bhepop2

Synthetic population enrichment from aggregated data

839 2 1
alfurka
synloc

A Python package to create synthetic data from locally estimated distributions

724 3 0
nhsengland
nhssynth

Synthetic data generation pipeline leveraging a Differentially Private Variational Auto Encoder assessed using a variety of metrics

615 5 5
OmarSamirz
iftg

IFTG (ImageFromTextGenerator) is a Python package that simplifies creating robust datasets for OCR models. Generate images from text, apply over 10 built-in noise effects, and customize fonts and layouts. IFTG supports all languages and offers endless noise combinations, including custom noise creation.

481 21 2
NLR-Distribution-Suite
nrel-shift

Python package for developing power distribution model using opensource data.

341 6 1
WenjieDu
pycorruptor

PyGrinder: a Python toolkit for grinding data beans into the incomplete for real-world data simulation by introducing missing values with different missingness patterns, including MCAR (complete at random), MAR (at random), MNAR (not at random), sub sequence missing, and block missing

329 66 6
finos
datahub-core

DataHub - Synthetic data library

234 80 11
DocsaidLab
wordcanvas-docsaid

Generating text with custom fonts and styles.

176 0 0
finos
datahub-core-grovesy

DataHub - Synthetic data library

162 80 11
dynatrace-oss
db-load-generator

Mock database activity and run scalable simulations of database load with as little code as necessary

1 6 2
    • Data from PyPI, GitHub, ClickHouse, and BigQuery