PyRank

Data Versioning Python Packages

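Python packages tagged with the GitHub topic data-versioning, sorted by relevance. Each entry gives the GitHub owner, the package name, a description, and a row of counts that includes monthly downloads and stars.
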
wandb
wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

26.2M 11K 872
treeverse
lakefs-sdk

lakeFS - Data version control for your data lake | Git for data

1M 5K 447
treeverse
lakefs

lakeFS - Data version control for your data lake | Git for data

927K 5K 447
laminlabs
lamindb

Open-source data lakehouse for biology. Context and memory for datasets and models at scale, across infrastructure. Query, trace & validate with a lineage-native lakehouse that supports bio-formats, registries & ontologies. 🍊YC S22

95K 264 25
treeverse
lakefs-client

lakeFS - Data version control for your data lake | Git for data

81K 5K 447
quiltdata
quilt3

Quilt is a Scientific Data Management Platform on AWS that helps teams and AI find, trust, and reuse data through deeply versioned, context-rich data packages.

26K 1K 90
laminlabs
lamindb-core

Open-source data lakehouse for biology. Context and memory for datasets and models at scale, across infrastructure. Query, trace & validate with a lineage-native lakehouse that supports bio-formats, registries & ontologies. 🍊YC S22

22K 264 25
layerai
layer

Metadata store for Production ML

6K 87 6
quiltdata
quilt

Quilt is a Scientific Data Management Platform on AWS that helps teams and AI find, trust, and reuse data through deeply versioned, context-rich data packages.

4K 1K 90
BemiHQ
bemi-sqlalchemy

Automatic data change tracking for SQLAlchemy

2K 6 0
data-as-code
dac

Python Data as Code core implementation

2K 12 1
BemiHQ
bemi-django

Automatic data change tracking for Django

2K 6 0
wandb
wandb-ng

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

855 11K 872
MaratSaidov
artificial-detection

Python framework for artificial text detection: NLP approaches to compare natural text against generated by neural networks.

430 16 1
eliask
farchive

Local content-addressed archive with observation history. Stores bytes by SHA-256, preserves locator state as contiguous spans, compresses with zstd and corpus-trained dictionaries. SQLite-backed.

264 7 1
treeverse
lakefs-sdk-async

lakeFS API

263 5K 447
wandb
wandb-testing

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

231 11K 872
NewronAI
newron

Newron is a data-centric ML platform to easily build, manage, deploy and continuously improve models through data driven development.

150 3 4
NewronAI
newron-sdk

Newron is a data-centric ML platform to easily build, manage, deploy and continuously improve models through data driven development.

136 3 4
quiltdata
quilt-installer

Quilt is a Scientific Data Management Platform on AWS that helps teams and AI find, trust, and reuse data through deeply versioned, context-rich data packages.

135 1K 90
quiltdata
quilt-stack-installer

Quilt Data installation tool

91 1K 90
wandb
custom-wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

50 11K 872
wandb
tendb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

47 11K 872
wandb
wandb-zc

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

34 11K 872

Data from PyPI, GitHub, ClickHouse, and BigQuery.