PyRank

Train Test Split Python Packages

Python packages with the GitHub topic train-test-split. Sorted by relevance, with stars and monthly downloads.
yu9824
kennard-stone

An algorithm for partitioning data evenly.

2K 12 1
marmurar
jano

Temporal partitioning and backtesting for time-correlated datasets

1K 2 1
ODAncona
bboxconverter

Converting bounding box annotations to popular formats like a breeze.

491 27 1
graph-part
graph-part

GraphPart, a data partitioning method for ML on biological sequences

428 35 6
burning-cost
insurance-cv

Temporal cross-validation for insurance pricing - respects policy time structure, CatBoost, Polars

360 0 0
michaelscutari
protclust

Python tools for protein sequence clustering and dataset splitting

225 4 0
maksymsur
spltr

`Spltr` is a simple PyTorch-based data loader and splitter. It can load arrays, matrices, Pandas DataFrames, or CSV files containing numerical data and then split them into train and test (validation) subsets in the form of PyTorch DataLoader objects.

158 1 0
emilelampe
maestros

Multi-label stratified splits, while preserving group independence. Includes a stratification chart and report.

113 1 0
bharatadk
python-splitter

📁 Repo for the python_splitter Python package. It automatically splits images into train, test, and validation folders, shuffling the media/images for machine learning.

110 12 4
ODAncona
bboxtools-2

This library allows reading and converting bounding box annotations in many popular formats

79 27 1
michaelscutari
mmseqspy

protclust is a Python library for protein sequence analysis that integrates MMseqs2 for fast clustering and provides tools for creating robust machine learning datasets. It offers cluster-aware data splitting to prevent sequence similarity bias in model evaluation, along with comprehensive protein embedding capabilities for feature generation.

77 4 0
Data from PyPI, GitHub, ClickHouse, and BigQuery