PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Imbalanced Data Python Packages

Python packages with the GitHub topic imbalanced-data. Sorted by relevance, with stars and monthly downloads.
nickkunz
smogn

Synthetic Minority Over-Sampling Technique for Regression

14K 347 85
ZhiningLiu1998
imbalanced-ensemble

[NeurIPS'25]🛠️Class-imbalanced Ensemble Learning Toolbox. | 类别不平衡/长尾机器学习库

5K 424 59
analyticalmindsltd
smote-variants

A collection of 85 minority oversampling techniques (SMOTE) for imbalanced learning with multi-class oversampling and model selection features

4K 686 148
vishishtpriyadarshi
cobraclassifier

COBRA for Classification tasks on Imbalanced Data

577 1 0
ZhiningLiu1998
self-paced-ensemble

[ICDE'20] ⚖️ A general, efficient ensemble framework for imbalanced classification. | 泛用,高效,鲁棒的类别不平衡学习框架

289 261 49
ashishpatel26
datascienv

datascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforest that provide single line of import all required ml libraries

238 58 12
Pushp-Kharat1
pkboost

Gradient boosting that adapts to concept drift in imbalanced data

219 70 5
pradeepdev-1995
databalancer

Databalancer is the python library dedicated to balance the imbalanced text classification datasets before the model training in machine learning applications

214 7 0
pkmap
pkmap

This dataset imbalance visualization toolkit will be the beginning of a fire-new branch in NILM studies. (the website is pending)

113 1 0
tgsmith61591
skoot

A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn friendly interface in an effort to expedite the modeling process.

92 57 11
artefactory
mgs-grf

MGS-GRF for imbalanced-mixed-tabular data (AISTATS 2026 and ECML-PKDD 2025)

81 50 1
amaxiom
overnan

Oversampling for Imbalanced Learning with Missing Values

71 1 0
vishishtpriyadarshi
imbcobra

COBRA for Classification tasks on Imbalanced Data

70 1 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery