PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Data Exploration Python Packages

Python packages with the GitHub topic data-exploration. Sorted by relevance, with stars and monthly downloads.
ydataai
ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

2M 14K 2K
ydataai
pandas-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

648K 14K 2K
Kanaries
pygwalker

PyGWalker: Turn your dataframe into an interactive UI for visual analysis

285K 16K 863
fbdesignpro
sweetviz

Visualize and compare datasets, target values and associations, with one line of code.

144K 3K 288
polyaxon
traceml

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

134K 533 47
mouradmourafiq
pandas-summary

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

112K 533 47
polyaxon
datatile

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

112K 533 47
InfuseAI
piperider-nightly

Code review for data in dbt

15K 494 23
panel-extensions
panel-graphic-walker

A project providing a Graphic Walker Pane for use with HoloViz Panel.

14K 353 14
sfu-db
dataprep

Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.

12K 2K 224
cleanlab
cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

10K 1K 82
Data-Centric-AI-Community
fg-data-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

10K 14K 2K
copyleftdev
x12-edi-tools

A comprehensive set of tools for working with X12 EDI files

8K 25 6
ironmussa
optimuspyspark

:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

4K 2K 232
abhayspawar
featexp

Feature exploration for supervised learning

3K 759 160
InfuseAI
piperider

Code review for data in dbt

3K 494 23
desbordante
desbordante

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

3K 478 100
debiai
debiai-gui

DebiAI easy start module, the standalone version of DebiAI

3K 30 5
comet-ml
kangas

🦘 Explore multimedia datasets at scale

2K 1K 50
tvdboom
atom-ml

A Python package for fast exploration of machine learning pipelines

2K 164 15
Renumics
sliceguard

A library for detecting problematic data segments in structured and unstructured data with few lines of code.

1K 63 3
grafana-toolbox
grafana-wtf

Grep through all Grafana entities in the spirit of git-wtf.

1K 220 22
eikevons
pandas-paddles

Access the parent Pandas data frame in loc[], iloc[], assign(), and others Pandas helpers

982 5 0
Data-Centric-AI-Community
datakit-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

820 14K 2K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery