PyRank

Data Analysis Python Packages

Python packages with the GitHub topic data-analysis. Sorted by relevance, with stars and monthly downloads.
pandas-dev
pandas

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

703.2M 49K 20K
scikit-learn
scikit-learn

scikit-learn: machine learning in Python

208.6M 66K 27K
aws
redshift-connector

Redshift Python Connector. It supports Python Database API Specification v2.0.

48.7M 218 87
statsmodels
statsmodels

Statsmodels: statistical modeling and econometrics in Python

36.7M 11K 3K
streamlit
streamlit

Streamlit: A faster way to build and share data apps.

29.2M 45K 4K
gradio-app
gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

14.8M 43K 3K
scikit-learn-contrib
imbalanced-learn

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

13.2M 7K 1K
gradio-app
gradio-client

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

9.2M 43K 3K
has2k1
plotnine

A Grammar of Graphics for Python

3.4M 5K 248
databrickslabs
dbl-tempo

API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation

3.2M 342 59
akfamily
akshare

AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.

2.8M 19K 3K
ydataai
ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

2M 14K 2K
dylan-profiler
visions

Type System for Data Analysis in Python

1.6M 218 20
scikit-hep
awkward

Manipulate JSON-like data with NumPy-like idioms.

1.4M 959 123
elementary-data
elementary-data

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

1.3M 2K 217
scikit-hep
awkward-cpp

Manipulate JSON-like data with NumPy-like idioms.

1.2M 959 123
arvkevi
kneed

Knee point detection in Python 📈

856K 809 76
pydata
pandas-datareader

Extract data from a wide range of Internet sources into a pandas DataFrame.

855K 3K 693
dfm
corner

Make some beautiful corner plots

819K 571 234
flyteorg
flyteidl

Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows.

700K 7K 819
ydataai
pandas-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

648K 14K 2K
predict-idlab
plotly-resampler

Visualize large time series data with plotly.py

523K 1K 74
reflex-dev
reflex-hosting-cli

๐Ÿ•ธ๏ธ Web apps in pure Python ๐Ÿ

517K 28K 2K
reflex-dev
reflex

๐Ÿ•ธ๏ธ Web apps in pure Python ๐Ÿ

481K 28K 2K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery