PyRank

Pandas Python Packages

Python packages with the GitHub topic pandas, sorted by relevance and shown with GitHub stars and monthly downloads.
pandas-dev
pandas

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

702.9M 49K 20K
tqdm
tqdm

⚡ A Fast, Extensible Progress Bar for Python and CLI

481.2M 31K 1K
huggingface
datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

127M 22K 3K
narwhals-dev
narwhals

Lightweight and extensible compatibility layer between dataframe libraries!

86.3M 2K 193
aws
awswrangler

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

86.1M 4K 729
jmcnamara
xlsxwriter

A Python module for creating Excel XLSX files.

82.1M 4K 665
mwaskom
seaborn

Statistical data visualization in Python

51.1M 14K 2K
dask
dask

Parallel computing with task scheduling

25M 14K 2K
delta-io
deltalake

A native Rust library for Delta Lake, with bindings into Python

24.2M 3K 619
pydata
xarray

N-D labeled arrays and datasets in Python

20.5M 4K 1K
ranaroussi
yfinance

Download market data from Yahoo! Finance's API

19.1M 24K 3K
geopandas
geopandas

Python tools for geographic data

17.8M 5K 1K
pandera-dev
pandera

A light-weight, flexible, and expressive statistical data testing library

8.8M 4K 397
jmcarpenter2
swifter

A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner

7.3M 3K 104
dimastbk
python-calamine

Python binding for calamine, a Rust library for reading Excel and ODF files.

4M 461 14
robin900
gspread-dataframe

Read/write Google spreadsheets using pandas DataFrames

3.7M 260 24
ToucanToco
fastexcel

A fast Excel reader for Rust and Python

3.3M 230 20
databrickslabs
dbl-tempo

API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation

3.3M 342 59
thombashi
pytablewriter

pytablewriter is a Python library to write a table in various formats: AsciiDoc / CSV / Elasticsearch / HTML / JavaScript / JSON / LaTeX / LDJSON / LTSV / Markdown / MediaWiki / NumPy / Excel / Pandas / Python / reStructuredText / SQLite / TOML / TSV.

3.2M 631 47
chezou
tabula-py

Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame

2.7M 2K 303
amirziai
flatten-json

Flatten JSON in Python

2.6M 553 97
capitalone
datacompy

Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!

2.6M 643 160
fugue-project
fugue

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

2.5M 2K 100
modin-project
modin

Modin: Scale your Pandas workflows by changing a single line of code

2.4M 10K 673
    • Data from PyPI, GitHub, ClickHouse, and BigQuery