PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Data Analytics Python Packages

Python packages with the GitHub topic data-analytics. Sorted by relevance, with stars and monthly downloads.
snowflakedb
snowflake-snowpark-python

Snowflake Snowpark Python API

66.8M 333 147
aiguofer
gspread-pandas

A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.

384K 408 55
apache
apache-superset-core

Apache Superset is a Data Visualization and Data Exploration Platform

151K 73K 17K
dbt-labs
dbt-mcp

A MCP (Model Context Protocol) server for interacting with dbt.

89K 564 121
llnl
llnl-hatchet

Graph-indexed Pandas DataFrames for analyzing hierarchical performance data

66K 35 19
girder
girder-worker

Distributed task execution engine with Girder integration, developed by Kitware

30K 35 32
tirthajyoti
mlr

Multiple linear regression with statistical inference, residual analysis, direct CSV loading, and other features

22K 35 10
mabel-dev
opteryx

🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives.

21K 112 14
feldera
feldera

The Feldera Incremental Computation Engine

17K 2K 119
pathwaycom
pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

12K 63K 2K
girder
girder-import-tracker

A data management platform for the web, developed by Kitware

11K 455 177
benrutter
wimsey

Easy and flexible data contracts

9K 170 2
BCG-X-Official
gamma-facet

Human-explainable AI.

8K 533 46
mabel-dev
opteryx-core

🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives.

6K 112 14
ralfbecher
orionbelt-semantic-layer-mcp

MCP server for the OrionBelt Semantic Layer — enables LLMs to explore semantic models, compile queries, and execute analytics via natural language.

4K 2 1
hatchet
hatchet

Analyze graph/hierarchical performance data using pandas dataframes

4K 119 41
feldera
dbt-feldera

The Feldera Incremental Computation Engine

4K 2K 119
unytics
bigfunctions

Supercharge BigQuery with BigFunctions

4K 757 70
denisecase
datafun-streaming

Shared Python utilities for Kafka, DuckDB, validation, stats, and visualization across streaming data analytics projects.

3K 1 0
denisecase
datafun-toolkit

Privacy-safe diagnostics, paths, and logging helpers for analytics projects.

3K 1 0
xoolive
traffic

A toolbox for processing and analysing air traffic data

3K 490 94
desbordante
desbordante

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

3K 478 100
Zen-Reportz
zen-dash

Simple, Fast, Scalable , production grade dashboard application . Right solution for team

2K 14 3
Squarespace
datasheets

Read data from, write data to, and modify the formatting of Google Sheets

2K 625 55
    • Data from PyPI, GitHub, ClickHouse, and BigQuery