PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Open Data Python Packages

Python packages with the GitHub topic open-data. Sorted by relevance, with stars and monthly downloads.
NeurodataWithoutBorders
pynwb

A Python API for working with Neurodata stored in the NWB Format

256K 215 94
blaylockbk
herbie-data

Download numerical weather prediction datasets (HRRR, RAP, GFS, IFS, etc.) from NOMADS, NODD partners (Amazon, Google, Microsoft), ECMWF open data, and the University of Utah Pando Archive System.

82K 746 132
upgini
upgini

Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs

37K 349 26
hadesllm
morie

Multi-domain scientific computing toolkit hosting the MRM framework (Python + R).

24K 0 0
eegdash
eegdash

EEG-DaSh: an open data, tool, and compute resource. a Python library and catalog for 700+ BIDS-first EEG, MEG, fNIRS, EMG, and iEEG datasets, ML-ready via PyTorch

23K 62 8
biglocalnews
warn-scraper

Command-line interface for downloading WARN Act notices of qualified plant closings and mass layoffs from state government websites

22K 41 18
earthobservations
wetterdienst

Open weather data for humans.

21K 438 59
sentinelsat
sentinelsat

Search and download Copernicus Sentinel satellite images

20K 1K 246
basedosdados
basedosdados

⚙️ Código de manutenção do datalake (metadados e pacotes de acesso) | 📖 Docs: https://basedosdados.org/docs/home

17K 419 88
datadotworld
datadotworld

Python package for data.world

16K 103 26
siznax
wptools

Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis

15K 595 77
osPlanning
openmatrix

Open Matrix (OMX)

11K 55 18
City-Bureau
city-scrapers-core

Core functionality for City Scrapers projects

9K 8 10
datagouv
csv-detective

Inspection of tabular (csv, xls-like) files to guess the columns' content

8K 51 11
atviriduomenys
spinta

Spinta is a framework to describe, extract and publish data (a DEP Framework).

6K 20 8
blaylockbk
goes2go

Download and process GOES-16 and GOES-17 data from NOAA's archive on AWS using Python.

6K 251 43
IFCA-Advanced-Computing
pycanon

pyCANON is a Python library and CLI to assess the values of the parameters associated with the most common privacy-preserving techniques.

5K 52 9
flatland-association
fab-clientlib

Flatland Benchmarks (FAB) is an open-source web-based platform for running Benchmarks to foster Open Research

5K 3 3
Aryan-Jhaveri
statcan-mcp-server

MCP server + CLI for Statistics Canada (StatCan) — access 7,000+ Canadian statistical tables via WDS and SDMX REST APIs

5K 5 1
kayhendriksen
foehn

Download MeteoSwiss Open Government Data — weather stations, radar, hail, forecasts and climate series — via Python API, CLI, or MCP server, as DataFrames or Parquet files

4K 38 1
kensho-technologies
qwikidata

Python tools for interacting with Wikidata

4K 161 18
hadesllm
moirais

Multi-domain scientific computing toolkit hosting the MRM framework (Python + R).

3K 0 0
Jaypatel1511
cdfidata

ETL pipeline for US Treasury CDFI Fund public datasets — TLR, CLR, ILR, NMTC, and Awards data

3K 1 0
ale-saglia
cup-check

Local-first validator for Italian public project codes (CUP), with OpenCUP lookup and Python library.

3K 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery