PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Data Collection Python Packages

Python packages with the GitHub topic data-collection. Sorted by relevance, with stars and monthly downloads.
airbytehq
airbyte-source-declarative-manifest

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

324K 21K 5K
airbytehq
airbyte-source-facebook-marketing

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

23K 21K 5K
airbytehq
airbyte-source-google-ads

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

20K 21K 5K
airbytehq
airbyte-source-s3

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

19K 21K 5K
chapmanjacobd
xklb

xk library

17K 477 15
airbytehq
airbyte-source-salesforce

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

16K 21K 5K
airbytehq
airbyte-source-github

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

14K 21K 5K
airbytehq
airbyte-source-shopify

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

12K 21K 5K
Altimis
scweet

Scrape tweets, profiles, followers and following from Twitter/X, no API key needed. Python library with smart multi-account pooling, proxy support and async.

11K 1K 270
oxylabs
oxylabs-mcp

Official Oxylabs MCP integration

11K 95 24
airbytehq
airbyte-source-google-sheets

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

11K 21K 5K
airbytehq
airbyte-source-zendesk-support

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

10K 21K 5K
airbytehq
airbyte-source-faker

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

9K 21K 5K
airbytehq
airbyte-source-google-drive

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

9K 21K 5K
airbytehq
airbyte-source-bing-ads

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

8K 21K 5K
airbytehq
airbyte-source-google-analytics-data-api

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

8K 21K 5K
airbytehq
airbyte-source-gcs

Source implementation for Gcs.

8K 21K 5K
airbytehq
airbyte-source-marketo

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

8K 21K 5K
airbytehq
airbyte-source-hubspot

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

8K 21K 5K
airbytehq
airbyte-source-stripe

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

7K 21K 5K
airbytehq
airbyte-source-jira

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

7K 21K 5K
airbytehq
airbyte-source-google-search-console

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

7K 21K 5K
airbytehq
airbyte-source-vantage

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

7K 21K 5K
airbytehq
airbyte-source-file

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

6K 21K 5K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery