PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Scraper Python Packages

Python packages with the GitHub topic scraper. Sorted by relevance, with stars and monthly downloads.
rushter
selectolax

Python binding to Modest and Lexbor engines. Fast HTML5 parser with CSS selectors for Python.

7.4M 2K 92
firecrawl
firecrawl-py

🔥 Search, scrape, and clean the web for AI agents.

6.8M 121K 7K
codelucas
newspaper3k

newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:

1M 15K 2K
firecrawl
firecrawl

🔥 Search, scrape, and clean the web for AI agents.

971K 121K 7K
JoMingyu
google-play-scraper

Google play scraper for Python inspired by <facundoolano/google-play-scraper>

564K 971 246
apify
crawlee

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

536K 9K 742
spider-rs
spider-client

Python, Javascript, and Rust libraries for the Spider Cloud API.

417K 25 9
scrapfly
scrapfly-sdk

Official Python SDK for the Scrapfly platform: web scraping, screenshots, AI extraction, crawling, and a remote anti-bot browser. Integrates with Scrapy, LlamaIndex, and LangChain.

256K 55 15
0x676e67
rnet

An ergonomic Python HTTP Client with TLS fingerprint

241K 1K 105
d60
twikit

Twitter API Scraper | Without an API key | Twitter Internal API | Free | Twitter scraper | Twitter Bot

165K 4K 537
ZenRows
zenrows

SDK to access ZenRows API directly from Python. We handle proxies rotation, headless browsers and CAPTCHAs for you.

147K 18 9
JustAnotherArchivist
snscrape

A social networking service scraper in Python

93K 5K 780
cinemagoer
cinemagoer

Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb (to which we are not affiliated in any way) movie database about movies, people, characters and companies

89K 1K 375
0x676e67
wreq

An ergonomic Python HTTP Client with TLS fingerprint

68K 1K 105
vladkens
twscrape

2025! X / Twitter API scrapper with authorization support. Allows you to scrape search results, User's profiles (followers/following), Tweets (favoriters/retweeters) and more.

62K 2K 291
BrianWeiHaoMa
misoreports

A comprehensive Python library for downloading Midcontinent Independent System Operator (MISO) public reports into pandas dataframes.

55K 8 0
dermasmid
scrapetube

A YouTube scraper for scraping channels, playlists, and searching 🔎

46K 510 72
isaackogan
tiktoklive

The definitive Python library to receive livestream events (comments, gifts, etc.) in realtime from TikTok LIVE.

43K 1K 271
henrique-coder
perplexity-webui-scraper

An advanced, high-performance Python client, MCP server, and REST API for reverse-engineering Perplexity AI's WebUI.

37K 84 19
outscraper
outscraper

The library provides convenient access to the Outscraper API from applications written in the Python language. Allows using Outscraper's services from your code.

36K 92 21
cinemagoer
imdbpy

Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb (to which we are not affiliated in any way) movie database about movies, people, characters and companies

33K 1K 375
tn3w
is-crawler

Crawler detection from User-Agent strings in 50 ns. Issues and pull requests welcome!

22K 0 0
cowboy-bebug
app-store-scraper

Single API ☝ App Store Review Scraper 🧹

20K 100 61
ZacharyHampton
homeharvest

Python package for scraping real estate property data

18K 681 159
    • Data from PyPI, GitHub, ClickHouse, and BigQuery