PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Webscraping Python Packages

Python packages with the GitHub topic webscraping. Sorted by relevance, with stars and monthly downloads.
requests-cache
requests-cache

Persistent HTTP cache for python requests

17.3M 1K 157
adbar
htmldate

Fast and robust date extraction from web pages, with Python or on the command-line

11.2M 149 30
firecrawl
firecrawl-py

🔥 Search, scrape, and clean the web for AI agents.

6.8M 121K 7K
Kaliiiiiiiiii-Vinyzu
patchright

Undetected Python version of the Playwright testing and automation library.

5.3M 1K 99
seleniumbase
seleniumbase

APIs for browser automation, testing, and bypassing bot-detection.

3.5M 13K 2K
firecrawl
firecrawl

🔥 Search, scrape, and clean the web for AI agents.

971K 121K 7K
D4Vinci
scrapling

🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!

881K 50K 5K
daijro
camoufox

🦊 Anti-detect browser

738K 9K 722
CloakHQ
cloakbrowser

Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with source-level fingerprint patches. 30/30 tests passed.

238K 13K 989
ZenRows
zenrows

SDK to access ZenRows API directly from Python. We handle proxies rotation, headless browsers and CAPTCHAs for you.

147K 18 9
assafelovic
gpt-researcher

An autonomous agent that conducts deep research on any data using any LLM providers

94K 27K 4K
jpjacobpadilla
stealth-requests

Undetected web-scraping & seamless HTML parsing in Python!

45K 470 48
openzim
libzim

Libzim binding for Python: read/write ZIM files in Python

39K 104 29
seleniumbase
selenium-base

APIs for browser automation, testing, and bypassing bot-detection.

27K 13K 2K
seleniumbase
pytest-seleniumbase

APIs for browser automation, testing, and bypassing bot-detection.

26K 13K 2K
seleniumbase
sbase

APIs for browser automation, testing, and bypassing bot-detection.

24K 13K 2K
seleniumbase
pytest-sbase

APIs for browser automation, testing, and bypassing bot-detection.

22K 13K 2K
ZacharyHampton
homeharvest

Python package for scraping real estate property data

18K 681 159
Hyper-Solutions
hyper-sdk

Python SDK for Bot Protection Bypass - Automate Akamai, Incapsula, Kasada, and DataDome. No browsers required. Solve challenges and generate valid sensors/cookies via API.

13K 58 3
seleniumbase
basecase

APIs for browser automation, testing, and bypassing bot-detection.

11K 13K 2K
ScrapingAnt
scrapingant-client

ScrapingAnt API client for Python.

10K 43 5
vypivshiy
ssc-codegen

python-dsl code converter to html parser for web scraping

8K 4 0
maxhumber
gazpacho

The simple, fast, and modern web scraping library

7K 769 54
phoenixthrush
aniworld

AniWorld Downloader is a cross-platform tool for streaming and downloading anime from aniworld.to, as well as series from s.to. It runs on Windows, macOS, and Linux, providing a seamless experience for offline viewing or instant playback. If you enjoy using it, feel free to leave a ⭐!

6K 247 38
    • Data from PyPI, GitHub, ClickHouse, and BigQuery