PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Scraping Websites Python Packages

Python packages with the GitHub topic scraping-websites. Sorted by relevance, with stars and monthly downloads.
kennethreitz
requests-html

Pythonic HTML Parsing for Humans™

827K 326 42
Anorov
cfscrape

A Python module to bypass Cloudflare's anti-bot page.

44K 4K 452
outscraper
outscraper

The library provides convenient access to the Outscraper API from applications written in the Python language. Allows using Outscraper's services from your code.

36K 92 21
TeamKillerX
tgcore

TGCoreSDK | A fluent DSL framework for Telegram, APIs, and AI workflows.

3K 1 0
crawlbase-source
crawlbase

Fast python library for the Crawlbase API

2K 25 2
sujitmandal
scrape-search-engine

Search anything on the different Search Engine's it will collect all the links.

2K 14 5
proxycrawl
proxycrawl

ProxyCrawl Python library for scraping and crawling

1K 58 19
multimodal-ai-lab
scrapemm

LLM-friendly scraper for media and text from social media and the open web.

1K 5 0
okaits
nicovideo-py

ニコニコ動画のAPIを使用して、動画や投稿者などの情報を取得するライブラリです。

753 0 0
fedecalendino
nintendeals

Scraping tools for Nintendo Switch games and prices on NA, EU and JP.

727 129 18
JordanAllen101
aioprox

Async Python proxy manager with latency testing

586 1 0
andriystr
lst

Declarative Scraping Library

562 0 0
outscraper
google-maps-reviews

Google Maps Reviews API SDK

549 14 5
pyporn-san
multporn

python library used to interact with multporn.net via python

486 28 1
outscraper
google-services-api

The library provides convenient access to the Outscraper API from applications written in the Python language. Allows using Outscraper's services from your code.

454 92 21
danangfir
indoquake

A Latest Earthquake Detection Package Taken Based on BMKG | Meteorological, Climatological, and Geophysical Agency

426 0 0
proxymesh
scrapy-proxy-headers

Add custom proxy headers to HTTPS requests in Scrapy

372 4 0
sarartur
liquidcss

Alters css selector names across css files and html templates.

319 3 0
Javinator9889
g-pygle

A tool for searching the entire web with the Google technology

311 5 1
erikqu
newsdatascraper

Easily query articles

297 5 0
Indigo-Coder-github
korean-news-crawler

Python Library for Crawling News Artircles in Korean Top 10 News Websites with Utilities

215 1 0
edwardseley
lyricscorpora

An unofficial Python API that allows users to create a corpus of lyrical text from their favorite artists and billboard charts

191 18 1
machinia
scraper-factory

Scraping library to retrieve data from useful pages, such as Amazon wishlists

165 1 0
Musubi-ai
musubi-scrape

A convenient crawling package for collecting web data.

158 1 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery