PyRank

Web Archives Python Packages

Python packages tagged with the GitHub topic web-archives, sorted by relevance and listed with their stars and monthly downloads.

webrecorder
warcio

Streaming WARC/ARC library for fast web archive IO

1.3M 457 69
webrecorder
pywb

Core Python Web Archiving Toolkit for replay and recording of web archives

12K 2K 239
cocrawler
cdx-toolkit

A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine

4K 206 34
oduwsdl
mementoembed

A service that provides archive-aware oEmbed-compatible embeddable surrogates (social cards, thumbnails, etc.) for archived web pages (mementos).

954 14 3
caltechlibrary
eprints2archives

Send records from an EPrints server to the Internet Archive and other web archives

388 4 0
ikreymer
pywayback

Core Python Web Archiving Toolkit for replay and recording of web archives

1 2K 239

Data from PyPI, GitHub, ClickHouse, and BigQuery