PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Web Archive Python Packages

Python packages with the GitHub topic web-archive. Sorted by relevance, with stars and monthly downloads.
webis-de
archive-query-log

Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives.

3K 34 0
Own-Data-Privateer
hoardy-web

Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing, replay, mirroring, data scraping, and/or indexing. Your own personal private Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data.

496 127 10
internetarchive
cdxsummary

Summarize web archive capture index (CDX) files

284 90 30
Own-Data-Privateer
hoardy-web-sas

A simple archiving server for the `Hoardy-Web` Web Extension browser add-on.

199 127 10
oduwsdl
mementomap

A Tool to Summarize Web Archive Holdings

107 11 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery