PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Internet Archive Python Packages

Python packages with the GitHub topic internet-archive. Sorted by relevance, with stars and monthly downloads.
akamhy
waybackpy

Wayback Machine API interface & a command-line tool

2.5M 579 40
webis-de
archive-query-log

Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives.

3K 34 0
saveweb
wikiteam3

archiving MediaWikis (and uploading wikidump to the Internet Archive)

2K 96 13
bibanon
tubeup

Use yt-dlp to download video/metadata and upload to the Internet Archive.

2K 488 73
saveweb
dokuwikidumper

A tool for archiving DokuWiki

2K 28 5
agude
wayback-machine-archiver

A Python script to submit web pages to the Wayback Machine for archiving.

1K 85 12
alopezrivera
anchorage

Anchor your little piece of internet.

883 23 1
GeiserX
wayback-archive

Download complete websites from the Wayback Machine with full asset preservation for offline viewing

776 10 6
BGforgeNet
yawbdl

A tool to download pages from Internet Archive.

546 21 4
claromes
waybacktweets

Archived tweets from the Wayback Machine

498 201 50
gdamdam
iagitup

Archive GitHub, GitLab, Bitbucket & any git repo to the Internet Archive as portable bundles with rich metadata.

455 99 9
WEB-CHILD
internet-archive-extractor

Tool for querying the Internet Archive CDX API, downloading resources and packaging them as WARC files.

416 2 0
caltechlibrary
eprints2archives

Send records from an EPrints server to the Internet Archive and other web archives

388 4 0
kenlhlui
pyarchiveit

A Python library to interact with the Archive-It's API

381 0 0
opencitations
piccione

Pronounced Py-ccione. A Python toolkit for uploading and downloading data to external repositories and cloud services

372 0 0
Quoorex
archive-file-urls

Submit URLs listed inside a file to website archival services

303 3 0
GeiserX
wayback-diff

Intelligent web page comparison tool with Wayback Machine support and visual regression testing

208 1 0
internetarchive
iacopilot

Summarize and ask questions about items in the Internet Archive

156 17 5
OpenJarbas
youtube-archivist

Index, canonicalize, deduplicate and serve media catalogues from YouTube / Bandcamp / SoundCloud / Internet Archive into a typed mediavocab dataset

96 2 1
bac0id
save-page-now-api

A Python wrapper for the Internet Archive's Save Page Now API.

96 3 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery