PyRank

Wikipedia Dump Python Packages

Python packages with the GitHub topic wikipedia-dump. Sorted by relevance, with stars and monthly downloads.
macbre
mediawiki-dump

Python package for working with MediaWiki XML content dumps

920 25 4
omarkamali
wikisets

Flexible Wikipedia dataset builder with sampling and pretraining support

376 4 0
jon-edward
wiki-data-dump

A library that assists in traversing and downloading from Wikimedia Data Dumps and their mirrors.

357 12 1
akb89
witokit

A Python toolkit to generate a tokenized dump of Wikipedia for NLP

275 11 1
bfontaine
wpydumps

Work with Wikipedia dumps.

188 1 0
Data from PyPI, GitHub, ClickHouse, and BigQuery