wikipedia-dump
Python package for working with MediaWiki XML content dumps
Flexible Wikipedia dataset builder with sampling and pretraining support
A library that assists in traversing and downloading from Wikimedia Data Dumps and their mirrors.
A Python toolkit to generate a tokenized dump of Wikipedia for NLP
Work with Wikipedia dumps.