wp2txt 1.0.1
WP2TXT extracts text and category data from Wikipedia dump files (XML, optionally compressed with Bzip2) and removes MediaWiki markup and other metadata.
Gemfile:
gem 'wp2txt', '1.0.1'
install:
gem install wp2txt -v 1.0.1
Runtime Dependencies (7):
htmlentities >= 0
nokogiri >= 0
optimist >= 0
parallel >= 0
pastel >= 0
ruby-progressbar >= 0
tty-spinner >= 0
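
To give a rough idea of what "removing MediaWiki markup" and "extracting category data" mean in practice, here is a minimal sketch in plain Ruby. It is not WP2TXT's implementation: the method name `strip_wikitext`, the `CATEGORY_LINK` constant, and the handful of regular expressions are assumptions made for illustration, and they cover only simple, non-nested markup.

```ruby
# Illustrative only: a tiny regex-based cleaner, NOT wp2txt's actual code.
# Matches [[Category:Name]] and [[Category:Name|sort key]] (hypothetical constant).
CATEGORY_LINK = /\[\[Category:([^\]|]+)(?:\|[^\]]*)?\]\]/

# Returns [plain_text, categories] for one page's wikitext (hypothetical helper).
def strip_wikitext(wikitext)
  categories = wikitext.scan(CATEGORY_LINK).flatten.map(&:strip)
  text = wikitext.gsub(CATEGORY_LINK, "")
  text = text.gsub(/\{\{[^{}]*\}\}/, "")                        # drop non-nested templates
  text = text.gsub(/\[\[(?:[^\]|]*\|)?([^\]]*)\]\]/, '\1')      # keep the visible link label
  text = text.gsub(/'{2,}/, "")                                 # remove bold/italic quote runs
  text = text.gsub(/^=+[ \t]*(.*?)[ \t]*=+[ \t]*$/, '\1')       # unwrap == headings ==
  [text.strip, categories]
end

sample = "'''Ruby''' is a [[programming language|language]].\n" \
         "[[Category:Programming languages]]"
text, categories = strip_wikitext(sample)
# text       => "Ruby is a language."
# categories => ["Programming languages"]
```

Real dump pages contain nested templates, tables, references, and HTML entities that this sketch does not handle, which is presumably why the gem depends on libraries such as nokogiri and htmlentities.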