wp2txt 1.0.0
WP2TXT extracts plain text data from Wikipedia dump file (encoded in XML/compressed with Bzip2) stripping all the MediaWiki markups and other metadata.
Gemfile:
=
安装:
=
Runtime 依赖 (7):
htmlentities
>= 0
nokogiri
>= 0
optimist
>= 0
parallel
>= 0
pastel
>= 0
ruby-progressbar
>= 0
tty-spinner
>= 0