RubyGems Navigation menu

wp2txt 1.0.1

WP2TXT extracts text and category data from Wikipedia dump files (encoded in XML / compressed with Bzip2), removing MediaWiki markup and other metadata.

Gemfile:
=

install:
=

Versions:

  1. 1.1.3 May 13, 2023 (7.78 MB)
  2. 1.1.2 April 15, 2023 (7.78 MB)
  3. 1.1.1 January 25, 2023 (7.78 MB)
  4. 1.1.0 January 22, 2023 (7.78 MB)
  5. 1.0.2 November 25, 2022 (7.78 MB)
  6. 1.0.1 August 11, 2022 (7.78 MB)
Show all versions (29 total)

Runtime Dependencies (7):

Owners:

Pushed by:

Authors:

  • Yoichiro Hasebe

SHA 256 checksum:

=

Total downloads 66,455

For this version 817

Version Released:

Licenses:

N/A

Required Ruby Version: >= 0

Links: