RubyGems Navigation menu

tiny_segmenter 0.0.6

Ruby port of TinySegmenter.js for tokenizing Japanese text. Uses a Naive Bayes model that has been trained using the RWCP corpus and optimized using L1-norm regularization. The resultant model is quite compact, yet has a 95% accuracy rate.

Versions:

  1. 0.0.6 - October 26, 2015 (16 KB)
  2. 0.0.4 - March 31, 2013 (16 KB)
  3. 0.0.2 - August 27, 2012 (14 KB)
  4. 0.0.1 - August 20, 2012 (11.5 KB)
Show all versions (6 total)

Development Dependencies (2):

rake ~> 10.4
rspec ~> 3.3

Owners:

Authors:

  • Peter Graham

SHA 256 checksum:

e82614942df0a8b6dded150002fc7bd2f5073289be7dcbec34ac026d114a03ef

Total downloads 203,890

For this version 195,462

Gemfile:
= Copy to clipboard Copied!

install:
=

License:

BSD

Required Ruby Version: >= 0

Links: