tiny_segmenter 0.0.6
Ruby port of TinySegmenter.js for tokenizing Japanese text. It uses a Naive Bayes model trained on the RWCP corpus and optimized with L1-norm regularization. The resulting model is compact, yet reaches 95% accuracy.
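To make the idea of a compact, weight-based segmenter concrete, here is a minimal sketch of boundary scoring by character class. It is not the gem's code or API: `char_type`, `boundary_score`, `toy_segment`, and every value in `TOY_WEIGHTS` are hypothetical names and numbers invented for illustration. The trained model presumably sums many more feature weights per boundary; this toy uses a single feature.

```ruby
# Toy boundary scorer, loosely in the spirit of a linear segmentation model.
# All names and weights here are invented for illustration; they are NOT
# the gem's API or its trained model.

# Map a single character to a coarse character class.
def char_type(ch)
  case ch
  when /\p{Han}/      then :kanji
  when /\p{Hiragana}/ then :hiragana
  when /\p{Katakana}/ then :katakana
  when /[A-Za-z]/     then :latin
  when /[0-9]/        then :digit
  else                     :other
  end
end

# Hypothetical weights keyed by [class before boundary, class after boundary].
# Positive values favour a word break, negative values favour continuing.
TOY_WEIGHTS = {
  [:kanji, :kanji]       => -2,
  [:hiragana, :hiragana] => -1,
  [:katakana, :katakana] => -3,
  [:kanji, :hiragana]    => 2,
  [:hiragana, :kanji]    => 3
}.freeze

# Score the candidate boundary between two adjacent characters.
# Pairs missing from the table default to -1 (same class) or +1 (different class).
def boundary_score(left, right)
  a = char_type(left)
  b = char_type(right)
  TOY_WEIGHTS.fetch([a, b], a == b ? -1 : 1)
end

# Split text wherever the boundary score is positive.
def toy_segment(text)
  words = []
  current = +""
  text.each_char do |ch|
    if !current.empty? && boundary_score(current[-1], ch) > 0
      words << current
      current = +""
    end
    current << ch
  end
  words << current unless current.empty?
  words
end

p toy_segment("私の名前は中野です")
# prints ["私", "の", "名前", "は", "中野", "です"]
```

A class-pair table this small gets many real sentences wrong (for example, any word that mixes kanji and hiragana), which is why a trained model with richer features is needed in practice.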
Gemfile: gem 'tiny_segmenter', '~> 0.0.6'
Install: gem install tiny_segmenter
Versions:
- 0.0.6 - October 26, 2015 (16 KB)
- 0.0.4 - March 31, 2013 (16 KB)
- 0.0.2 - August 27, 2012 (14 KB)
- 0.0.1 - August 20, 2012 (11.5 KB)