tiny_segmenter 0.0.6
Ruby port of TinySegmenter.js for tokenizing Japanese text. Uses a Naive Bayes model that has been trained using the RWCP corpus and optimized using L1-norm regularization. The resultant model is quite compact, yet has a 95% accuracy rate.
Gemfile:
=
复制到剪贴板
已复制!
安装:
=
版本列表:
- 0.0.6 - October 26, 2015 (16.0 KB)
- 0.0.4 - March 31, 2013 (16.0 KB)
- 0.0.2 - August 27, 2012 (14.0 KB)
- 0.0.1 - August 20, 2012 (11.5 KB)