Skip to content
Snippets Groups Projects
user avatar
Tom N Harris authored
An external tokenizer inserts extra spaces to mark words in the input text.
The text is sent through STDIN and STDOUT file handles.

A good choice for Chinese and Japanese is MeCab.
http://sourceforge.net/projects/mecab/
With the command line 'mecab -O wakati'
1c07b9e6
History
Code owners
Assign users and groups as approvers for specific file changes. Learn more.
Name Last commit Last update
..