An extremely fast implementation of Aho Corasick algorithm based on Double Array Trie.
Utolsó Kiadás okt. 07, 2016汉语言处理包
Utolsó Kiadás nullA Lucene tokenizer plugin for both Simplified Chinese and Traditional Chinese, featured with Chinese Word Segmentation, custom dictionary etc.
Utolsó Kiadás dec. 14, 2016HanLP: Han Language Processing
Utolsó Kiadás dec. 27, 2020A Lucene tokenizer plugin for both Simplified Chinese and Traditional Chinese, featured with Chinese Word Segmentation, custom dictionary etc.
Utolsó Kiadás dec. 14, 2016