3k
.NET/C#
SymSpell: 1 million times faster through Symmetric Delete sp...
C/C++
High performance chinese tokenizer with both GBK and UTF-8 c...
11.6k
Python
100+ Chinese Word Vectors 上百种预训练中文词向量
3.8k
自然语言处理
百度 NLP:分词,词性标注,命名实体识别。「Baidu NLP: word segmentation, lexical...
3.1k
Python
自然语言处理工具包HanLP的Python接口
Python
一个微型&算法全面的中文分词引擎 | A micro tokenizer for Chinese
Rust
The Jieba Chinese Word Segmentation Implemented in Rust
Java
Chinese Word Segmentation Tool, THULAC的Java实现.
Python
Source codes for paper "Neural Networks Incorporating Dictio...
1.4k
Python
Python package for Korean natural language processing.
Python
Source code for an ACL2017 paper on Chinese word segmentatio...