3.3k
.NET/C#
SymSpell: 1 million times faster through Symmetric Delete sp...
C/C++
High performance chinese tokenizer with both GBK and UTF-8 c...
12.1k
Python
100+ Chinese Word Vectors 上百种预训练中文词向量
4k
自然语言处理
百度 NLP:分词,词性标注,命名实体识别。「Baidu NLP: word segmentation, lexical...
3.2k
自然语言处理
自然语言处理工具包 HanLP1.x 的 Python 接口。「Python interfaces for HanLP1...
Python
一个微型&算法全面的中文分词引擎 | A micro tokenizer for Chinese
自然语言处理
用 Rust 实现的 Jieba 汉字分词法。「The Jieba Chinese Word Segmentation ...
Java
Chinese Word Segmentation Tool, THULAC的Java实现.
Python
Source codes for paper "Neural Networks Incorporating Dictio...
1.5k
Python
Python package for Korean natural language processing.
Python
Source code for an ACL2017 paper on Chinese word segmentatio...