nlp-lang

这个项目是一个基本包.封装了大多数nlp项目中常用工具

  • Owner: NLPchina/nlp-lang
  • Platform:
  • License:: Apache License 2.0
  • Category::
  • Topic:
  • Like:
    0
      Compare:

Github stars Tracking Chart

这个项目是一个基本包.封装了大多数nlp项目中常用工具

Main metrics

Overview
Name With OwnerNLPchina/nlp-lang
Primary LanguageJava
Program languageJava (Language Count: 1)
Platform
License:Apache License 2.0
所有者活动
Created At2014-03-30 13:38:45
Pushed At2024-04-18 02:16:29
Last Commit At2024-04-18 10:16:29
Release Count2
Last Release Name1.7.6 (Posted on )
First Release Name1.7.3 (Posted on )
用户参与
Stargazers Count1.5k
Watchers Count148
Fork Count497
Commits Count189
Has Issues Enabled
Issues Count40
Issue Open Count14
Pull Requests Count10
Pull Requests Open Count1
Pull Requests Close Count2
项目设置
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private

nlp-lang

1.X Build Status
sourcegraph

文档地址:http://www.nlpcn.org/docs/7
部分演示:http://www.nlpcn.org/demo

##MAVEN

<dependencies>
    <dependency>
        <groupId>org.nlpcn</groupId>
        <artifactId>nlp-lang</artifactId>
        <version>1.7.6</version>
    </dependency>
</dependencies>

这个项目是一个基本包.封装了大多数nlp项目中常用工具

工具

  • √ 词语标准化
  • √ tire树结构
  • √ 双数组tire树
  • √ 文本断句
  • √ html标签清理
  • √ Viterbi算法增加

组件

  • √ 汉字转拼音
  • √ 简繁体转换
  • √ bloomfilter
  • √ 指纹去重
  • √ SimHash文章相似度计算
  • √ 词共现统计
  • √ 基于内存的搜索提示
  • √ WordWeight词频统计,词idf统计,词类别相关度统计