tessdata

  • Owner: tesseract-ocr/tessdata
  • License: Apache License 2.0


tessdata

These language data files only work with Tesseract 4.0.0.
They are based on the sources in
tesseract-ocr/langdata on GitHub.
(still to be updated for 4.0.0 - 20180322)

These files contain models for both the legacy Tesseract engine (--oem 0) and the new LSTM neural-network-based engine (--oem 1).
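As a minimal sketch of how the two engine modes might be selected, the Python helper below assembles a `tesseract` command line. The function name `build_tesseract_command`, the constants, and the example paths are hypothetical; `-l`, `--oem`, and `--tessdata-dir` are options of the Tesseract command-line program. The helper only builds the argument list and does not run anything.

```python
from typing import List, Optional

# Engine modes named in the text: 0 = legacy engine, 1 = LSTM engine.
OEM_LEGACY = 0
OEM_LSTM = 1


def build_tesseract_command(
    image_path: str,
    output_base: str,
    lang: str = "eng",
    oem: int = OEM_LSTM,
    tessdata_dir: Optional[str] = None,
) -> List[str]:
    """Return an argv list for the tesseract CLI (hypothetical helper)."""
    if oem not in (OEM_LEGACY, OEM_LSTM):
        raise ValueError("this sketch only handles --oem 0 and --oem 1")
    cmd = ["tesseract", image_path, output_base, "-l", lang, "--oem", str(oem)]
    if tessdata_dir is not None:
        # Point Tesseract at the directory holding the .traineddata files.
        cmd += ["--tessdata-dir", tessdata_dir]
    return cmd


if __name__ == "__main__":
    # Example paths are placeholders, not files shipped with this repository.
    print(" ".join(build_tesseract_command("page.png", "page", oem=OEM_LEGACY)))
```

The returned list can be passed to `subprocess.run(..., check=True)` on a machine where Tesseract is installed. Requesting the legacy engine only works for language files that still include a legacy model.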

The LSTM models (--oem 1) in these files
are integerized versions of the models in
tessdata_best on GitHub.
They should therefore be faster, but probably slightly less accurate, than tessdata_best.

tessdata_fast on GitHub
provides an alternate set of integerized LSTM models which have been built with a smaller network.
tessdata_fast files are the ones packaged for Debian and Ubuntu.
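The trade-off between the three repositories can be summarized in a small lookup. The sketch below is illustrative only: the priority labels ("accuracy", "speed", "legacy") and the function name `choose_model_repo` are hypothetical, and the mapping simply restates the descriptions in this README.

```python
# Hypothetical priority labels mapped to the repositories named in this README.
MODEL_REPOS = {
    # Non-integerized LSTM models that the others are compared against.
    "accuracy": "tesseract-ocr/tessdata_best",
    # Integerized LSTM models built with a smaller network.
    "speed": "tesseract-ocr/tessdata_fast",
    # Legacy models (--oem 0) plus integerized LSTM models (--oem 1).
    "legacy": "tesseract-ocr/tessdata",
}


def choose_model_repo(priority: str) -> str:
    """Return the repository matching a priority label (hypothetical helper)."""
    try:
        return MODEL_REPOS[priority]
    except KeyError:
        valid = ", ".join(sorted(MODEL_REPOS))
        raise ValueError(f"unknown priority {priority!r}; expected one of: {valid}")


if __name__ == "__main__":
    for label in ("accuracy", "speed", "legacy"):
        print(label, "->", choose_model_repo(label))
```

A lookup table keeps the choice explicit and easy to adjust if the repositories change.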

The legacy tesseract models (--oem 0) have been removed for Indic and
Arabic script language files.

tessdata for 3.04 or 3.05

Get language data files for Tesseract 3.04 or 3.05 from the
3.04 tree.

More information and a complete list of all languages are available in the
Tesseract wiki.

All data in the repository are licensed under the
Apache-2.0 License; see the LICENSE file.

Key metrics

Overview
  • Name and owner: tesseract-ocr/tessdata
  • License: Apache License 2.0

Owner activity
  • Created: 2015-04-12 22:50:47
  • Last push: 2024-03-09 10:04:28
  • Releases: 4
  • Latest release: 4.1.0 (published 2021-02-16 22:21:44)
  • First release: 3.04.00

User engagement
  • Stars: 7k
  • Watchers: 227
  • Forks: 2.4k
  • Commits: 45
  • Issues: 166 (53 open)
  • Pull requests: 14 (2 open, 6 closed)