tessdata

  • Owner: tesseract-ocr/tessdata
  • License: Apache License 2.0


tessdata

These language data files only work with Tesseract 4.0.0.
They are based on the sources in
tesseract-ocr/langdata on GitHub.
(still to be updated for 4.0.0 - 20180322)

These files contain models for the legacy Tesseract engine (--oem 0) as well as the new LSTM neural-network-based engine (--oem 1).
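
As a minimal sketch of how the engine is selected, the helper below assembles a tesseract command line with the --oem option. The function name build_tesseract_command and the file names in the usage note are hypothetical. The -l and --tessdata-dir options are standard Tesseract command-line options that this README does not mention.

```python
from typing import List, Optional

# Engine modes named in the text: 0 = legacy engine, 1 = LSTM engine.
OEM_LEGACY = 0
OEM_LSTM = 1


def build_tesseract_command(
    image: str,
    output_base: str,
    lang: str = "eng",
    oem: int = OEM_LSTM,
    tessdata_dir: Optional[str] = None,
) -> List[str]:
    """Return an argument list for the tesseract CLI (hypothetical helper)."""
    if oem not in (OEM_LEGACY, OEM_LSTM):
        raise ValueError("this sketch only covers --oem 0 and --oem 1")
    # Basic form: tesseract <image> <output base> [options...]
    cmd = ["tesseract", image, output_base, "-l", lang, "--oem", str(oem)]
    if tessdata_dir is not None:
        # Point tesseract at the directory holding the .traineddata files.
        cmd += ["--tessdata-dir", tessdata_dir]
    return cmd
```

On a machine with Tesseract installed, the result could be passed to subprocess.run(cmd, check=True), for example with build_tesseract_command("page.png", "out", oem=OEM_LEGACY).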

The LSTM models (--oem 1) in these files
have been updated to the integerized versions of
tessdata_best on GitHub.
So, they should be faster but probably a little less accurate than tessdata_best.

tessdata_fast on GitHub
provides an alternate set of integerized LSTM models which have been built with a smaller network.
tessdata_fast files are the ones packaged for Debian and Ubuntu.

The legacy tesseract models (--oem 0) have been removed for Indic and
Arabic script language files.
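
Because some files no longer carry a legacy model, a caller that prefers --oem 0 may want a fallback. The sketch below shows one way to do that. The function choose_oem is hypothetical, and the example language set is an illustrative assumption rather than a list taken from the repository.

```python
from typing import Iterable

OEM_LEGACY = 0
OEM_LSTM = 1


def choose_oem(lang: str, preferred: int, lstm_only_langs: Iterable[str]) -> int:
    """Fall back to the LSTM engine when a language has no legacy model."""
    if preferred == OEM_LEGACY and lang in set(lstm_only_langs):
        return OEM_LSTM
    return preferred


# Illustrative assumption: "ara" (Arabic) and "hin" (Hindi) stand in for
# Arabic-script and Indic languages; verify against the actual files.
LSTM_ONLY_EXAMPLE = {"ara", "hin"}
```

The caller supplies the set of LSTM-only languages, so the helper stays correct if the repository contents change.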

tessdata for 3.04 or 3.05

Get language data files for Tesseract 3.04 or 3.05 from the
3.04 tree.
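
To fetch a single language file from a particular tree, a download URL can be built from the language code and a Git ref. This is a sketch under stated assumptions: the URL pattern, the default ref "main", and the tag name "3.04.00" are not confirmed by this README, and traineddata_url is a hypothetical helper.

```python
from urllib.parse import quote


def traineddata_url(lang: str, ref: str = "main", repo: str = "tessdata") -> str:
    """Build a raw-file URL for <lang>.traineddata (assumed URL layout)."""
    # Assumption: files live at the repository root and are served via /raw/<ref>/.
    return (
        "https://github.com/tesseract-ocr/"
        f"{quote(repo, safe='')}/raw/{quote(ref, safe='')}/{quote(lang, safe='')}.traineddata"
    )
```

For example, traineddata_url("eng", ref="3.04.00") would point at the English file in the assumed 3.04 tag, while passing repo="tessdata_fast" would target the smaller integerized models.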

More information and a complete list of all languages are available in the
Tesseract wiki.

All data in the repository are licensed under the
Apache-2.0 License; see the LICENSE file.

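Key metrics

Overview
Name and owner: tesseract-ocr/tessdata
Programming languages: 0
License: Apache License 2.0
Owner activity
Created: 2015-04-12 22:50:47
Last pushed: 2024-03-09 10:04:28
Releases: 4
Latest release: 4.1.0 (published 2021-02-16 22:21:44)
First release: 3.04.00
User engagement
Stars: 7k
Watchers: 227
Forks: 2.4k
Commits: 45
Issues: 166
Open issues: 53
Pull requests: 14
Open pull requests: 2
Closed pull requests: 6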