tessdata

  • Owner: tesseract-ocr/tessdata
  • License: Apache License 2.0

tessdata

These language data files only work with Tesseract 4.0.0.
They are based on the sources in
tesseract-ocr/langdata on GitHub.
(still to be updated for 4.0.0 - 20180322)

These files contain models for both the legacy Tesseract engine (--oem 0) and the new LSTM neural-network-based engine (--oem 1).
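
As a rough illustration of selecting between the two engines, the sketch below assembles a `tesseract` command line with the `--oem` flag. The helper name `build_tesseract_command`, the file names, and the tessdata directory are hypothetical; only the `-l`, `--oem`, and `--tessdata-dir` options are taken from the Tesseract command line.

```python
from typing import List, Optional

LEGACY_ENGINE = 0  # corresponds to --oem 0
LSTM_ENGINE = 1    # corresponds to --oem 1


def build_tesseract_command(
    image: str,
    output_base: str,
    lang: str = "eng",
    oem: int = LSTM_ENGINE,
    tessdata_dir: Optional[str] = None,
) -> List[str]:
    """Return an argument list for the tesseract CLI (hypothetical helper).

    Only the two engine modes named in this README are accepted here;
    Tesseract itself defines further modes that this sketch ignores.
    """
    if oem not in (LEGACY_ENGINE, LSTM_ENGINE):
        raise ValueError(f"unsupported engine mode in this sketch: {oem}")
    cmd = ["tesseract", image, output_base, "-l", lang, "--oem", str(oem)]
    if tessdata_dir is not None:
        # Point Tesseract at a specific copy of the language data files.
        cmd += ["--tessdata-dir", tessdata_dir]
    return cmd


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch works
    # even where Tesseract is not installed.
    print(" ".join(build_tesseract_command("page.png", "page", oem=LEGACY_ENGINE)))
```

The returned list can be passed to `subprocess.run` once Tesseract 4.0.0 and these data files are installed.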

The LSTM models (--oem 1) in these files
have been updated to the integerized versions of the models in
tessdata_best on GitHub.
They should therefore be faster, but probably slightly less accurate, than tessdata_best.

tessdata_fast on GitHub
provides an alternative set of integerized LSTM models that were built with a smaller network.
tessdata_fast files are the ones packaged for Debian and Ubuntu.

The legacy tesseract models (--oem 0) have been removed for Indic and
Arabic script language files.
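
A caller that prefers the legacy engine therefore needs a fallback for those languages. The sketch below shows one way to express that. The set `LSTM_ONLY_LANGS` is a small hypothetical sample (Arabic and Hindi), not the repository's actual list, and `choose_oem` is an illustrative name.

```python
# Hypothetical sample of language codes whose files carry no legacy model.
# This is NOT a complete list; check the data files you actually install.
LSTM_ONLY_LANGS = frozenset({"ara", "hin"})


def choose_oem(lang: str, prefer_legacy: bool = False) -> int:
    """Pick an --oem value for a language string such as "eng" or "eng+hin".

    Returns 0 (legacy) only when the caller prefers it and every requested
    language is assumed to ship a legacy model; otherwise returns 1 (LSTM).
    """
    codes = lang.split("+")
    if prefer_legacy and not any(code in LSTM_ONLY_LANGS for code in codes):
        return 0
    return 1
```

With this sample set, a request for `"eng+hin"` falls back to the LSTM engine even when the legacy engine is preferred.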

tessdata for 3.04 or 3.05

Get language data files for Tesseract 3.04 or 3.05 from the
3.04 tree.
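
To make the choice of tree concrete, the sketch below builds a download URL for a single language file. Everything specific in it is an assumption: the ref names `3.04.00` and `main`, the GitHub raw-file URL layout, and the helper name `traineddata_url` do not come from this README and should be checked against the repository before use.

```python
# Assumed ref names: "3.04.00" for the 3.04 tree, "main" for the current files.
REF_FOR_MAJOR_VERSION = {3: "3.04.00", 4: "main"}

# Assumed GitHub raw-file URL layout for this repository.
BASE_URL = "https://github.com/tesseract-ocr/tessdata/raw"


def traineddata_url(lang: str, tesseract_major: int) -> str:
    """Return the assumed URL of <lang>.traineddata for a Tesseract major version."""
    try:
        ref = REF_FOR_MAJOR_VERSION[tesseract_major]
    except KeyError:
        raise ValueError(f"no ref known for Tesseract {tesseract_major}") from None
    return f"{BASE_URL}/{ref}/{lang}.traineddata"


if __name__ == "__main__":
    print(traineddata_url("eng", 3))
    print(traineddata_url("eng", 4))
```

Keeping the version-to-ref mapping in one dictionary makes it easy to correct if the repository's branch or tag names differ from these assumptions.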

More information and a complete list of all languages is available in the
Tesseract wiki.

All data in the repository are licensed under the
Apache-2.0 License; see the LICENSE file.

Main metrics

Overview
Name With Owner: tesseract-ocr/tessdata
Primary Language
Program language (Language Count: 0)
Platform
License: Apache License 2.0
Owner Activity
Created At: 2015-04-12 22:50:47
Pushed At: 2024-03-09 10:04:28
Last Commit At
Release Count: 4
Last Release Name: 4.1.0 (Posted on 2021-02-16 22:21:44)
First Release Name: 3.04.00 (Posted on )
User Engagement
Stargazers Count: 7k
Watchers Count: 228
Fork Count: 2.3k
Commits Count: 45
Has Issues Enabled
Issues Count: 164
Issue Open Count: 52
Pull Requests Count: 14
Pull Requests Open Count: 2
Pull Requests Close Count: 6
Project Settings
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private