Calamari OCR

Line-based ATR engine based on OCRopy.

  • Owner: Calamari-OCR/calamari
  • Platforms: Linux, Mac, Windows
  • License: GNU General Public License v3.0

Calamari is an OCR engine based on OCRopy and Kraken, written in Python 3.
It is designed to be easy to use from the command line and modular enough to be integrated into, and customized from, other Python scripts.

Documentation

The documentation of Calamari is hosted here.

Pretrained model repository

Pretrained models are available in the calamari_models repository (https://github.com/Calamari-OCR/calamari_models).
The current release can be accessed here (255 MB).

Installing

Calamari is available on PyPI:

pip install calamari-ocr

Read the docs for further instructions.
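
After installing, one way to confirm that the package is visible to the current interpreter is to query its installed version. The following is a minimal sketch, assuming the distribution name `calamari-ocr` from the pip command above and Python 3.8 or newer; `installed_version` is a hypothetical helper written for this illustration and is not part of Calamari.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str = "calamari-ocr") -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent.

    `dist_name` defaults to the name used in `pip install calamari-ocr`
    (an assumption taken from the install command, not from Calamari's API).
    """
    try:
        # importlib.metadata reads the metadata that pip wrote at install time.
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    if found is None:
        print("calamari-ocr is not installed in this environment")
    else:
        print(f"calamari-ocr {found} is installed")
```

The check only inspects package metadata, so it does not import Calamari or its dependencies and runs quickly even in an environment where the installation is incomplete.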

Command-Line Interface

See the docs to learn how to use Calamari from the command line.

Calamari API

See the docs to learn how to adapt Calamari for your needs.

Citing Calamari

If you use Calamari in your research project, please cite:

Wick, C., Reul, C., Puppe, F.: Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition. Digital Humanities Quarterly 14(1) (2020)

@article{wick_calamari_2020,
    title = {Calamari - {A} {High}-{Performance} {Tensorflow}-based {Deep} {Learning} {Package} for {Optical} {Character} {Recognition}},
    volume = {14},
    number = {1},
    journal = {Digital Humanities Quarterly},
    author = {Wick, Christoph and Reul, Christian and Puppe, Frank},
    year = {2020},
}

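Key metrics

Overview
Name and owner: Calamari-OCR/calamari
Primary programming language: Python
Programming languages: Python (language count: 2)
Platforms: Linux, Mac, Windows
License: GNU General Public License v3.0
Owner activity
Created: 2018-03-20 15:22:29
Pushed: 2025-05-12 16:17:05
Last commit: 2025-05-12 18:17:04
Releases: 39
Latest release: v2.3.1
First release: v0.1.0
User engagement
Stars: 1.1k
Watchers: 52
Forks: 212
Commits: 479
Issues: 276
Open issues: 57
Pull requests: 64
Open pull requests: 3
Closed pull requests: 29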