Calamari OCR

Line-based ATR engine based on OCRopy.

  • Owner: Calamari-OCR/calamari
  • Platform: Linux, Mac, Windows
  • License: GNU General Public License v3.0

Calamari is an OCR engine based on OCRopy and Kraken, written in Python 3.
It is designed to be easy to use from the command line while remaining modular enough to be integrated into and customized from other Python scripts.

Documentation

The documentation of Calamari is hosted here.

Pretrained model repository

Pretrained models are available at https://github.com/Calamari-OCR/calamari_models.
The current release can be accessed here (255 MB).

Installing

Calamari is available on PyPI:

pip install calamari-ocr

Read the docs for further instructions.
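
After installing, it can help to confirm that the package is visible to the current Python interpreter. The following is a minimal sketch using only the standard library. The helper name `installed_version` is hypothetical, and the only Calamari-specific assumption is the distribution name `calamari-ocr` taken from the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str = "calamari-ocr") -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        # The distribution is not installed in this environment.
        return None


if __name__ == "__main__":
    found = installed_version()
    print(f"calamari-ocr: {found if found else 'not installed'}")
```

If this reports "not installed" right after a successful pip run, pip and the interpreter are probably pointing at different environments.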

Command-Line Interface

See the docs to learn how to use Calamari from the command line.
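
As a rough illustration of driving the command line from a script, the sketch below only assembles an argument list for a prediction run. The helper `build_predict_command` is hypothetical, and the `calamari-predict` command with its `--checkpoint` and `--data.images` flags is an assumption based on the Calamari 2.x command-line interface. Flag names have changed between releases, so check `calamari-predict --help` for the installed version before relying on them.

```python
import shlex
from typing import List, Sequence


def build_predict_command(checkpoint: str, images: Sequence[str]) -> List[str]:
    """Assemble an argument list for a prediction run (flag names are assumed)."""
    if not images:
        raise ValueError("at least one image path is required")
    # Assumed Calamari 2.x style flags; verify with `calamari-predict --help`.
    return ["calamari-predict", "--checkpoint", checkpoint, "--data.images", *images]


if __name__ == "__main__":
    cmd = build_predict_command("model.ckpt", ["line_0001.png", "line_0002.png"])
    # Print the shell-quoted command instead of running it.
    print(shlex.join(cmd))
    # To execute it: subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single string avoids shell-quoting problems when image paths contain spaces.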

Calamari API

See the docs to learn how to adapt Calamari for your needs.

Citing Calamari

If you use Calamari in your research project, please cite:

Wick, C., Reul, C., Puppe, F.: Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition. Digital Humanities Quarterly 14(1) (2020)

@article{wick_calamari_2020,
    title = {Calamari - {A} {High}-{Performance} {Tensorflow}-based {Deep} {Learning} {Package} for {Optical} {Character} {Recognition}},
    volume = {14},
    number = {1},
    journal = {Digital Humanities Quarterly},
    author = {Wick, Christoph and Reul, Christian and Puppe, Frank},
    year = {2020},
}

Main metrics

Overview

  • Name With Owner: Calamari-OCR/calamari
  • Primary Language: Python
  • Programming Language: Python (Language Count: 2)
  • Platform: Linux, Mac, Windows
  • License: GNU General Public License v3.0

Owner activity

  • Created At: 2018-03-20 15:22:29
  • Pushed At: 2025-05-12 16:17:05
  • Last Commit At: 2025-05-12 18:17:04
  • Release Count: 39
  • Last Release Name: v2.3.1
  • First Release Name: v0.1.0

User engagement

  • Stargazers Count: 1.1k
  • Watchers Count: 52
  • Fork Count: 212
  • Commits Count: 479
  • Has Issues Enabled
  • Issues Count: 276
  • Issue Open Count: 57
  • Pull Requests Count: 64
  • Pull Requests Open Count: 3
  • Pull Requests Close Count: 29

Project settings

  • Has Wiki Enabled
  • Is Archived
  • Is Fork
  • Is Locked
  • Is Mirror
  • Is Private