
Security News
OWASP 2025 Top 10 Adds Software Supply Chain Failures, Ranked Top Community Concern
OWASP’s 2025 Top 10 introduces Software Supply Chain Failures as a new category, reflecting rising concern over dependency and build system risks.
calamari-ocr
Advanced tools

OCR Engine based on OCRopy and Kraken using Python 3.
It is designed to both be easy to use from the command line but also be modular to be integrated and customized from other python scripts.

The documentation of Calamari is hosted here.
Pretrained models are available at calamari_models and calamari_models_experimental.
Current releases (with individual model tarballs) can be accessed here and here.
Calamari is available on pypi:
pip install calamari-ocr
Read the docs for further instructions.
See the docs to learn how to use Calamari from the command line.
See the docs to learn how to adapt Calamari for your needs.
If you use Calamari in your Research-Project, please cite:
Wick, C., Reul, C., Puppe, F.: Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition. Digital Humanities Quarterly 14(1) (2020)
@article{wick_calamari_2020,
title = {Calamari - {A} {High}-{Performance} {Tensorflow}-based {Deep} {Learning} {Package} for {Optical} {Character} {Recognition}},
volume = {14},
number = {1},
journal = {Digital Humanities Quarterly},
author = {Wick, Christoph and Reul, Christian and Puppe, Frank},
year = {2020},
}
FAQs
Line based ATR Engine based on OCRopy
We found that calamari-ocr demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 3 open source maintainers collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Security News
OWASP’s 2025 Top 10 introduces Software Supply Chain Failures as a new category, reflecting rising concern over dependency and build system risks.

Research
/Security News
Socket researchers discovered nine malicious NuGet packages that use time-delayed payloads to crash applications and corrupt industrial control systems.

Security News
Socket CTO Ahmad Nassri discusses why supply chain attacks now target developer machines and what AI means for the future of enterprise security.