Single Character OCR
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution
Python bindings for libmongocrypt
String distances considering OCR errors.
A cross-platform OCR library based on OnnxRuntime.
Awesome OCR toolkits based on PaddlePaddle (8.6M ultra-lightweight pre-trained model; supports training and deployment across server, mobile, embedded, and IoT devices)
OCR-D framework
带带弟弟OCR ("Carry Little Brother" OCR)
Python-tesseract is a Python wrapper for Google's Tesseract-OCR
OCR-D framework
OCR-D framework
A simple, Pillow-friendly, Python wrapper around tesseract-ocr API using Cython
OCR-D framework
OCR-D framework
RO-Crate metadata generator/parser
OCR, layout, reading order, and table recognition in 90+ languages
This module can be used to validate BagitProfiles.
Create and validate BagIt packages
A Python wrapper library for subprocess module.
Postgres-based distributed task processing library
abstract_ocr
Tencent Cloud OCR SDK for Python
pylsd provides Python bindings for LSD (Line Segment Detector)
Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)
Python package for docstring repetition
Generate RO-Crates from workflow repositories
Easily parse JSON returned by Amazon Textract.
Preprocess, segment and recognize text using Tesseract OCR and the OCR-D framework
Hobby Project GUI for the Python Program 'OCRmyPDF' by James R. Barlow
A helper class to get more meaningful text out of common OCR outputs
Awesome OCR Library
Font recognition and OCR
OCRmyPDF plugin to generate SVG files for Papermerge
Converters for various file formats used for representing OCR
OCR-D framework
A Python wrapper for OCR engines (Tesseract, Cuneiform, etc.)
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
Nougat: Neural Optical Understanding for Academic Documents
A tool for comparing OCR results from different OCR engines
A small example package
Renders Python docstrings to rich HTML
A CLI tool to apply OCR on PDF files and export to multiple formats.