Single Character OCR
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution
Awesome multilingual OCR and document parsing toolkits based on PaddlePaddle
Awesome OCR Library
A cross-platform OCR library based on OnnxRuntime.
OCR, layout, reading order, and table recognition in 90+ languages
Python bindings for libmongocrypt
Python-tesseract is a Python wrapper for Google's Tesseract-OCR
Postgres-based distributed task processing library
OCR-D framework
A simple, Pillow-friendly, Python wrapper around tesseract-ocr API using Cython
带带弟弟OCR
OCR-D framework
Create and validate BagIt packages
This module can be used to validate BagitProfiles.
RO-Crate metadata generator/parser
ONNX-based OCR (PP-OCRv5) inference pipeline.
OCR-D framework
Tencent Cloud Ocr SDK for Python
Automatic generation of crystal structure descriptions
Audio generation research library for PyTorch
A Python wrapper library for the subprocess module.
Generate RO-Crates from workflow repositories
Python package for docstring repetition
A helper class to get more meaningful text out of common OCR outputs
OCR-D framework
OCR-D framework
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
Python 3 package for Chinese/English OCR, with small pretrained models
Easily parse JSON returned by Amazon Textract.
OCR-D framework
OCR plugin for MarkItDown - Extracts text from images in PDF, DOCX, PPTX, and XLSX via LLM Vision
A Python wrapper for OCR engines (Tesseract, Cuneiform, etc.)
Onnx Text Recognition (OnnxTR) OCR plugin for docling
Asynchronous Python wrapper for the @cryptobot API
Zoho CRM SDK for ZOHO CRM v8 APIs
A package to use AWS Textract services.
Convert documents to markdown with high speed and accuracy.
Fast & Lightweight OCR for vehicle license plates.
OCR model that converts documents to markdown, HTML, or JSON.
A structured OCR pipeline designed for layout-aware text extraction from complex documents, combining preprocessing, column detection, region classification, and ordered OCR assembly.
Renders Python docstrings to rich HTML
Fast, efficient, and high quality OCR powered by open visual language models
OCR for Japanese manga
Thoughtful OCR Package