ocr for recognizing text on computer screen
A small example package
Weights for levenshtein distance based on ocr character similarity
Python3 package for Chinese/English OCR, with small pretrained models
OCR for Japanese manga
unified interface to google vision, aws textract, azure & tesseract OCR tools.
Add OCR data to Joplin notes
Audio generation research library for PyTorch
common predictors
Implementation of the hOCR specs
OCR-D wrapper for arbitrary coords-preserving image operations
Intelligent text and table extraction OCR tool which uses Nanonets OCR Engine to read and extract plain text and tables from image or pdf files with great accuracy
Fast, efficient, and high quality OCR powered by open visual language models
A cross platform OCR Library based on PaddlePaddle.
This repository contains a Python program designed to execute Optical Character Recognition (OCR) and Facial Recognition on images.
Typegroups classifier for OCR
Automatically crops faces from batches of pictures
OCR-D wrapper for detectron2 based segmentation models
Awesome OCR toolkits based on PaddlePaddle(8.6M ultra-lightweight pre-trained model, support training and deployment among server, mobile, embedded and IoT devices)
Extrair textos de documentos digitalizados e imagens.
qoala ocr rule
A Python wrapper for Tesseract
Tiny wrapper around pytesseract with image preprocessing and OCR configurations
OCR-Ops
Fast & Lightweight OCR for vehicle license plates.
Document Text Recognition (docTR): deep Learning for high-performance OCR on documents.
Convert documents to markdown with high speed and accuracy.
A package for performing OCR and interpreting the output using OpenAI and Gemini models.
SDK For publishing to Socrata
A robot framework library that utilizes OpenCV image processing and pytesseract OCR.
Mapping OCR predictions to fixed-size vocabularies
Clever, simple, and intuitive wrapper functionalities for OCRing specific textual materials
A tool for comparing OCR results from different OCR engines
font recognition and OCR
Lightweight Python library for RO-Crate manipulation implemented in Rust
Dead simple standard for storing/loading OCR datasets (image + label pairs)
OCRticle - Structured OCR for articles
ONNX-based OCR (PP-OCRv5) inference pipeline with DirectML support.
TrustGraph provides a means to run a pipeline of flexible AI processing components in a flexible means to achieve a processing pipeline.
Line based ATR Engine based on OCRopy
OpenMMLab Text Detection, OCR, and NLP Toolbox
Cryptography Extension for Hydrogram
OCR for DjVu (Python 3 fork)