Convert documents to markdown with high speed and accuracy.
OCR for Japanese manga
A Python wrapper for Tesseract
A tool for comparing OCR results from different OCR engines
Aspose.OCR for Python is a powerful yet easy-to-use and cost-effective API for extracting text from scanned images, photos, screenshots, PDF documents, and other files.
Fast, efficient, and high quality OCR powered by open visual language models
SDK For publishing to Socrata
Alibaba Cloud ocr-api (20210707) SDK Library for Python
Python API for Amocrm
OCR-D framework
Line based ATR Engine based on OCRopy
Page segmentation and segmentation evaluation in the OCR-D framework
unified interface to google vision, aws textract, azure & tesseract OCR tools.
OCR-Ops
Document Text Recognition (docTR): deep Learning for high-performance OCR on documents.
A package for extracting structured content from PDFs and images using Typhoon OCR models
Tiny wrapper around pytesseract with image preprocessing and OCR configurations
common predictors
Automatically crops faces from batches of pictures
Cryptography Extension for Hydrogram
Library for retrieving and uploading to database data from amocrm API
A cross platform OCR Library based on PaddlePaddle.
OpenMMLab Text Detection, OCR, and NLP Toolbox
Transformer base text detection
Dead simple standard for storing/loading OCR datasets (image + label pairs)
Lightweight Python library for RO-Crate manipulation implemented in Rust
Mapping OCR predictions to fixed-size vocabularies
Awesome OCR toolkits based on PaddlePaddle(8.6M ultra-lightweight pre-trained model, support training and deployment among server, mobile, embedded and IoT devices)
A cross platform OCR API Library based on RapidOCR
Automatic generation of crystal structure descriptions
GeOCR is a Python library for OCR, data cleaning, and historical geocoding using custom GeoJSON maps.
OCR-D wrapper for detectron2 based segmentation models
OCR-D wrapper for arbitrary coords-preserving image operations
Intelligent text and table extraction OCR tool which uses Nanonets OCR Engine to read and extract plain text and tables from image or pdf files with great accuracy
Fast & Lightweight OCR for vehicle license plates.
OCR for DjVu (Python 3 fork)
debug OCR utils
Do automated crawling of pages using scrapy
A small example package
This repository contains a Python program designed to execute Optical Character Recognition (OCR) and Facial Recognition on images.
Clever, simple, and intuitive wrapper functionalities for OCRing specific textual materials
font recognition and OCR
The ocr module of Aliyun Python sdk.