MCP server for OCR with multiple backends (Marker, DeepSeek, Mistral)
High-performance Robust OCR powered
A package for extracting structured data from receipts.
An pytorch ocr base library for MBBank lib
MCP server for academic document authoring: grant proposals (科研費/JSPS), journal papers (IEEE/IEEJ/APS/Elsevier), research-talk slides, Excalidraw diagram pipeline + cross-cutting meta helpers. 86 diagnostic tools distilled from classical Japanese/English writing books (木下/本多/Wallwork). 0.6.0: OCR-inspired per-connected-component classification (text_like/line/circle/fill) + auto compose + iterative PSNR-driven trace refinement for the diagram pipeline.
Tesseract OCR with OpenCV preprocessing and auto-correct.
SDK Python officiel pour l'API OCR Facture France - Extraction automatique de données de factures via OCR
It reads "Oh! Craft". This is package for OCR.
Implements Sinapsis templates to perform optical character recognition on images
Manage your package version through ocrd-tool.json
OCR-D processor and client for OLA-HD
This repository contains a Python program designed to execute Optical Character Recognition (OCR) and Facial Recognition on images.
Add your description here
Generate keywords from CVs
A lightweight OCR system based on PaddleOCR
Turkmen language LSTM model for Tesseract OCR — pip install turkman_ocr
Alibaba Cloud ocr-api (20210707) SDK Library for Python2
Tools of extracting PDF content based on RapidOCR
OCR-D wrapper for DoxaPy image binarization via locally adaptive thresholding
Biblioteca para fazer ocr em imagem
This repository contains a Python program designed to execute Optical Character Recognition (OCR) and Facial Recognition on images.
a parser for AWS and Azure ocr json files
Different python scripts used in the OCR4all workflow.
Enterprise-grade OCR and document conversion tool with dual OCR engines
Modification on top of OpenMMLab Text Detection, OCR, and NLP Toolbox
cylinder ocr package
tesseract-ocr data for projects using it without having to install it.
OCR for incoming document in Tryton
Indian place-name lookup, OCR address cleanup, instant precheck and SQLite prefix fast correction, large Indian vocabulary, extraction, and address intelligence
A wrapper around tesseract that can be easily called from the command line on an image from the clipboard
OCR CLI for Arabic images using unsloth/surya
Templates for optical character recognition using the GLM-OCR model
A powerful DeepSeek-based Optical Character Recognition (OCR) implementation supporting text extraction and grounding.
OCR Accuracy Reporter
OCR-D wrapper for ocr-fileformat
Fast PaddleOCR MCP server - Extract text from images using PaddleOCR with optimized performance
E-kyc for Industries application
Enterprise Document Intelligence Platform with High-Performance C++ Extensions, API v2, MCP, and Vector Search
A versatile OCR and document processing command-line tool.
A handy tool for OCRing manga
OCRmyPDF-AIH — batch PDF OCR pipeline with Tesseract/Calamari backends, based on OCRmyPDF
Optical character recognition for Varian Real-Time Position Management System
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source products in an easy and simple manner. Access 10000+ state-of-the-art NLP and OCR models for Finance, Legal and Medical domains. Easily scalable to Spark Cluster
Convert scanned PDFs to text file using OCR