
Security News
npm Adopts OIDC for Trusted Publishing in CI/CD Workflows
npm now supports Trusted Publishing with OIDC, enabling secure package publishing directly from CI/CD workflows without relying on long-lived tokens.
ocrguru is a lightweight, extensible CLI tool that wraps the powerful Docling OCR pipeline.
Process scanned PDFs or images in one command, choose your OCR engine, and export clean text in Markdown, JSON, or hOCR—all without the usual setup fuss.
Multiple Engines
Input Formats
.pdf
, .png
, .jpg
/.jpeg
, .tiff
, .bmp
Output Formats
.md
) – human-friendly.json
) – full coordinates & metadata.html
) – preserve layout & stylingZero-Config Defaults
Cross-Platform
Extensible Codebase
core.py
cli.py
From PyPI:
pip install ocrguru
From your GitHub clone:
git clone https://github.com/yourusername/ocrguru.git
cd ocrguru
pip install .
Note:
docling
andhuggingface-hub
will install automatically.- For GPU-accelerated EasyOCR, install PyTorch with CUDA support:
pip install torch torchvision --extra-index-url https://download.pytorch.org/whl/cu118
Perform OCR on a PDF using the default RapidOCR engine, export to Markdown:
docling-ocr --engine rapidocr --input ./scanned_document.pdf --format md --output ./scanned_document.md
No extra flags needed!
Usage: docling-ocr [OPTIONS]
Options:
-e, --engine [easyocr|tesseract_py|tesseract_cli|rapidocr]
OCR engine (default: rapidocr)
-i, --input PATH Input file path (.pdf, image)
-f, --format [md|json|html] Output format (default: md)
-o, --output PATH Output file path
-h, --help Show this message and exit
docling-ocr --engine easyocr --input invoice.jpg --format json --output invoice.json
docling-ocr --engine tesseract_cli --input contract.pdf --format html --output contract.hocr.html
for pdf in reports/*.pdf; do
out="${pdf%.pdf}.md"
docling-ocr --input "$pdf" --output "$out"
done
ocrguru/
├── src/
│ └── ocrguru/
│ ├── cli.py # CLI entry point
│ └── core.py # OCR conversion logic
├── tests/ # pytest test suite
├── pyproject.toml # build & metadata
└── README.md # this file
We welcome your ideas and pull requests!
pip install -e .[test]
tests/
and implement your feature in src/ocrguru/
pytest
main
Please adhere to PEP 8 and write clear commit messages.
We use pytest for automated testing. Coverage reporting is encouraged:
pytest --cov=ocrguru
Ensure new features include corresponding tests.
Released under the MIT License. See LICENSE for full text.
❤️ Happy OCR’ing with ocrguru! ❤️
FAQs
CLI wrapper for Docling OCR engines
We found that ocrguru demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
npm now supports Trusted Publishing with OIDC, enabling secure package publishing directly from CI/CD workflows without relying on long-lived tokens.
Research
/Security News
A RubyGems malware campaign used 60 malicious packages posing as automation tools to steal credentials from social media and marketing tool users.
Security News
The CNA Scorecard ranks CVE issuers by data completeness, revealing major gaps in patch info and software identifiers across thousands of vulnerabilities.