Llama OCR
An npm library to run OCR for free with Llama 3.2 Vision.
Installation
npm i llama-ocr
Usage
import { ocr } from "llama-ocr";

const markdown = await ocr({
  filePath: "./trader-joes-receipt.jpg",
  model: "Llama-3.2-90B-Vision",
  apiKey: process.env.TOGETHER_API_KEY,
});
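Because the call goes over the network, it can fail on rate limits or transient errors. The following is a minimal sketch of a retry wrapper, not part of llama-ocr itself. `OcrOptions`, `OcrFn`, and `ocrWithRetry` are hypothetical names, and the option types only mirror the call shown above rather than the library's published typings.

```typescript
// Hypothetical types that mirror the usage example above; these are
// assumptions, not the library's published typings.
type OcrOptions = { filePath: string; model?: string; apiKey?: string };
type OcrFn = (options: OcrOptions) => Promise<string>;

// Resolves after `ms` milliseconds.
const sleep = (ms: number): Promise<void> =>
  new Promise((resolve) => setTimeout(resolve, ms));

// Calls `ocrFn` up to `maxAttempts` times, doubling the wait after each
// failure, and rethrows the last error if every attempt fails.
async function ocrWithRetry(
  ocrFn: OcrFn,
  options: OcrOptions,
  maxAttempts = 3,
  baseDelayMs = 1000,
): Promise<string> {
  let lastError: unknown;
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    try {
      return await ocrFn(options);
    } catch (error) {
      lastError = error;
      if (attempt < maxAttempts) {
        // Exponential backoff: baseDelayMs, then 2x, then 4x, and so on.
        await sleep(baseDelayMs * 2 ** (attempt - 1));
      }
    }
  }
  throw lastError;
}
```

With the import from the usage example in scope, this could be called as `await ocrWithRetry(ocr, { filePath: "./trader-joes-receipt.jpg", apiKey: process.env.TOGETHER_API_KEY })`. Passing the OCR function in as an argument keeps the wrapper easy to test with a stub.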
How it works
This library uses the free Llama 3.2 endpoint from Together AI to parse images and return markdown. Paid endpoints for Llama 3.2 11B and Llama 3.2 90B are also available for faster performance and higher rate limits.
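To make the flow concrete, here is a rough sketch of how an image might be packaged for a vision chat endpoint. It assumes an OpenAI-style chat completion payload, which Together AI's API accepts, with the image embedded as a base64 data URL. The function name `buildVisionRequest`, the default prompt, and the request layout are illustrative assumptions; the library's actual prompt and internals are not described here, and the model identifier is left as a parameter rather than guessed.

```typescript
// Shape of an OpenAI-style chat completion request carrying one image.
// This layout is an assumption about the endpoint, not the library's code.
type VisionRequest = {
  model: string;
  messages: Array<{
    role: "user";
    content: Array<
      | { type: "text"; text: string }
      | { type: "image_url"; image_url: { url: string } }
    >;
  }>;
};

// Builds the request body from an already base64-encoded image.
// `prompt` is a placeholder instruction, not the prompt llama-ocr uses.
function buildVisionRequest(
  imageBase64: string,
  mimeType: string,
  model: string,
  prompt = "Convert the contents of this image to markdown.",
): VisionRequest {
  return {
    model,
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: prompt },
          {
            type: "image_url",
            // Local images are embedded inline as a data URL.
            image_url: { url: `data:${mimeType};base64,${imageBase64}` },
          },
        ],
      },
    ],
  };
}
```

The returned object would be serialized as the JSON body of the chat completion request, and the markdown would be read from the first choice in the response.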
Roadmap
Credit
This project was inspired by Zerox. Go check them out!