Socket
Book a DemoInstallSign in
Socket

vision-ocr

Package Overview
Dependencies
Maintainers
0
Versions
12
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

vision-ocr

Image to markdown (OCR) with Llama 3.2 Vision.

0.0.1
latest
Source
npmnpm
Version published
Maintainers
0
Created
Source

vision-ocr

Convert images to markdown using Llama 3.2 Vision model powered by Groq AI.

Features

  • 📸 Convert images to clean, formatted markdown
  • 🌐 Support for both local and remote images
  • 🎯 Accurate text extraction and formatting
  • 📝 Preserves document structure and formatting
  • ⚡ Fast processing using Groq AI

Installation

npm install vision-ocr

Setup

  • Get your Groq API key from Groq Console
  • Set up your API key either:
    • As an environment variable: GROQ_API_KEY=your_api_key
    • Or pass it directly in the code

Usage

import { ocr } from 'vision-ocr';

// Using a remote image
const markdown = await ocr({
  filePath: 'https://example.com/image.jpg',
  apiKey: 'your_groq_api_key' // Optional if env variable is set
});

// Using a local image
const markdown = await ocr({
  filePath: './path/to/local/image.jpg'
});

console.log(markdown);

API Reference

ocr(options)

Main function to convert images to markdown.

Options

ParameterTypeRequiredDefaultDescription
filePathstringYes-Path to local image or URL of remote image
apiKeystringNoprocess.env.GROQ_API_KEYYour Groq API key
modelstringNo"llama-3.2-11b-vision-preview"Model to use for OCR

Supported Models

  • llama-3.2-11b-vision-preview
  • llama-3.2-90b-vision-preview

Error Handling

The package throws errors in these cases:

  • Missing file path
  • Invalid image file
  • API errors
  • No content extracted from image
try {
  const markdown = await ocr({
    filePath: 'path/to/image'
  });
} catch (error) {
  console.error('OCR failed:', error.message);
}

Limitations

  • Images must be in a supported format (JPEG, PNG)
  • Maximum image size depends on Groq API limits
  • API key required for operation

License

MIT © Saurabh Udupi (@0xSaurabhx)

Contributing

Contributions, issues, and feature requests are welcome! Feel free to check the issues page.

Credit

This project was inspired by nutlope. Go check it out!

Keywords

Groq ai

FAQs

Package last updated on 21 Nov 2024

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts

SocketSocket SOC 2 Logo

Product

About

Packages

Stay in touch

Get open source security insights delivered straight into your inbox.

  • Terms
  • Privacy
  • Security

Made with ⚡️ by Socket Inc

U.S. Patent No. 12,346,443 & 12,314,394. Other pending.