Huge News!Announcing our $40M Series B led by Abstract Ventures.Learn More →

esearch-ocr

Package Overview

Dependencies

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

esearch-ocr

paddleocr models run on onnx

5.1.5
latest
npm

Version published: 3 weeks ago

Weekly downloads: 3; decreased by-86.96%

Maintainers: 0

Weekly downloads

Created: last year

Source

eSearch-OCR

本仓库是 eSearch的 OCR 服务依赖

支持本地 OCR（基于 PaddleOCR）

PaddleOCR License

paddle 预测库

基于onnxruntime的 web runtime，使用 wasm 运行，未来可能使用 webgl 甚至是 webgpu。

模型需要转换为 onnx 才能使用：Paddle2ONNX 或在线转换

部分模型已打包：Releases

在 js 文件下可以使用 electron 进行调试

使用

npm i esearch-ocr onnxruntime-web

web

import * as ocr from "esearch-ocr";
import * as ort from "onnxruntime-web";

const ocr = require("esearch-ocr");
const ort = require("onnxruntime-node");

[!IMPORTANT] 需要手动安装 onnxruntime（onnxruntime-node 或 onnxruntime-web，视平台而定），并在init参数中传入ort 这样设计是因为 web 和 electron 可以使用不同的 ort，很难协调，不如让开发者自己决定

浏览器或 Electron 示例

await ocr.init({
    detPath: "ocr/det.onnx",
    recPath: "ocr/rec.onnx",
    dic: "abcdefg...",
    ort,
});

const url = "data:image/png;base64,...";
ocr.ocr(url)
    .then((l) => {})
    .catch((e) => {});

或者

const localOCR = await ocr.init({
    detPath: "ocr/det.onnx",
    recPath: "ocr/rec.onnx",
    dic: "abcdefg...",
    ort,
});

localOCR.ocr(/*像上面ocr.ocr一样调用*/);

这在需要多次运行 ocr 时非常有用

node.js 示例，需要安装canvas

演示项目

init type

{
    ort: typeof import("onnxruntime-web");
    detPath: string;
    recPath: string;
    dic: string; // 文件内容，不是路径
    dev?: boolean;
    maxSide?: number;
    imgh?: number;
    imgw?: number;
    detShape?: [number, number]; // ppocr v3 需要指定为[960, 960], v4 为[640, 640]
    canvas?: (w: number, h: number) => any; // 用于node
    imageData?: any; // 用于node
    cv?: any;
}

ocr type

type PointType = [number, number]
ocr(img: ImageData): Promise<{
    text: string;
    mean: number;
    box: [PointType, PointType, PointType, PointType]; // ↖ ↗ ↘ ↙
}[]>

除了 ocr 函数，还有det函数，可单独运行，检测文字坐标；rec函数，可单独运行，检测文字内容。具体定义可看编辑器提示

Keywords

FAQs

What is esearch-ocr?

Is esearch-ocr popular?

Is esearch-ocr well maintained?

Package last updated on 16 Oct 2024

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

esearch-ocr

eSearch-OCR

使用

Keywords

Related posts

vlt Debuts New JavaScript Package Manager and Serverless Registry at NodeConf EU

Malicious Python Package Typosquats Popular 'fabric' SSH Library, Exfiltrates AWS Credentials