
Security News
Attackers Are Hunting High-Impact Node.js Maintainers in a Coordinated Social Engineering Campaign
Multiple high-impact npm maintainers confirm they have been targeted in the same social engineering campaign that compromised Axios.
A cross-platform OCR library based on PaddleOCR v5 and ONNX Runtime that is as small as possible.
A lightweight, type-safe, dependency-free JavaScript/TypeScript library for PaddleOCR, supporting both Node.js and browser environments.
onnxruntime-web and onnxruntime-node.npm install paddleocr
# or
yarn add paddleocr
# or
pnpm add paddleocr
import * as ort from "onnxruntime-web";
import * as ort from "onnxruntime-node";
You can use fetch, fs.readFileSync, or any other method to load your ONNX model files and dictionary as ArrayBuffer and string array, respectively.
import { PaddleOcrService } from "paddleocr";
const paddleOcrService = await PaddleOcrService.createInstance({
ort,
detection: {
modelBuffer: detectOnnx,
minimumAreaThreshold: 24,
textPixelThreshold: 0.55,
paddingBoxVertical: 0.3,
paddingBoxHorizontal: 0.5,
},
recognition: {
modelBuffer: recOnnx,
charactersDictionary: dict,
imageHeight: 48,
},
});
The detection and recognition objects above act as instance defaults. They are applied to every recognize() call unless you override them per request.
The recognize method expects an object with width, height, and data (Uint8Array of RGB(A) values). Use your preferred image decoding library (e.g., fast-png, image-js).
import { decode } from "fast-png";
const imageFile = await readFile("tests/image.png");
const buffer = imageFile.buffer.slice(imageFile.byteOffset, imageFile.byteOffset + imageFile.byteLength);
const image = decode(buffer);
const input = {
data: image.data,
width: image.width,
height: image.height,
};
const result = await paddleOcrService.recognize(input);
console.log(result);
You can pass onProgress to recognize to receive detection and recognition updates in real time.
const result = await paddleOcrService.recognize(input, {
onProgress(event) {
console.log(event.type, event.stage, event.progress);
if (event.type === "rec" && event.stage === "item") {
console.log("Partial result:", event.result?.text, event.box);
}
},
});
Event contract:
det emits preprocess, infer, and postprocess with fixed progress 1/3, 2/3, 3/3rec emits start, one item per detected text box, then completerec/item includes the current result and boxdet/postprocess includes detectedCountYou can set defaults when creating the instance, then override them for a single image.
const strictResult = await paddleOcrService.recognize(invoiceInput, {
detection: {
minimumAreaThreshold: 40,
textPixelThreshold: 0.65,
paddingBoxVertical: 0,
paddingBoxHorizontal: 0,
dilationKernelSize: 3,
},
recognition: {
imageHeight: 64,
charactersDictionary: digitsOnlyDict,
},
ordering: {
sortByReadingOrder: true,
sameLineThresholdRatio: 0.15,
},
});
const looseResult = await paddleOcrService.recognize(noteInput, {
detection: {
minimumAreaThreshold: 8,
textPixelThreshold: 0.45,
},
});
Supported per-call override groups:
detection: padding, mean, stdDeviation, maxSideLength, paddingBoxVertical, paddingBoxHorizontal, minimumAreaThreshold, textPixelThreshold, dilationKernelSizerecognition: imageHeight, mean, stdDeviation, charactersDictionaryordering: sortByReadingOrder, sameLineThresholdRatiocharactersDictionary can be provided either when creating the instance or per call. If neither is provided, recognize() throws.
processRecognitionconst rawRecognition = await paddleOcrService.recognize(input);
const processed = paddleOcrService.processRecognition(rawRecognition, {
lineMergeThresholdRatio: 0.8,
});
console.log(processed.text);
console.log(processed.lines);
You can find sample models in the assets/ directory:
PP-OCRv5_mobile_det_infer.onnxPP-OCRv5_mobile_rec_infer.onnxppocrv5_dict.txtSee the examples/ directory for usage samples.
About browser usage with Vite, check out paddleocr-vite-example
Contributions are welcome! Feel free to submit a PR or open an issue.
MIT
FAQs
A cross-platform OCR library based on PaddleOCR v5 and ONNX Runtime that is as small as possible.
The npm package paddleocr receives a total of 280 weekly downloads. As such, paddleocr popularity was classified as not popular.
We found that paddleocr demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Security News
Multiple high-impact npm maintainers confirm they have been targeted in the same social engineering campaign that compromised Axios.

Security News
Axios compromise traced to social engineering, showing how attacks on maintainers can bypass controls and expose the broader software supply chain.

Security News
Node.js has paused its bug bounty program after funding ended, removing payouts for vulnerability reports but keeping its security process unchanged.