中文版
ocr
Ocr is a text recognition module, which includes two models: ocr_detection and ocr_recognition。 Ocr_detection model detects the region of the text in the picture, ocr_recognition model can recognize the characters (Chinese / English / numbers) in each text area.
The module provides a simple and easy-to-use interface. Users only need to upload pictures to obtain text recognition results.
The input shape of the ocr_recognition model is [1, 3, 32, 320], and the selected area of the picture text box will be processed before the model reasoning: the width height ratio of the selected area of the picture text box is < = 10, and the whole selected area will be transferred into the recognition model; If the width height ratio of the frame selected area is > 10, the frame selected area will be cropped according to the width, the cropped area will be introduced into the recognition model, and finally the recognition results of each part of the cropped area will be spliced.
Ocr_detection model is downloaded frompaddleOCR.
ocr_recognition model is an inference model with an input shape of [1,3,32,320] derived from the ch_PP-OCRv2_rec_train training model.
Run Demo
- Execute in the current directory
npm install
npm run dev
- Visit http://0.0.0.0:8872
Usage
Text Recognition
import * as ocr from '@paddlejs-models/ocr';
await ocr.init();
const res = await ocr.recognize(img, option?);
console.log(res.text);
console.log(res.points);
Text Detection
To do text position detection without recognition:
import * as ocr from '@paddlejs-models/ocr';
await ocr.init();
const points = await ocr.detect(img);
Online experience
https://paddlejs.baidu.com/ocr
Performance