@paddlejs-models/ocr


[Chinese version](./README_cn.md)

Version 1.2.4 (latest) on npm. Weekly downloads: 38 (up 15.15%). Maintainers: 5.

ocr

ocr is a text recognition module that includes two models: ocr_detection and ocr_recognition. The ocr_detection model detects the text regions in a picture, and the ocr_recognition model recognizes the characters (Chinese, English, and numbers) in each text region.


The module provides a simple and easy-to-use interface. Users only need to upload pictures to obtain text recognition results.

The input shape of the ocr_recognition model is [1, 3, 32, 320], so each detected text box is preprocessed before inference. If the width-to-height ratio of the box is <= 10, the whole box is passed to the recognition model. If the ratio is > 10, the box is cropped along its width, each cropped part is passed to the recognition model, and the recognition results of the parts are then spliced together.
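The ratio rule above can be sketched as a small standalone function. This is only an illustration: `splitWideRegion` and `spliceResults` are hypothetical names that are not part of @paddlejs-models/ocr, and the crop width of `maxRatio * height` is an assumption, since the README only says the box is cropped "according to the width".

```javascript
// Hypothetical helper, not part of @paddlejs-models/ocr.
// Splits a text box into horizontal crops whose width-to-height ratio
// does not exceed maxRatio (10 in the rule described above).
function splitWideRegion(width, height, maxRatio = 10) {
  if (width <= 0 || height <= 0) {
    throw new RangeError('width and height must be positive');
  }
  // Ratio within the limit: the whole box goes to the recognition model.
  if (width / height <= maxRatio) {
    return [{ x: 0, width }];
  }
  // Assumption: each crop is at most maxRatio * height pixels wide.
  const cropWidth = Math.max(1, Math.floor(maxRatio * height));
  const crops = [];
  for (let x = 0; x < width; x += cropWidth) {
    crops.push({ x, width: Math.min(cropWidth, width - x) });
  }
  return crops;
}

// Joins the per-crop recognition results in left-to-right order.
function spliceResults(parts) {
  return parts.join('');
}
```

For a box that is 700 pixels wide and 32 pixels high, this sketch yields three crops (320, 320, and 60 pixels wide), whose recognized strings would then be concatenated.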

The ocr_detection model is downloaded from PaddleOCR.

The ocr_recognition model is an inference model with an input shape of [1, 3, 32, 320], derived from the ch_PP-OCRv2_rec_train training model.

Run Demo

  1. Execute in the current directory
npm install
npm run dev
  2. Visit http://0.0.0.0:8872

Usage

Text Recognition


import * as ocr from '@paddlejs-models/ocr';

// Model initialization
await ocr.init();

// Run text recognition. img is the picture uploaded by the user; option is optional and may be omitted.
// option.canvas (HTMLCanvasElement): pass a canvas element to draw the detected text boxes on it
// option.style (object): pass a style object to configure how the boxes are drawn
// option.style.strokeStyle (string): outline color of the text box
// option.style.lineWidth (number): line width of the text box outline
// option.style.fillStyle (string): fill color of the text box
const res = await ocr.recognize(img, option);
// Text recognition results
console.log(res.text);
// Points of each text area
console.log(res.points);
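
As an illustration of the optional second argument, the sketch below assembles an option object from the fields listed in the comments above. `buildRecognizeOption` is a hypothetical helper and not part of the package; it only keeps the three documented style keys and leaves out anything that was not provided.

```javascript
// Hypothetical helper, not part of @paddlejs-models/ocr.
// Builds the optional second argument for ocr.recognize from the
// documented fields: canvas, style.strokeStyle, style.lineWidth, style.fillStyle.
function buildRecognizeOption(canvas, style = {}) {
  const option = {};
  if (canvas) {
    option.canvas = canvas;
  }
  const picked = {};
  for (const key of ['strokeStyle', 'lineWidth', 'fillStyle']) {
    if (style[key] !== undefined) {
      picked[key] = style[key];
    }
  }
  // Only attach a style object when at least one documented key was given.
  if (Object.keys(picked).length > 0) {
    option.style = picked;
  }
  return option;
}

// Possible use in a browser page (canvas and img come from the page):
//   const option = buildRecognizeOption(canvas, { strokeStyle: 'red', lineWidth: 2 });
//   const res = await ocr.recognize(img, option);
```

Passing no canvas and no style gives an empty object, which matches the case where the caller only wants the recognized text and points.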

Text Detection

To do text position detection without recognition:


import * as ocr from '@paddlejs-models/ocr';

// Model initialization
await ocr.init();

// Get the text detection points
const points = await ocr.detect(img);
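
A common follow-up is to turn each detected box into an axis-aligned rectangle, for example to crop it or to check its width-to-height ratio. The sketch below assumes that each entry of `points` is an array of `[x, y]` corner pairs; the README does not spell out this shape, so treat it as an assumption. `boundingRect` is a hypothetical helper, not part of the package.

```javascript
// Hypothetical helper, not part of @paddlejs-models/ocr.
// Assumption: a box is an array of [x, y] corner pairs.
function boundingRect(box) {
  const xs = box.map(([x]) => x);
  const ys = box.map(([, y]) => y);
  const x = Math.min(...xs);
  const y = Math.min(...ys);
  return {
    x,
    y,
    width: Math.max(...xs) - x,
    height: Math.max(...ys) - y,
  };
}

// Possible use with the detection result:
//   const rects = points.map(boundingRect);
```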

Online experience

https://paddlejs.baidu.com/ocr



Package last updated on 16 Nov 2023
