Join our webinar on Wednesday, June 26, at 1pm EDTHow Chia Mitigates Risk in the Crypto Industry.Register
Socket
Socket
Sign inDemoInstall

dynamsoft-label-recognizer

Package Overview
Dependencies
8
Maintainers
1
Versions
32
Alerts
File Explorer

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

    dynamsoft-label-recognizer

Dynamsoft Label Recognizer (DLR) is an SDK designed to recognize meaningful zonal text or symbols in an image (Label). Common scenarios include price tags in supermarkets, inventory labels in warehouses, VIN codes on car windshields, driver licenses, pass


Version published
Weekly downloads
242
decreased by-2.81%
Maintainers
1
Install size
32.3 MB
Created
Weekly downloads
 

Readme

Source

Dynamsoft Label Recognizer for Your Website

Add the capability of reading ID cards or other text of fixed formats in your web application with just a few lines of code.

version downloads jsdelivr vulnerabilities

Once integrated, your users can open your website in a browser, access their cameras, and read the intended text directly from the video input.

In this guide, you will learn step by step on how to integrate this library into your website.

Table of Contents

Hello World - Simplest Implementation

Let's start by testing the "Hello World" example of the library which demonstrates how to use the minimum code to enable a web page to read text from a live video stream.

Step One: Check the code of the example

The complete code of the "Hello World" example is shown below

<!DOCTYPE html>
<html>

<body>
    <script src="https://cdn.jsdelivr.net/npm/dynamsoft-label-recognizer@2.2.3/dist/dlr.js"></script>
    <script src="https://cdn.jsdelivr.net/npm/dynamsoft-camera-enhancer@2.3.2/dist/dce.js"></script>
    <script>
        // Specify a license, you can visit https://www.dynamsoft.com/customer/license/trialLicense?utm_source=npm&product=dlr&package=js to get your own trial license good for 30 days. 
        Dynamsoft.DLR.LabelRecognizer.license = 'DLS2eyJvcmdhbml6YXRpb25JRCI6IjIwMDAwMSJ9';
        // Initialize and use the library
        (async () => {
            Dynamsoft.DLR.LabelRecognizer.onResourcesLoadStarted = (resourcePath) => {
                console.log("Loading " + resourcePath);
                // Show a visual cue that a model file is being downloaded
            };
            Dynamsoft.DLR.LabelRecognizer.onResourcesLoaded = (resourcePath) => {
                console.log("Finished loading " + resourcePath);
                // Hide the visual cue
            };
            let recognizer = await Dynamsoft.DLR.LabelRecognizer.createInstance();
            Dynamsoft.DCE.CameraEnhancer.defaultUIElementURL = Dynamsoft.DLR.LabelRecognizer.defaultUIElementURL;
            let cameraEnhancer = await Dynamsoft.DCE.CameraEnhancer.createInstance();
            recognizer.setImageSource(cameraEnhancer);
            await recognizer.updateRuntimeSettingsFromString("video-letter");
            recognizer.onImageRead = results => {
                for (let result of results) {
                    for (let lineResult of result.lineResults) {
                        console.log(lineResult.text);
                    }
                }
            };
            recognizer.onUniqueRead = (txt, results) => {
                alert(txt);
            };
            await recognizer.startScanning(true);
        })();
    </script>
</body>

</html>

Run via JSFiddle


About the code
  • LabelRecognizer.createInstance(): This method creates a LabelRecognizer object called recognizer.

  • CameraEnhancer.createInstance(): this method creates a CameraEnhancer object called cameraEnhancer which is used to control the camera as well as the default user interface. To use cameraEnhancer with recognizer, we pass to it the customized UI provided by the Dynamsoft Label Recognizer SDK via defaultUIElementURL and then bind it to recognizer with setImageSource() to allow the latter to fetch frames from the camera for recognition as well as highlight the recognized text areas.

  • updateRuntimeSettingsFromString("video-letter"): this sets up recognizer with a built-in template optimized for reading letters from continous video frames. Note that templates starting with "video-" are only valid after a CameraEnhancer instance has been bound to this LabelRecognizer instance.

  • onImageRead: This event is triggered every time the library finishes scanning a video frame image. The results object contains all the text results that the library has found on this frame. In this example, we print the results to the browser console.

  • onUniqueRead: This event is triggered when the library finds a new text, which is not a duplicate among multiple frames. txt holds the text value while results is an array of objects that hold details of the text. In this example, an alert will be displayed for this new text.

  • startScanning(true): Starts contious video frame scanning. The return value is a Promise which resovles when the camera is opened, the video shows up on the page and the scanning begins (which means cameraEnhancer has started feeding recognizer with frames to recognize).

Step Two: Test the example

Create a text file with the name "helloworld.html", fill it with the code above and save. After that, open the example page in a browser, allow the page to access your camera and the video will show up on the page. After that, you can point the camera at something with a simple line of text to read it.

You can also just test it at https://jsfiddle.net/DynamsoftTeam/b1w8vm0t/21/

If the text is decoded, an alert will pop up with the result text. At the same time, the text location will be highlighted in the video feed.

Note:

  • Although the page should work properly when opened directly as a file ("file:///"), it's recommended that you deploy it to a web server before accessing it. Also, some browsers require a secure connection (HTTPS) to access the cameras, so deploying the page to a HTTPS website is the best choice.
  • For first use, you may need to wait a few seconds for the library to initialize.
  • The license "DLS2eyJvcmdhbml6YXRpb25JRCI6IjIwMDAwMSJ9" used in this sample is an online license and requires network connection to work.

If the test doesn't go as expected, you can check out the FAQ or contact us.

Building your own page

Here we'll show all the steps required to build a web page for reading machine-readable zones (MRZ) on passports, visas, etc.

Include the library

Use a CDN

The simplest way to include the library is to use either the jsDelivr or UNPKG CDN. The "hello world" example above uses jsDelivr. Since the recognition is mostly on a video input, we should also include the supporting library Dynamsoft Camera Enhancer.

  • jsDelivr

    <script src="https://cdn.jsdelivr.net/npm/dynamsoft-label-recognizer@2.2.3/dist/dlr.js"></script>
    <script src="https://cdn.jsdelivr.net/npm/dynamsoft-camera-enhancer@2.3.2/dist/dce.js"></script>
    
  • UNPKG

    <script src="https://unpkg.com/dynamsoft-label-recognizer@2.2.3/dist/dlr.js"></script>
    <script src="https://unpkg.com/dynamsoft-camera-enhancer@2.3.2/dist/dce.js"></script>
    
Host the library yourself

Besides using the CDN, you can also download the library and host its files on your own website / server before including it in your application.

To download the library:

  • yarn

    yarn add dynamsoft-label-recognizer@2.2.3
    yarn add dynamsoft-camera-enhancer@2.3.2
    
  • npm

    npm install dynamsoft-label-recognizer@2.2.3
    npm install dynamsoft-camera-enhancer@2.3.2
    

Depending on how you downloaded the library and where you put it. You can typically include it like this:

<script src="/dlr-js-2.2.3/dist/dlr.js"></script>
<script src="/dlr-js-2.2.3/dce/dist/dce.js"></script>

or

<script src="/node_modules/dynamsoft-label-recognizer/dist/dlr.js"></script>
<script src="/node_modules/dynamsoft-camera-enhancer/dist/dce.js"></script>

Read more on how to host the library.

Configure the library

Before using the library, you need to configure a few things.

Specify the license

The library requires a license to work, use the API license to specify a license key.

Dynamsoft.DLR.LabelRecognizer.license = "YOUR-LICENSE-KEY";

To test the library, you can request a 30-day trial license via the customer portal.

If you registered for a Dynamsoft account and downloaded the library from the official site, Dynamsoft will generate a 30-day trial license for you and put the license key in all the downloaded samples.

Specify the location of the "engine" files

If the engine files (*.worker.js, *.wasm.js and *.wasm, etc.) are not in the same location with the main library file (dlr.js), you can use the API engineResourcePath to specify the engine path, for example:

//The following code uses the jsDelivr CDN, feel free to change it to your own location of these files.
Dynamsoft.DLR.LabelRecognizer.engineResourcePath = "https://cdn.jsdelivr.net/npm/dynamsoft-label-recognizer@2.2.3/dist/";
Dynamsoft.DCE.CameraEnhancer.engineResourcePath = "https://cdn.jsdelivr.net/npm/dynamsoft-camera-enhancer@2.3.2/dist/";

This configuration is usually required with frameworks like Angular or React where dlr.js is compiled into another file.

Add a visual cue about the loading of a .data file

The .data files are crucial for the recognition of certain types of text. For example, to read the MRZ zone on passports, the file MRZ.data must be loaded first. These .data files are loaded from the server on demand at runtime. At present, these files are quite large, for example, MRZ.data is about 10MB. Although these files are cached locally as soon as they are downloaded, loading them for the first time can be quite time-consuming. To make the process user-friendly, it's recommended to show a visual cue about the loading process to the user with the help of the APIs onResourcesLoadStarted, onResourcesLoadProgress and onResourcesLoaded :

Dynamsoft.DLR.LabelRecognizer.onResourcesLoadStarted = (resourcePath) => {
    console.log("Loading " + resourcePath);
    // Show a visual cue that a model file is being 
}
Dynamsoft.DLR.LabelRecognizer.onResourcesLoadProgress = (resourcePath, progress) => {
    console.log(resourcePath + "loading progress: " + progress.loaded + "/" + progress.total);
    // Show the progress
}
Dynamsoft.DLR.LabelRecognizer.onResourcesLoaded = (resourcePath) => {
    console.log("Finished loading " + resourcePath);
    // Hide the visual cue
}

Interact with the library

Create a LabelRecognizer object

To use the library, we first create a LabelRecognizer object.

let recognizer = null;
try {
    recognizer = await Dynamsoft.DLR.LabelRecognizer.createInstance();
} catch (ex) {
    console.error(ex);
}
Create a CameraEnhancer object and bind it to the LabelRecognizer object

A CameraEnhancer object is required for video recognition. Also, the object should make use of a customized UI from the Label Recognition SDK to streamline the recognition.

Dynamsoft.DCE.CameraEnhancer.defaultUIElementURL = Dynamsoft.DLR.LabelRecognizer.defaultUIElementURL;
let cameraEnhancer = await Dynamsoft.DCE.CameraEnhancer.createInstance();
recognizer.setImageSource(cameraEnhancer);
Change the camera settings if necessary

In some cases, a different camera might be required instead of the default one. Also, a different resolution might work better. To change the camera or the resolution, we use the CameraEnhancer object. Learn more here.

// set which camera and what resolution to use
let allCameras = await cameraEnhancer.getAllCameras();
await cameraEnhancer.selectCamera(allCameras[0]);
await cameraEnhancer.setResolution(1280, 720);
Set up the recognition process

Check out the following code:

let scanSettings = await recognizer.getScanSettings();
// disregard duplicated results found in a specified time period (in milliseconds)
scanSettings.duplicateForgetTime = 6000;
// set a scan interval in milliseconds so the library may release the CPU from time to time
scanSettings.intervalTime = 100;
await recognizer.updateScanSettings(scanSettings);
// use one of the built-in RuntimeSetting templates: 
// "number", "letter", "numberLetter", "numberUppercase", "VIN", "VIN_NA", "MRZ", "passportMRZ", "visaMRZ"
// "video-number", "video-letter", "video-numberLetter", "video-numberUppercase", "video-VIN", "video-VIN_NA", "video-MRZ", "video-passportMRZ", "video-visaMRZ"
// For convenience, these names are not case-sensitive.
// You can also pass in a JSON string as the template.
await recognizer.updateRuntimeSettingsFromString("video-MRZ");

As you can see from the above code snippets, there are two types of configurations:

  • get/updateScanSettings: Configures the behavior of the recognizer which includes duplicateForgetTime and intervalTime.

  • updateRuntimeSettingsFromString: Configures the recognizer engine with a built-in template or a template represented by a JSON string. If a template was configured at creation, it will be replaced.

    Note that templates starting with "video-" are only valid after a CameraEnhancer instance has been bound to this LabelRecognizer instance.

Customize the UI

The built-in UI of the LabelRecognizer object is defined in the file dist/dlr.ui.html . There are a few ways to customize it:

  • Modify the file dist/dlr.ui.html directly.

    This option is only possible when you host this file on your own web server instead of using a CDN. This file can then be passed to a CameraEnhancer object with Dynamsoft.DLR.LabelRecognizer.defaultUIElementURL .

  • Copy the file dist/dlr.ui.html to your application, modify it and use the API defaultUIElementURL to set it as the default UI.

    Dynamsoft.DLR.LabelRecognizer.defaultUIElementURL = "THE-URL-TO-THE-FILE";
    
  • Append the default UI element to your page, customize it before showing it.

    <div id="recognizerUI"></div>
    
    await cameraEnhancer.setUIElement(Dynamsoft.DLR.LabelRecognizer.defaultUIElementURL);
    document.getElementById('recognizerUI').appendChild(cameraEnhancer.getUIElement());
    document.getElementsByClassName('dce-btn-close')[0].hidden = true; // Hide the close button
    
  • Build the UI element into your own web page and specify it with the API setUIElement(HTMLElement).

    • Embed the video

      <div id="div-ui-container" style="width:100%;height:100%;">
          <div class="dce-video-container" style="position:relative;width:100%;height:500px;"></div>
      </div>
      <script>
          (async () => {
              let cameraEnhancer = await Dynamsoft.DCE.CameraEnhancer.createInstance();
              await cameraEnhancer.setUIElement(document.getElementById('div-ui-container'));
              let recognizer = await Dynamsoft.DLR.LabelRecognizer.createInstance();
              recognizer.setImageSource(cameraEnhancer);
              await recognizer.updateRuntimeSettingsFromString("video-MRZ");
              recognizer.onImageRead = results => {
                  for (let result of results) {
                      for (let lineResult of result.lineResults) {
                          console.log(lineResult.text);
                      }
                  }
              };
              recognizer.onMRZRead = (txt, results) => {
                  alert(txt);
              };
              await recognizer.startScanning(true);
          })(); 
      </script>
      

      The video element will be created and appended to the DIV element with the class dce-video-container, make sure the class name is the same. Besides, the CSS property position of the DIV element must be either relative, absolute, fixed, or sticky.

    • Add the camera list and resolution list

      If the class names for these lists match the default ones, dce-sel-camera and dce-sel-resolution , the library will automatically populate the lists and handle the camera/resolution switching.

      <select class="dce-sel-camera"></select>
      
      <select class="dce-sel-resolution"></select>
      

      Generally, you need to provide a resolution that the camera supports. However, in case a camera does not support the specified resolution, it usually uses the nearest supported resolution. As a result, the selected resolution may not be the actual resolution used. In this case, add an option with the class name dce-opt-gotResolution (as shown below) and the library will then use it to show the actual resolution.

      <select class="dce-sel-resolution">
          <option class="dce-opt-gotResolution" value="got"></option>
          <option data-width="1920" data-height="1080">1920x1080</option>
          <option data-width="1280" data-height="720">1280x720</option>
          <option data-width="640" data-height="480">640x480</option>
      </select>
      

API Documentation

You can check out the detailed documentation about the APIs of the library at https://www.dynamsoft.com/label-recognition/programming/javascript/api-reference/?ver=latest&utm_source=npm&product=dlr&package=js.

System Requirements

This library requires the following features which are supported by all modern mainstream browsers:

  • WebAssembly, Blob, URL/createObjectURL, Web Workers

    The above four features are required for the library to work.

  • MediaDevices/getUserMedia

    This API is only required for in-browser video streaming.

  • getSettings

    This API inspects the video input which is a MediaStreamTrack object about its constrainable properties.

The following table is a list of supported browsers based on the above requirements:

Browser NameVersion
Chromev61+1
Firefoxv52+ (v55+ on Android/iOS1)
Edge2v16+
Safari3v11+

1 iOS 14.3+ is required for camera video streaming in Chrome and Firefox or Apps using webviews.

2 On Edge, due to strict Same-origin policy, you must host the library files on the same domain as your web page.

3 Safari 11.2.2 ~ 11.2.6 are not supported.

Apart from the browsers, the operating systems may impose some limitations of their own that could restrict the use of the library. Browser compatibility ultimately depends on whether the browser on that particular operating system supports the features listed above.

Hosting the Library

Step 1: Deploy the dist folder

Once you have downloaded the library, you can locate the "dist" directory and copy it to your server (usually as part of your website / web application). The following shows some of the files in this directory:

  • dlr.js // The main library file
  • dlr.ui.html // Defines the default recognizer UI
  • dlr-<version>.worker.js // Defines the worker thread for text reading
  • dlr-<version>.wasm.js // The recognition engine (.js)
  • dlr-<version>.wasm // The recognition engine (.wasm)

NOTE: the files for Dynamsoft Camera Enhancer are often required as well and can be copied to the same location as the above "dist" directory.

Step 2: Configure the Server

  • Set the MIME type for .wasm as application/wasm and .data as application/octet-stream on your webserver.

    The goal is to configure your server to send the correct Content-Type header for the wasm file so that it is processed correctly by the browser.

    Different types of webservers are configured differently, for example:

  • Enable HTTPS

    To use the library, you must access your website / web application via a secure HTTPS connection. This is due to browser security restrictions which only grant camera video streaming access to a secure context.

    For convenience, self-signed certificates can be used during development and testing.

Step 3: Include the library from the server

Now that the library is hosted on your server, you can include it accordingly.

<script src="https://www.yourwebsite.com/dynamsoft-label-recognizer/dist/dlr.js"></script>
<script src="https://www.yourwebsite.com/dynamsoft-camera-enhancer/dist/dce.js"></script>

Optionally, you may also need to specify the location of the "engine" files.

FAQ

Can I open the web page directly from the hard drive?

Yes, for simple testing purposes, it's perfectly fine to open the file directly from the hard drive. However, you might encounter some issues in doing so (like unable to access the camera, etc.). The recommendation is to deploy this page to your web server and run it over HTTPS. If you don't have a ready-to-use web server but have a package manager like npm or yarn, you can set up a simple HTTP server in minutes. Check out http-server on npm.

Why can't I use my camera?

If you open the web page as file:/// or http:// , the camera may not work and you see the following error in the browser console:

[Deprecation] getUserMedia() no longer works on insecure origins. To use this feature, you should consider switching your application to a secure origin, such as HTTPS. See https://goo.gl/rStTGz for more details.

  • In Safari 12 the equivalent error is:

Trying to call "getUserMedia" from an insecure document.

You get this error because the API getUserMedia requires HTTPS to access the camera.

  • If you use Chrome or Firefox, you might not get the error because these two browsers allow camera access via file:/// and http://localhost.

To make sure your web application can access the camera, please configure your web server to support HTTPS. The following links may help.

  1. NGINX: Configuring HTTPS servers
  2. IIS: Create a Self Signed Certificate in IIS
  3. Tomcat: Setting Up SSL on Tomcat in 5 minutes
  4. Node.js: npm tls

Keywords

FAQs

Last updated on 14 Apr 2022

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts

SocketSocket SOC 2 Logo

Product

  • Package Alerts
  • Integrations
  • Docs
  • Pricing
  • FAQ
  • Roadmap

Stay in touch

Get open source security insights delivered straight into your inbox.


  • Terms
  • Privacy
  • Security

Made with ⚡️ by Socket Inc