
Security News
Security Community Slams MIT-linked Report Claiming AI Powers 80% of Ransomware
Experts push back on new claims about AI-driven ransomware, warning that hype and sponsored research are distorting how the threat is understood.
tesseract.js-core
Advanced tools

Core part of tesseract.js, which compiles original tesseract from C to JavaScript WebAssembly.
To build tesseract-core.js by yourself, please install docker and run:
bash build-with-docker.sh
The generated files will be stored in root path. When compiling, errors sometimes occur due to race conditions (some dependencies do not appear to compile properly in parallel). Re-running generally resolves.
build-scripts folderjavascript folderthird_party folder
CMakeLists.txt to build with emscriptenltrresultiterator.h and ltrresultiterator.cpp to add WordChoiceIterator classsrc/arch_sse folder, which is used instead of src/arch for the simd-enabled build
src/textord/colfind.cpp to prevent this from printing to consolesrc/ccmain/thresholder.cpp, src/ccmain/thresholder.h, src/api/baseapi.cpp, and include/tesseract/baseapi.h to add exif and angle arguments for rotating imagesFindLines from "protected" to "public" in baseapi.h to expose to Javascript
GetGradient function to baseapi.h and baseapi.cpp for reporting page angle
src/ccmain/tesseractclass.h, src/ccmain/pagesegmain.cpp, src/textord/textord.cpp, and src/textord/textord.h
WriteImage function to baseapi.h and baseapi.cpp for saving images (original, grey, and binary)SaveParameters and RestoreParameters functions to baseapi.h and baseapi.cpp for saving and restoring parametersEM_ASM_ARGS to src/ccmain/control.cpp for progress logging (and added <emscripten.h> header)tprintf function in src/ccutil/tprintf.cpp to force flushingSetImage to src/api/baseapi.cpp and include/tesseract/baseapi.h that reads image from filesystem
ParamUtils::PrintParams in src/ccutil/params.cpp to remove description text (resolves bug)
src/ccmain/tessedit.cpp to save error log to separate file (/debugDev.txt)src/api/jsonrenderer.cpp, modified CMakeLists.txt, include/tesseract/baseapi.h, and include/tesseract/renderer.hTo run the browser examples, launch a web server in the root of the repo (i.e. run http-server).  Then navigate to the pages in examples/web/minimal/ in your browser.
To run the node examples, navigate to examples/node/minimal/ and then run e.g. node index.wasm.js [input_file].
The "benchmark" examples behave similarly, except that they take longer to run and report runtime instead of recognition text. All other examples are experimental and should not be expected to run.
As we leverage git-submodule to manage dependencies, remember to add recursive when cloning the repository:
git clone --recursive https://github.com/naptha/tesseract.js-core
ocrad.js is a JavaScript port of the OCRAD OCR engine. It is a pure JavaScript library that can be used in the browser or in Node.js. Compared to tesseract.js-core, ocrad.js is simpler and may be easier to integrate for basic OCR tasks, but it may not be as powerful or accurate as Tesseract.
node-tesseract-ocr is a Node.js wrapper for the Tesseract OCR engine. It provides a simple interface for performing OCR on images. While it offers similar functionalities to tesseract.js-core, it is specifically designed for Node.js and may not be suitable for browser environments.
FAQs
Tesseract C++ API in Pure Javascript
The npm package tesseract.js-core receives a total of 334,363 weekly downloads. As such, tesseract.js-core popularity was classified as popular.
We found that tesseract.js-core demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 4 open source maintainers collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Security News
Experts push back on new claims about AI-driven ransomware, warning that hype and sponsored research are distorting how the threat is understood.

Security News
Ruby's creator Matz assumes control of RubyGems and Bundler repositories while former maintainers agree to step back and transfer all rights to end the dispute.

Research
/Security News
Socket researchers found 10 typosquatted npm packages that auto-run on install, show fake CAPTCHAs, fingerprint by IP, and deploy a credential stealer.