New Case Study:See how Anthropic automated 95% of dependency reviews with Socket.Learn More →

pdf-parser-client-side

Package Overview

Dependencies

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

pdf-parser-client-side

A lightweight easy to use package to parse text from PDF files on client side without any server dependency.

1.0.3
Source
npm

Version published: 9 months ago

Weekly downloads: 358; increased by59.82%

Maintainers: 1

Weekly downloads

Created: last year

Source

PDF Parser Client Side

A lightweight easy to use package to parse text from PDF files on client side without any server dependency.

How to Install ?

Use npm or yarn to install this npm package

npm i pdf-parser-client-side

yarn add pdf-parser-client-side

Include the package

import extractTextFromPDF from "pdf-parser-client-side";

Basic Example:

import React from "react";
import extractTextFromPDF from "pdf-parser-client-side";

export default function Test() {
  return (
    <div>
      <input
        type="file"
        name=""
        id="file-selector"
        accept=".pdf"
        onChange={(e) => {
          // Selecting the first file
          const file = e.target.files[0];
          //   If file exists then we will call our function
          if (file) {
            extractTextFromPDF(file).then((data) => {
              console.log(data);
            });
          }
        }}
      />
    </div>
  );
}

`variant` Parameter

The variant parameter is used to specify the type of text extraction and replacement to be performed on the extractedText. Depending on the value of the variant parameter, different types of characters will be removed or retained.

`variant` Value	Description	Regular Expression	Retained Characters
`clean`	Removes all non-ASCII characters and any spaces that follow them.	`/[^\x00-\x7F]+\ *(?:[^\x00-\x7F]	)*/g`
`alphanumeric`	Retains only alphanumeric characters (letters and numbers).	`/[^a-zA-Z0-9]+/g`	A-Z, a-z, 0-9
`alphanumericwithspace`	Retains alphanumeric characters and spaces.	`/[^a-zA-Z0-9 ]+/g`	A-Z, a-z, 0-9, space
`alphanumericwithspaceandpunctuation`	Retains alphanumeric characters, spaces, and basic punctuation marks (.,!?,).	`/[^a-zA-Z0-9 .,!?]+/g`	A-Z, a-z, 0-9, space, .,!?
`alphanumericwithspaceandpunctuationandnewline`	Retains alphanumeric characters, spaces, basic punctuation marks (.,!?), and newlines.	`/[^a-zA-Z0-9 .,!?]+/g`	A-Z, a-z, 0-9, space, .,!?

Example Usage

import React from "react";
import extractTextFromPDF from "pdf-parser-client-side";

let extractedText = "Example text with special characters: !@#$%^&*()_+";

export default function Test() {
  return (
    <div>
      <input
        type="file"
        name=""
        id="file-selector"
        accept=".pdf"
        onChange={(e) => {
          // Selecting the first file
          const file = e.target.files[0];
          //   If file exists then we will call our function
          if (file) {
            extractTextFromPDF(file, "clean").then((data) => {
              console.log(data);
            });
          }
        }}
      />
    </div>
  );
}

Contributing

Feel free to contribute!

Fork the repository
Make changes
Submit a pull request

</> with 💛 by Vishwa Gaurav

Keywords

FAQs

What is pdf-parser-client-side?

Is pdf-parser-client-side popular?

Is pdf-parser-client-side well maintained?

Package last updated on 15 Jun 2024

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install