Huge News!Announcing our $40M Series B led by Abstract Ventures.Learn More
Socket
Sign inDemoInstall
Socket

pdf-parser-client-side

Package Overview
Dependencies
Maintainers
1
Versions
12
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

pdf-parser-client-side

A lightweight easy to use package to parse text from PDF files on client side without any server dependency.

  • 1.1.1
  • latest
  • Source
  • npm
  • Socket score

Version published
Maintainers
1
Created
Source


PDF Parser Client Side

PDF Parser Client Side

A lightweight easy to use package to parse text from PDF files on client side without any server dependency.

How to Install ?

Use npm or yarn to install this npm package

npm i pdf-parser-client-side

or

yarn add pdf-parser-client-side

Include the package

import extractTextFromPDF from "pdf-parser-client-side";
variant Parameter

The variant parameter is used to specify the type of text extraction and replacement to be performed on the extractedText. Depending on the value of the variant parameter, different types of characters will be removed or retained.

variant ValueDescriptionRegular ExpressionRetained Characters
cleanRemoves all non-ASCII characters and any spaces that follow them.`/[^\x00-\x7F]+\ *(?:[^\x00-\x7F])*/g`
alphanumericRetains only alphanumeric characters (letters and numbers)./[^a-zA-Z0-9]+/gA-Z, a-z, 0-9
alphanumericwithspaceRetains alphanumeric characters and spaces./[^a-zA-Z0-9 ]+/gA-Z, a-z, 0-9, space
alphanumericwithspaceandpunctuationRetains alphanumeric characters, spaces, and basic punctuation marks (.,!?,)./[^a-zA-Z0-9 .,!?]+/gA-Z, a-z, 0-9, space, .,!?
alphanumericwithspaceandpunctuationandnewlineRetains alphanumeric characters, spaces, basic punctuation marks (.,!?), and newlines./[^a-zA-Z0-9 .,!?]+/gA-Z, a-z, 0-9, space, .,!?
Example Usage

Javascript

import React from "react";
import extractTextFromPDF from "pdf-parser-client-side";

export default function Test() {
  const handleFileChange = async (e, variant) => {
    const file = e.target.files?.[0];
    if (file) {
      try {
        const text = await extractTextFromPDF(file, variant);
        console.log("Extracted Text:", text);
      } catch (error) {
        console.error("Error extracting text from PDF:", error);
      }
    }
  };

  return (
    <div>
      <input
        type="file"
        name=""
        id="file-selector"
        accept=".pdf"
        onChange={(e) => handleFileChange(e, "clean")}
      />
    </div>
  );
}

Typescript

import React from "react";
import extractTextFromPDF, { Variant } from "pdf-parser-client-side";

export default function Test() {
  const handleFileChange = async (
    e: React.ChangeEvent<HTMLInputElement>,
    variant: Variant
  ) => {
    const file = e.target.files?.[0];
    if (file) {
      try {
        const text = await extractTextFromPDF(file, variant);
        console.log("Extracted Text:", text);
      } catch (error) {
        console.error("Error extracting text from PDF:", error);
      }
    }
  };

  return (
    <div>
      <input
        type="file"
        name=""
        id="file-selector"
        accept=".pdf"
        onChange={(e) => handleFileChange(e, "clean")}
      />
    </div>
  );
}

Contributing

Feel free to contribute!

  1. Fork the repository
  2. Make changes
  3. Submit a pull request

</> with 💛 by Vishwa Gaurav

Keywords

FAQs

Package last updated on 16 Jun 2024

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts

SocketSocket SOC 2 Logo

Product

  • Package Alerts
  • Integrations
  • Docs
  • Pricing
  • FAQ
  • Roadmap
  • Changelog

Packages

npm

Stay in touch

Get open source security insights delivered straight into your inbox.


  • Terms
  • Privacy
  • Security

Made with ⚡️ by Socket Inc