Yet another library to extract text from MS Office and PDF files
JavaScript SDK for Sensible, the developer-first platform for extracting structured data from documents, so you can build document-automation features into your SaaS products
docx parser
A simple library that converts .docx files to plain text in the browser
A NodeJS library to parse pdf, txt, doc and docx files to JSON and CSV
This npm package offers a straightforward method to extract text content from various binary and text file formats. The package comes with a pre-built configuration that works out-of-the-box, requiring no additional setup. It is designed for use in Browse
Extracts comments and other data from docx files
Fork of office-text-extractor with unreleased changes that include browser support
The Structured Parser JS/TS SDK lets developers integrate Structured Parser's structured data extraction for unstructured documents such as PDF, DOCX, and XLSX.
A NodeJS library to parse pdf, txt, doc and docx files to JSON and CSV
Yet another library to extract text from MS Office and PDF files
A dead simple docx parser.
A Node script that fills DOCX placeholders and converts the result to PDF
Primitives for building and extending readers, including definitions for context, parsers, modes, commands, plugins, and more.
A text-extraction package for docx, pdf and pptx files
A NodeJS library to parse pdf, txt, doc and docx files to JSON and CSV
> **Note** > This repository is automatically generated from the [main parser monorepo](https://github.com/TrialAndErrorOrg/parsers). Please submit any issues or pull requests there.
JavaScript port of python-docx.