Yet another library to extract text from MS Office and PDF files
docx parser
Javascript SDK for Sensible, the developer-first platform for extracting structured data from documents so that you can build document-automation features into your SaaS products
A NodeJS library to parse pdf, txt, doc and docx files to JSON and CSV
A simple library that converts .docx files to plain text in the browser
Extracts comments and other data from docx files
A NodeJS library to parse pdf, txt, doc and docx files to JSON and CSV
This npm package offers a straightforward method to extract text content from various binary and text file formats. The package comes with a pre-built configuration that works out-of-the-box, requiring no additional setup. It is designed for use in Browse
A dead simple docx parser.
Fork of office-text-extractor with unreleased changes that include browser support
Yet another library to extract text from MS Office and PDF files
A NodeJS library to parse pdf, txt, doc and docx files to JSON and CSV
Primitives for building and extending readers, including definitions for context, parsers, modes, commands, plugins, and more.
A node script which can fill DOCX placeholders and convert to PDFs
Javascript port of python-docx.
> **Note** > This repository is automatically generated from the [main parser monorepo](https://github.com/TrialAndErrorOrg/parsers). Please submit any issues or pull requests there.