Yet another library to extract text from MS Office and PDF files
docx parser
Javascript SDK for Sensible, the developer-first platform for extracting structured data from documents so that you can build document-automation features into your SaaS products
A simple library that converts .docx files to plain text in the browser
Convert documents to markdown text content. Originally inspired by microsoft's markitdown python library.
This npm package offers a straightforward method to extract text content from various binary and text file formats. The package comes with a pre-built configuration that works out-of-the-box, requiring no additional setup. It is designed for use in Browse
A NodeJS library to parse pdf, txt, doc and docx files to JSON and CSV
A node script which can fill DOCX placeholders and convert to PDFs
A NodeJS library to parse pdf, txt, doc and docx files to JSON and CSV
A lightweight library to parse .docx files in Cloudflare Workers
Extracts comments and other data from docx files
A NodeJS library to parse pdf, txt, doc and docx files to JSON and CSV
A dead simple docx parser.
Docx parser for JavaScript/TypeScript
The Structured Parser JS/TS SDK allows developers to easily integrate Structured Parser's advanced structured data extraction capabilities from unstructured documents such as PDF, DOCX, XLSX.
Fork of office-text-extractor with unreleased changes that include browser support
Yet another library to extract text from MS Office and PDF files
Primitives for building and extending readers, including definitions for context, parsers, modes, commands, plugins, and more.
Javascript port of python-docx.
> **Note** > This repository is automatically generated from the [main parser monorepo](https://github.com/TrialAndErrorOrg/parsers). Please submit any issues or pull requests there.
A Text extracting package docx, pdf and pptx files