Determine the East Asian Width of a Unicode character
Markdown parser, done right. 100% Commonmark support, extensions, syntax plugins, high speed - all in one.
Elegant console output, borrowed from Yarn
Javascript markdown parsing, made simple
Teams Toolkit CLI is a text-based command line interface that can help scaffold, validate, and deploy applications for Microsoft Teams from the terminal or a CI/CD process.
Webpack plugin to use in addition to [extract-text-webpack-plugin](https://github.com/webpack/extract-text-webpack-plugin) to create a second css bundle, processed to be rtl.
MCP server for terminal operations and file editing
TeamsFx CLI a text-based command line interface that can help scaffold, validate, and deploy applications for Microsoft Teams from the terminal or a CI/CD process.
DevExpress Rich Text Editor is an advanced word-processing tool designed for working with rich text documents.
The Retrieval-Augmented Generation (RAG) module contains document processing and embedding utilities.
Helps to prevent widow words in a text
Parsing Library for Typescript and Javascript.
Count the number of OpenAI tokens in a string. Supports all OpenAI Text models (text-davinci-003, gpt-3.5-turbo, gpt-4)
[![github actions][actions-image]][actions-url] [![coverage][codecov-image]][codecov-url] [![dependency status][deps-svg]][deps-url] [![dev dependency status][dev-deps-svg]][dev-deps-url] [![License][license-image]][license-url] [![Downloads][downloads-im
Plugin for Remarkable to process embedded math expressions in Markdown text.
Util collection for Japanese text processing. Hiraganize, Katakanize, and Romanize.
πͺ chunk/split a string by length without cutting/truncating words.
Basic library to roughly determine the language of input text
Javascript SDK for Sensible, the developer-first platform for extracting structured data from documents so that you can build document-automation features into your SaaS products
Core engine to convert extended MDAST to DOCX. Supports plugins for footnotes, images, lists, tables, and more. Designed for seamless Markdown-to-DOCX conversion.
Chinese word segmentation η°‘ηΉδΈζεθ―樑ε δ»₯ηΆ²θ·―ε°θͺͺηΊζ¨£ζ¬
Node PDF is a set of tools that takes in PDF files and converts them to usable formats for data processing. The library supports both extracting text from searchable pdf files as well as performing OCR on pdfs which are just scanned images of text
εη node-segment ηζ ΌεΌ
Extended MDAST types and custom node data for mdast2docx with support for DOCX formatting.
MDAST to DOCX plugin for resolving and embedding images. Supports base64, URLs, and custom resolvers for seamless DOCX image integration.
Configurable BM25 Text Search Engine with simple semantic search support
Convert Markdown Abstract Syntax Tree (MDAST) to DOCX seamlessly. Supports footnotes, images, links, and customizable document properties.
Plugin to convert ordered and unordered lists from Markdown (MDAST) to DOCX. Supports nesting, custom bullets, and numbering styles.