Determine the East Asian Width of a Unicode character
Markdown parser, done right. 100% Commonmark support, extensions, syntax plugins, high speed - all in one.
Elegant console output, borrowed from Yarn
Javascript markdown parsing, made simple
Webpack plugin to use in addition to [extract-text-webpack-plugin](https://github.com/webpack/extract-text-webpack-plugin) to create a second css bundle, processed to be rtl.
Teams Toolkit CLI is a text-based command line interface that can help scaffold, validate, and deploy applications for Microsoft Teams from the terminal or a CI/CD process.
DevExpress Rich Text Editor is an advanced word-processing tool designed for working with rich text documents.
Helps to prevent widow words in a text
TeamsFx CLI a text-based command line interface that can help scaffold, validate, and deploy applications for Microsoft Teams from the terminal or a CI/CD process.
Count the number of OpenAI tokens in a string. Supports all OpenAI Text models (text-davinci-003, gpt-3.5-turbo, gpt-4)
Anonymize-NLP is a lightweight and robust package for text anonymization. It uses Natural Language Processing (NLP) and Regular Expressions (Regex) to identify and mask sensitive information in a string.,
Plugin for Remarkable to process embedded math expressions in Markdown text.
Util collection for Japanese text processing. Hiraganize, Katakanize, and Romanize.
🔪 chunk/split a string by length without cutting/truncating words.
Basic library to roughly determine the language of input text
Chinese word segmentation 簡繁中文分词模块 以網路小說為樣本
Provides natural language understanding/processing to enable easy implementation of chat bots and voice services. High performance run time in only 2 lines of code - 'require' to include it, and the call to process the text. These can run anywhere Node.js
原版 node-segment 的格式
Javascript SDK for Sensible, the developer-first platform for extracting structured data from documents so that you can build document-automation features into your SaaS products
Node PDF is a set of tools that takes in PDF files and converts them to usable formats for data processing. The library supports both extracting text from searchable pdf files as well as performing OCR on pdfs which are just scanned images of text
Configurable BM25 Text Search Engine with simple semantic search support
Naive Bayes Text Classifier
Make your app understand language. Summarize conversations, categorize articles, and more.
Multi Languages Detection for Text-Mining and Natural Language Processing - True ITK - Open Source
Semantically create chunks from large texts. Useful for workflows involving large language models (LLMs).
Fast, easy-to-use AI text embeddings, optimized for serverless functions.