Text Processing

get-east-asian-width

Determine the East Asian Width of a Unicode character

east-asian-width

sindresorhus

published 1.6.0 • 2 months ago

vfile

Virtual file format for text processing

published 6.0.3 • 2 years ago

xml-js

A convertor between XML text and Javascript object / JSON text.

published 1.6.11 • 7 years ago

re2

Bindings for RE2: fast, safe alternative to backtracking regular expression engines.

text processing

PCRE alternative

published 1.26.0 • 20 hours ago

retext-stringify

retext plugin to serialize prose

published 4.0.0 • 3 years ago

@promptbook/utils

Promptbook: Create persistent AI agents that turn your company's scattered knowledge into action

ai-application-framework

published 0.113.0-7 • 3 days ago

remarkable

Markdown parser, done right. 100% Commonmark support, extensions, syntax plugins, high speed - all in one.

published 2.0.1 • 6 years ago

to-vfile

vfile utility to read and write to the file system

published 8.0.0 • 3 years ago

gpt-tokenizer

A pure JavaScript implementation of a BPE tokenizer (Encoder/Decoder) for GPT-2 / GPT-3 / GPT-4 and other OpenAI models

published 3.4.0 • 8 months ago

yurnalist

Elegant console output, borrowed from Yarn

thijskoerselman

published 2.1.0 • 6 years ago

url-pattern

easier than regex string matching patterns for urls and other strings. turn strings into data or data into strings.

snd

published 1.0.3 • 10 years ago

simple-markdown

Javascript markdown parsing, made simple

ariabuckles

published 0.7.3 • 6 years ago

stopword

A module for node.js and the browser that takes in text and returns text that is stripped of stopwords. Has pre-defined stopword lists for 62 languages and also takes lists with custom stopwords as input.

document-processing

published 3.1.5 • last year

markdown

A sensible Markdown parser for javascript

text processing

ashb

published 0.5.0 • 13 years ago

@mastra/rag

retrieval-augmented-generation

published 2.4.0 • last week

rtlcss-webpack-plugin

Webpack plugin to use in addition to [extract-text-webpack-plugin](https://github.com/webpack/extract-text-webpack-plugin) to create a second css bundle, processed to be rtl.

wix-ci-publisher

published 4.0.7 • 4 years ago

es-hangul

text processing

published 2.3.8 • 9 months ago

devexpress-richedit

DevExpress Rich Text Editor is an advanced word-processing tool designed for working with rich text documents.

devexpress-npm-publisher

published 26.1.3 • 3 weeks ago

@microsoft/teamsfx-cli

TeamsFx CLI is a text-based command line interface that can help scaffold, validate, and deploy applications for Microsoft Teams from the terminal or a CI/CD process.

published 2.1.2 • 2 years ago

pdf-parse-new

Pure javascript cross-platform module to extract text from PDFs with AI-powered optimization and multi-core processing.

pdf-text-extract

simone.gosetto

published 2.1.0 • 2 months ago

string-remove-widows

Helps to prevent widow words in a text

published 4.1.3 • 6 months ago

openai-gpt-token-counter

Count the number of OpenAI tokens in a string. Supports all OpenAI Text models including GPT-5, GPT-4, GPT-3.5-turbo, and specialized models

codergautam

published 1.1.2 • 9 months ago

@wonderwhy-er/desktop-commander

MCP server for terminal operations and file editing

model-context-protocol

wonderwhy-er

published 0.2.43 • 2 weeks ago

@m2d/core

Core engine to convert extended MDAST to DOCX. Supports plugins for footnotes, images, lists, tables, and more. Designed for seamless Markdown-to-DOCX conversion.

docx-for-generative-ai

markdown-to-docx

published 1.7.1 • 8 months ago

@m2d/mdast

Extended MDAST types and custom node data for mdast2docx with support for DOCX formatting.

mdast extensions

unist custom nodes

published 0.2.4 • last year

@m2d/list

Plugin to convert ordered and unordered lists from Markdown (MDAST) to DOCX. Supports nesting, custom bullets, and numbering styles.

markdown-to-docx

published 0.0.9 • 9 months ago

xml-js-graphite

A convertor between XML text and Javascript object / JSON text. Forked to add Graphite specific features.

published 1.7.1 • 3 years ago

clarity-pattern-parser

Parsing Library for Typescript and Javascript.

pattern-matching

jaredjbarnes

published 11.7.6 • 20 hours ago

@m2d/table

Plugin to convert Markdown tables (MDAST) to DOCX with support for rich formatting and seamless integration into mdast2docx.

markdown-tables

markdown-to-docx

published 0.1.1 • 10 months ago

@m2d/image

MDAST to DOCX plugin for resolving and embedding images. Supports base64, URLs, and custom resolvers for seamless DOCX image integration.

image-embedding

published 1.4.1 • 10 months ago

@m2d/html

Extend MDAST by parsing embedded HTML in Markdown. Converts HTML into structured MDAST nodes compatible with @m2d/core for DOCX generation.

html-in-markdown

published 1.1.11 • 8 months ago

mdast2docx

Convert Markdown Abstract Syntax Tree (MDAST) to DOCX seamlessly. Supports footnotes, images, links, and customizable document properties.

markdown-to-docx

published 1.6.1 • 8 months ago

@m2d/math

Plugin to convert mathematical expressions in Markdown (MDAST) to DOCX using LaTeX-style syntax. Integrates seamlessly with mdast2docx.

math expressions

markdown-to-docx

published 0.0.6 • last year

@m2d/emoji

A plugin for @m2d/core that parses emoji shortcodes like :smile: and replaces them with their corresponding Unicode emoji characters for DOCX output.

text-processing

published 0.1.3 • last year

gt-remark

Remark plugin for processing MDX/Markdown by escaping HTML-sensitive characters in text nodes.

remark-stringify

published 1.0.11 • 5 days ago

@dramaorg/delectus-culpa-reprehenderit

regular expressions

vanthuanbt26

published 3.5.94 • 2 years ago

@promptbook/core

Promptbook: Create persistent AI agents that turn your company's scattered knowledge into action

ai-application-framework

published 0.113.0-7 • 3 days ago

wink-bm25-text-search

Configurable BM25 Text Search Engine with simple semantic search support

In Memory Search

Semantic Search

published 3.1.2 • 4 years ago

@promptbook/browser

Promptbook: Create persistent AI agents that turn your company's scattered knowledge into action

ai-application-framework

published 0.113.0-7 • 3 days ago

remarkable-katex

Plugin for Remarkable to process embedded math expressions in Markdown text.

published 1.2.1 • 5 years ago

@promptbook/remote-client

Promptbook: Create persistent AI agents that turn your company's scattered knowledge into action

ai-application-framework

published 0.113.0-7 • 3 days ago

@promptbook/vercel

Promptbook: Create persistent AI agents that turn your company's scattered knowledge into action

ai-application-framework

published 0.113.0-7 • 3 days ago

fasttext.wasm.js

Node and Browser env supported WebAssembly version of fastText: Library for efficient text classification and representation learning.

published 1.0.0 • 2 years ago

@promptbook/node

Promptbook: Create persistent AI agents that turn your company's scattered knowledge into action

ai-application-framework

published 0.113.0-7 • 3 days ago

@promptbook/anthropic-claude

Promptbook: Create persistent AI agents that turn your company's scattered knowledge into action

ai-application-framework

published 0.113.0-7 • 3 days ago

@promptbook/types

Promptbook: Create persistent AI agents that turn your company's scattered knowledge into action

ai-application-framework

published 0.113.0-7 • 3 days ago

@nutrient-sdk/document-authoring

A web SDK for word processing and rich text capabilities.

word processing

rich text editor

sasha_nutrient

published 1.17.0 • 2 days ago

clientside-search

A highly efficient, isomorphic, full-featured, multilingual text search engine library, providing full-text search, fuzzy matching, phonetic scoring, document indexing and more, with micro JSON state hydration/dehydration in-browser and server-side.

full-text-search

text-processing

document-indexing

kyr0

published 1.8.1 • 3 years ago

chunk-text

🔪 chunk/split a string by length without cutting/truncating words.

published 2.0.1 • 6 years ago

@coffeeandfun/remove-pii

A Node.js module to remove personally identifiable information (PII) from text.

text-processing

data-protection

published 3.0.1 • 3 months ago

Made with ⚡️ by Socket Inc

U.S. Patent No. 12,346,443 & 12,314,394. Other pending.