Natural Language Toolkit
An accurate natural language detection library, suitable for short text and mixed-language text
Python package and command-line tool designed to gather text on the Web. It includes all the discovery and text processing components needed for web crawling, downloads, scraping, and extraction of main texts, metadata and comments.
Module for automatic summarization of text documents and HTML pages.
Open-source tool for exploring, labeling, and monitoring data for NLP projects.
Textile processing for Python.
Thai Natural Language Processing library
Microsoft Azure Text Analytics Client Library for Python
Extract quantities from unstructured text.
Natural language processing augmentation library for deep neural networks
Pyap is an MIT-licensed text processing library, written in Python, for detecting and parsing addresses. Currently it supports US, Canadian and British addresses.
Python package for Korean natural language processing.
Identification and conversion functions for Chinese text processing
Functions to preprocess and normalize text.
NeMo text processing for ASR and TTS
Generalist model for NER (extracts any entity type from text)
Wrappers for several pre-processing scripts from the Moses toolkit.
A base class for wrapping text-processing tools
NLP, before and after spaCy
Python library for processing Chinese text
STAM is a library for dealing with standoff annotations on text; this is the Python binding.
Natural Language Processing (NLP) library for the Urdu language.
Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.
Pre-processing package for text strings
A command to manage a header section for a source code tree
A text summarization and keyword extraction package based on TextRank
Nonsense String Evaluator
The goal of the Indic NLP Library is to build Python-based libraries for common text processing and Natural Language Processing in Indian languages.
Text2Text: Crosslingual NLP/G toolkit
Utilities for automatic processing of document images
A library for augmenting text for natural language processing applications.
A Python library for a _FULL_ Zalgo experience
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as field extraction, tokenization, prompting, batching, and more. Supports datasets from Hugging Face, torchdata iterables, or simple lists of dictionaries.
A text-to-intent parsing framework.
Unsupervised Korean Natural Language Processing Toolkits
A library for calculating a variety of features from text using spaCy
Short Text Mining
Text processing with pandas DataFrames.
Natural language processing support for Pandas dataframes.
Phrase Tree from Natural Language Toolkit
A module that helps process text data
Python bindings for MeTA
Analiticcl is an approximate string matching (fuzzy-matching) system that can be used to find variants for spelling correction or text normalisation
A neural network intent parser