NLP NN Framework
工程提取字段相关的一些定义以及一些常用函数封装
Annotator combining different NLP pipelines
Text utilities and datasets for PyTorch
NLP Sandbox Client Library for Python
h1 align="center">
MXNet GluonNLP Toolkit (DeepNumpy Version)
Caikit NLP
The goal of the Indic NLP Library is to build Python based libraries for common text processing and Natural Language Processing in Indian languages. This fork is specialized for IndicTrans2.
A collection of scripts for NLP-related command line tasks
A package to extract Triples in form of [predictate , object , subject] form text
NLP Feature Extractors
Python wrapper of lightning fast Finite State Machine based NLP library.
PyTorch version of Google AI BERT model with script to load Google pre-trained models
A powerful toolkit for automating NLP workflows.
Drug Named Entity Recognition library to find and resolve drug names in a string (drug named entity linking)
A Python wrapper for the NLPearl API
A Python package to parse structured information from recipe ingredient sentences
The SDK library and command-line interface to Geneea Interpretor, an NLP REST API.
tools for Natural Language Processing
Boilerplate code to wrap different libs for NLP tasks.
SeqIO: Task-based datasets, preprocessing, and evaluation for sequence models.
A web scraping library based on LangChain which uses LLM and direct graph logic to create scraping pipelines.
Utility for parsing Wikipedia SQL dumps into CSVs.
Python wrapper for HanLP: Han Language Processing
NLP tools for Russian language
Core NLP library for extracting and analysing emotion and group dynamics in a group setting.
Spello: Fast and Smart Spell Correction
Natural language processing tool for Turkish
NLP package
PolyFuzz performs fuzzy string matching, grouping, and evaluation.
Remove duplicates and near-duplicates from text corpora, no matter the scale.
分词工具
A Python Wrapper for VnCoreNLP
A Python wrapper for VnCoreNLP using a bidirectional communication channel.
Frequency vocabulary for NLP purposes
A robust NLP pipeline for stemming, lemmatization, and vectorization
A simple NLP library allows profiling datasets with one or more text columns.
NLP Annotation Helpers
A small and general nlp util collection for nlper
GiNZA, An Open Source Japanese NLP Library, based on Universal Dependencies