Basic tools for working with natural language text data
Supplementary data about languages used by the langcodes module
This project is a Python version of the language-tags JavaScript project.
Google Cloud Language API client library
Python Library for Tom's Obvious, Minimal Language
Google AI Generative Language API client library
Client library to download and publish models, datasets and other repos on the huggingface.co hub
A framework for managing and maintaining multi-language pre-commit hooks.
This package provides 29 stemmers for 28 languages generated from Snowball algorithms.
Checks grammar using LanguageTool.
Natural Language Toolkit
Multi-Language Server WebSocket proxy for Jupyter Notebook/Lab server
A language and compiler for custom Deep Learning operations
Industrial-strength Natural Language Processing (NLP) in Python
Tools for labeling human languages with IETF language tags
Python 3.6 and 3.7 language support for the CloudFormation CLI
TypeScript language support for the CloudFormation CLI
Binary Python wheels for all tree-sitter languages.
An accurate natural language detection library, suitable for short text and mixed-language text
RDFLib is a Python library for working with RDF, a simple yet powerful language for representing information.
Microsoft Azure Question Answering Client Library for Python
Microsoft Azure Conversational Language Understanding Client Library for Python
A library that normalises language codes
Language detection library ported from Google's language-detection.
John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant, and accurate NLP annotations for machine learning pipelines that scale easily in a distributed environment.
A domain-specific language for modeling convex optimization problems in Python.
Hassle-free computation of shareable, comparable, and reproducible BLEU, chrF, and TER scores
Mustache templating language renderer
Lightweight static analysis for many languages. Find bug variants with patterns that look like source code.
A language server for Jedi!
List of pre-commit hooks meant to format your source code.
Stone is an interface description language (IDL) for APIs.
Tools for language models
An abstraction layer for distributed computation
Open reproduction of contrastive language-image pretraining (CLIP) and related.
Python Language Server for the Language Server Protocol
ISO 639 language codes, names, and other associated information
Fully customizable language detection for spaCy pipeline
Simple inference for large language models
pysbd (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection module that works out-of-the-box across many languages.
Python package and command-line tool designed to gather text on the Web. It includes all necessary discovery and text-processing components to perform web crawling, downloads, scraping, and extraction of main texts, metadata, and comments.
80x faster and 95% accurate language identification with fastText
Data loaders and abstractions for text and NLP
Clean, filter and sample URLs to optimize data collection – includes spam, content type and language filters.
Parse Accept-Language HTTP header
Train transformer language models with reinforcement learning.