An abstraction layer for distributed computation
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML.
Python bindings for Jsonnet - The data templating language
Clean, filter and sample URLs to optimize data collection – includes spam, content type and language filters.
Python Language Server for the Language Server Protocol
pysbd (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box across many languages.
Stone is an interface description language (IDL) for APIs.
John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant & accurate NLP annotations for machine learning pipelines, that scale easily in a distributed environment.
Python types for Language Server Protocol.
AWS Access Policy Language creation library
A flexible free and unlimited python tool to translate between different languages in a simple way using multiple translators
A pythonic generic language server (pronounced like 'pie glass')
Thai Natural Language Processing library
A python API for evaluating language support in the Google Fonts collection.
Parse Accept-Language HTTP header
Typing stubs for tree-sitter-languages
Simple inference for large language models
Tools for language models
An open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
langid.py is a standalone Language Identification (LangID) tool.
A framework for evaluating language models
📧 Email reply parser library for Python with multi-language support
Python bindings around Google Chromium's embedded compact language detection library (CLD2)
Pure python spell checker based on work by Peter Norvig
80x faster and 95% accurate language identification with Fasttext
Python interface to Graphviz's Dot language
Microsoft Azure Cognitive Services LUIS Client Library for Python
A Python NLP Library for Many Human Languages, by the Stanford NLP Group
Full Python ROUGE Score Implementation (not a wrapper)
Qwen Vision Language Model Utils - PyTorch
Look up the frequencies of words in many languages, based on many sources of data.
LLM framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data.
A Python engine for the Liquid template language.
Microsoft Azure Text Analytics Client Library for Python
Python interface to the R language (embedded R)
parse numbers written in natural language
Data loaders and abstractions for text and NLP
Zig is a general-purpose programming language and toolchain for maintaining robust, optimal, and reusable software.
The agile query language for semi-structured data. #JSON
Docspec is a JSON object specification for representing API documentation of programming languages.
A lightweight, optionally typed expression language with a custom grammar for matching arbitrary Python objects.
A framework for creating, editing, and invoking Noisy Intermediate Scale Quantum (NISQ) circuits.
SGLang is yet another fast serving framework for large language models and vision language models.
A Language Server Protocol implementation for Ruff.
DjangoQL: Advanced search language for Django
Get ISO code for a given language