Language detection library ported from Google's language-detection.
pysbd (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection library that works out of the box across many languages.
A flexible, free, and unlimited Python tool for translating between languages in a simple way using multiple translators.
Fully customizable language detection for spaCy pipeline
80x faster and 95% accurate language identification with Fasttext
langid.py is a standalone Language Identification (LangID) tool.
An accurate natural language detection library, suitable for short text and mixed-language text
Python bindings around Google Chromium's embedded compact language detection library (CLD2)
Multi-lingual Automatic Speech Recognition (ASR) based on Whisper models, with accurate word timestamps, access to language detection confidence, several options for Voice Activity Detection (VAD), and more.
Quickly detect the language of a text and segment it by language
fasttext with wheels and no external dependencies, exposing only the predict method (<1MB)
OCR, layout, reading order, and table recognition in 90+ languages
Fork of the language identification tool langid.py, featuring a modernized codebase and faster execution times.
Fully customizable language detection pipeline for spaCy
LLM-Guard is a comprehensive tool designed to fortify the security of Large Language Models (LLMs). By offering sanitization, detection of harmful language, prevention of data leakage, and resistance against prompt injection attacks, LLM-Guard ensures that your interactions with LLMs remain safe and secure.
A lightweight toolkit for multilingual lemmatization and language detection.
Translate, transliterate, get the language of texts in no time with the help of multiple APIs!
Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.
Language detection for news powered by fasttext
CLI for DeepChopper: A Genomic Language Model for Chimera Artifact Detection
A Python API for checking for swear words in a YouTube video, SRT file, text file, or custom source, with multi-language support. Additional features include fetching the transcript of a YouTube video, an SRT parser, and more.
RefChecker provides an automatic checking pipeline for detecting fine-grained hallucinations generated by Large Language Models.
Detect the programming language of source code
HuSpaCy: industrial strength Hungarian natural language processing
Breame is a lightweight Python package with a number of tools to aid in the detection of words that have dual spellings and meanings in British and American English.
Python bindings for whatlang using pyo3
Detect language support for font binaries
MOMENT: A Family of Open Time Series Foundation Models
A Python library for language detection and translation using LangChain ChatModels
Language Detection API Client
Voice-Activated Natural Language UI
Kestrel Threat Hunting Language
Python bindings for the Lingua (LanguageDetect) Rust library
Google's langdetect modified for Chinese texts
Faster port of Language detection built by Shuyo in Python
A tool to detect subtitle languages in media files
Guess programming language from a string or file.
UQLM (Uncertainty Quantification for Language Models) is a Python package for UQ-based LLM hallucination detection.
A Genomic Language Model for Chimera Artifact Detection in Nanopore Direct RNA Sequencing
A Python library for detecting lexical borrowings (with a focus on anglicisms in Spanish)
A Python library for language detection and translation using OpenAI's GPT-4o.
Detect languages via a fasttext model
A library for detecting profanity from different languages.
SONATA: SOund and Narrative Advanced Transcription Assistant