Language detection library ported from Google's language-detection.
pysbd (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection library that works out-of-the-box across many languages.
A flexible, free, and unlimited Python tool for translating between languages in a simple way using multiple translators
Fully customizable language detection for spaCy pipeline
80x faster and 95% accurate language identification with Fasttext
langid.py is a standalone Language Identification (LangID) tool.
An accurate natural language detection library, suitable for short text and mixed-language text
Python bindings around Google Chromium's embedded compact language detection library (CLD2)
Quickly detect the language of a text and segment the text by language
fasttext with wheels and no external dependency, but only the predict method (<1MB)
OCR, layout, reading order, and table recognition in 90+ languages
Fork of the language identification tool langid.py, featuring a modernized codebase and faster execution times.
Multi-lingual Automatic Speech Recognition (ASR) based on Whisper models, with accurate word timestamps, access to language detection confidence, several options for Voice Activity Detection (VAD), and more.
Fully customizable language detection pipeline for spaCy
LLM-Guard is a comprehensive tool designed to fortify the security of Large Language Models (LLMs). By offering sanitization, detection of harmful language, prevention of data leakage, and resistance against prompt injection attacks, LLM-Guard ensures that your interactions with LLMs remain safe and secure.
A lightweight toolkit for multilingual lemmatization and language detection.
Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.
Translate, transliterate, and detect the language of texts in no time with the help of multiple APIs!
A Python API for checking for swear words in a YouTube video, SRT file, text file, or custom source, with multi-language support. Additional features include fetching a YouTube video's transcript, an SRT parser, and more.
Language detection for news powered by fasttext
CLI for DeepChopper: A Genomic Language Model for Chimera Artifact Detection
RefChecker provides an automatic checking pipeline for detecting fine-grained hallucinations generated by Large Language Models.
Detect the programming language of source code
A Python library for language detection and translation using langchain ChatModels
HuSpaCy: industrial strength Hungarian natural language processing
UQLM (Uncertainty Quantification for Language Models) is a Python package for UQ-based LLM hallucination detection.
Breame is a lightweight Python package with a number of tools to aid in the detection of words that have dual spellings and meanings in British and American English.
Detect language support for font binaries
MOMENT: A Family of Open Time Series Foundation Models
Language Detection API Client
Voice-Activated Natural Language UI
Kestrel Threat Hunting Language
Python bindings for the Lingua (LanguageDetect) Rust library
Google's langdetect modified for Chinese texts
Faster Python port of the language detection library built by Shuyo
A tool to detect subtitle languages in media files
Guess programming language from a string or file.
A Genomic Language Model for Chimera Artifact Detection in Nanopore Direct RNA Sequencing
A Python library for language detection and translation using OpenAI's GPT-4o.
Python bindings for whatlang using pyo3
Detect languages via a fasttext model
A library for detecting profanity from different languages.
SONATA: SOund and Narrative Advanced Transcription Assistant
A Python library for classifying toxic comments using deep learning. It supports detecting multiple types of toxicity including obscene language, threats, and identity hate.
Ultra-fast, Low Latency LLM security solution