Language detection library ported from Google's language-detection.
pysbd (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection library that works out of the box across many languages.
A flexible, free, and unlimited Python tool for translating between languages in a simple way using multiple translators
Fully customizable language detection for the spaCy pipeline
80x faster, 95%-accurate language identification with fastText
langid.py is a standalone Language Identification (LangID) tool.
Python bindings around Google Chromium's embedded compact language detection library (CLD2)
An accurate natural language detection library, suitable for short text and mixed-language text
Quickly detect the language of a text and segment the text by language
fastText with wheels and no external dependencies, but only the predict method (<1MB)
Fork of the language identification tool langid.py, featuring a modernized codebase and faster execution times.
OCR, layout, reading order, and table recognition in 90+ languages
Multi-lingual Automatic Speech Recognition (ASR) based on Whisper models, with accurate word timestamps, access to language detection confidence, several options for Voice Activity Detection (VAD), and more.
LLM-Guard is a comprehensive tool designed to fortify the security of Large Language Models (LLMs). By offering sanitization, detection of harmful language, prevention of data leakage, and resistance against prompt injection attacks, LLM-Guard ensures that your interactions with LLMs remain safe and secure.
Fully customizable language detection pipeline for spaCy
A lightweight toolkit for multilingual lemmatization and language detection.
Detect language support for font binaries
Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.
Translate, transliterate, and get the language of texts in no time with the help of multiple APIs!
RefChecker provides an automatic checking pipeline for detecting fine-grained hallucinations generated by Large Language Models.
CLI for DeepChopper: A Genomic Language Model for Chimera Artifact Detection
A Python API for checking for swear words in a YouTube video, SRT file, text file, or custom source, with multi-language support. Additional features include fetching a video's YouTube transcript and an SRT parser.
Python bindings for whatlang using pyo3
A Python library for language detection and translation using langchain ChatModels
Detect the programming language of source code
HuSpaCy: industrial strength Hungarian natural language processing
Language detection for news powered by fasttext
Breame is a lightweight Python package with a number of tools to aid in the detection of words that have dual spellings and meanings in British and American English.
Language Detection API Client
Kestrel Threat Hunting Language
UQLM (Uncertainty Quantification for Language Models) is a Python package for UQ-based LLM hallucination detection.
A Python library for language detection and translation using OpenAI's GPT-4o.
A faster Python port of the language detection library built by Shuyo
Claim Processor provides an automatic checking pipeline for detecting fine-grained hallucinations generated by Large Language Models.
MOMENT: A Family of Open Time Series Foundation Models
Python bindings for the Lingua (LanguageDetect) Rust library
A Genomic Language Model for Chimera Artifact Detection in Nanopore Direct RNA Sequencing
Google's langdetect modified for Chinese texts
A Python library for detecting lexical borrowings (with a focus on anglicisms in the Spanish language)
Detect languages via a fasttext model
Library that combines Robotics Hardware, iPhone and AI for Everyone
A library for detecting profanity from different languages.
Voice-Activated Natural Language UI
Ultra-fast, Low Latency LLM security solution
Guess programming language from a string or file.
Python script designed to clone Git repositories, detect programming languages, validate the code, and log the results to an Excel file.