Language detection library ported from Google's language-detection.
pysbd (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box across many languages.
80x faster and 95% accurate language identification with Fasttext
A flexible free and unlimited python tool to translate between different languages in a simple way using multiple translators
Fork of the language identification tool langid.py, featuring a modernized codebase and faster execution times.
langid.py is a standalone Language Identification (LangID) tool.
An accurate natural language detection library, suitable for short text and mixed-language text
Python bindings around Google Chromium's embedded compact language detection library (CLD2)
Fully customizable language detection for spaCy pipeline
OCR, layout, reading order, and table recognition in 90+ languages
Multi-lingual Automatic Speech Recognition (ASR) based on Whisper models, with accurate word timestamps, access to language detection confidence, several options for Voice Activity Detection (VAD), and more.
LLM-Guard is a comprehensive tool designed to fortify the security of Large Language Models (LLMs). By offering sanitization, detection of harmful language, prevention of data leakage, and resistance against prompt injection attacks, LLM-Guard ensures that your interactions with LLMs remain safe and secure.
Fully customizable language detection pipeline for spaCy
Quickly detect text language and segment language
CLI for DeepChopper: A Genomic Language Model for Chimera Artifact Detection
fasttext with wheels and no external dependency, but only the predict method (<1MB)
A lightweight toolkit for multilingual lemmatization and language detection.
Translate, transliterate, get the language of texts in no time with the help of multiple APIs!
A Python library for language detection and translation using OpenAI's GPT-4o.
HuSpaCy: industrial strength Hungarian natural language processing
Detect language support for font binaries
Language Detection API Client
A Genomic Language Model for Chimera Artifact Detection in Nanopore Direct RNA Sequencing
Python bindings for the Lingua(LanguageDetect) Rust library
Kestrel Threat Hunting Language
Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.
Python bindings for whatlang using pyo3
This is a python API which allows you to check for swear words in a youtube video, srt file, text file, custom source with multi language support. There are additional features like getting youtube transcript of a video, srt parser etc.
Guess programming language from a string or file.
Claim Processor provides automatic checking pipeline for detecting fine-grained hallucinations generated by Large Language Models.
RefChecker provides automatic checking pipeline for detecting fine-grained hallucinations generated by Large Language Models.
Language detection for news powered by fasttext
HuSpaCy: industrial strength Hungarian natural language processing
A Python library for language detection and translation using OpenAI's GPT-4o.
Detect the programming language of a source code
Voice-Activated Natural Language UI
Ultra-fast, Low Latency LLM security solution
Faster port of Language detection built by Shuyo in Python
A Python library for detecting lexical borrowings (with a focus on anglicisms in Spanish language)
Breame is a lightweight Python package with a number of tools to aid in the detection of words that have dual spellings and meanings in British and American English.
An open-source Python library for data cleaning tasks. Includes profanity detection, and removal. Now includes offensive language and hate speech detection using an AI model.
Python script designed to clone Git repositories, detect programming languages, validate the code, and log the results to an Excel file.
Detect languages via a fasttext model
Python bindings for the Lingua(LanguageDetect) Rust library
Fine-tuned BERT model for language detection.
Library that combines Robotics Hardware, iPhone and AI for Everyone
Web-Based Tool for Computer-Assisted Language Comparison
A lightweight language detection server