Language detection library ported from Google's language-detection.
pysbd (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box across many languages.
A flexible free and unlimited python tool to translate between different languages in a simple way using multiple translators
80x faster and 95% accurate language identification with Fasttext
langid.py is a standalone Language Identification (LangID) tool.
An accurate natural language detection library, suitable for short text and mixed-language text
Fully customizable language detection for spaCy pipeline
Python bindings around Google Chromium's embedded compact language detection library (CLD2)
Fork of the language identification tool langid.py, featuring a modernized codebase and faster execution times.
Quickly detect text language and segment language
fasttext with wheels and no external dependency, but only the predict method (<1MB)
OCR, layout, reading order, and table recognition in 90+ languages
Multi-lingual Automatic Speech Recognition (ASR) based on Whisper models, with accurate word timestamps, access to language detection confidence, several options for Voice Activity Detection (VAD), and more.
LLM-Guard is a comprehensive tool designed to fortify the security of Large Language Models (LLMs). By offering sanitization, detection of harmful language, prevention of data leakage, and resistance against prompt injection attacks, LLM-Guard ensures that your interactions with LLMs remain safe and secure.
Fully customizable language detection pipeline for spaCy
A lightweight toolkit for multilingual lemmatization and language detection.
pysbd (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box across many languages.
Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.
Translate, transliterate, get the language of texts in no time with the help of multiple APIs!
This is a python API which allows you to check for swear words in a youtube video, srt file, text file, custom source with multi language support. There are additional features like getting youtube transcript of a video, srt parser etc.
A Python library for language detection and translation using langchain ChatModels
HuSpaCy: industrial strength Hungarian natural language processing
Kestrel Threat Hunting Language
Detect the programming language of a source code
HuSpaCy: industrial strength Hungarian natural language processing
Faster port of Language detection built by Shuyo in Python
Language Detection API Client
Guess programming language from a string or file.
Detect language support for font binaries
Python bindings for whatlang using pyo3
Language detection for news powered by fasttext
80x faster and 95% accurate language identification with Fasttext
CLI for DeepChopper: A Genomic Language Model for Chimera Artifact Detection
Claim Processor provides automatic checking pipeline for detecting fine-grained hallucinations generated by Large Language Models.
MOMENT: A Family of Open Time Series Foundation Models
Ultra-fast, Low Latency LLM security solution
Python script designed to clone Git repositories, detect programming languages, validate the code, and log the results to an Excel file.
A simple language detection library for short texts.
RefChecker provides automatic checking pipeline for detecting fine-grained hallucinations generated by Large Language Models.
Voice-Activated Natural Language UI
Library that combines Robotics Hardware, iPhone and AI for Everyone
Detect languages via a fasttext model
A Python library for detecting lexical borrowings (with a focus on anglicisms in Spanish language)
Breame is a lightweight Python package with a number of tools to aid in the detection of words that have dual spellings and meanings in British and American English.
A Python library for language detection and translation using OpenAI's GPT-4o.
Language detection library ported from Google's language-detection.
A PDF language detection and OCR tool
A Genomic Language Model for Chimera Artifact Detection in Nanopore Direct RNA Sequencing