Speech-to-text command using the IBM Watson API
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
The official Python SDK for the Deepgram automated speech recognition platform.
Client for communication with Phonexia Enhanced Speech To Text Built On Whisper microservice.
Desktop AI Assistant powered by models: OpenAI o1, GPT-4o, GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Llama 3, Mistral, Gemini, Claude, DeepSeek, Bielik, and other models supported by Langchain, Llama Index, and Ollama. Features include chatbot, text completion, image generation, vision analysis, speech-to-text, internet access, file handling, command execution and more.
Transcribe audio to text and create SRT and VTT files using Whisper models and wit.ai.
A powerful yet lightweight Python package to calculate and analyze the Word Error Rate (WER).
A Python speech-to-text library
Leopard Speech-to-Text Engine.
Real-time speech-to-text
Google EMEA gTech Ads Data Science Team's solution to automatically translate and dub video ads into multiple languages using AI.
Breton language speech-to-text tools
Speech-to-text code written by me
Voicegain Speech-to-Text Python SDK
Cheetah Speech-to-Text Engine.
An Optimized Speech-to-Text Pipeline for the Whisper Model.
Tatt creates a uniform API for multiple speech-to-text (STT) services.
Generate chat data from multi-speaker audio files
Leopard speech-to-text engine demos
Cheetah speech-to-text engine demos
Transcription tool for audio files based on Whisper and Pyannote
A library for sending Sinhala audio files to a Flask API and decoding the received text
HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools.
Python SDK for Aurora
A toolkit for whisper.cpp with audio processing and model management
Voice control using Whisper.cpp with LangChain cleanup
The openai/whisper speech-to-text model plus extra features
Fast GPT-3 client for Windows and Unix that supports both text and speech in any language.
Framework for synchronous batch speech-to-text transcription using backends such as AWS and Watson.
A real-time speech-to-text clipboard tool.
Dead simple speech-to-text
A package for text-to-speech and speech-to-text tools
tpro processes transcripts from speech-to-text services and outputs to various formats.
at16k is a Python library for automatic speech recognition, or speech-to-text conversion.
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
A Speech-to-Text toolkit with VAD, punctuation, and emotion classification
S.T.A.R.K - Speech and Text Algorithmic Recognition Kit. Modern framework for creating powerful voice assistants.
High-quality multilingual speech-to-text
Using Gladia's Whisper API for transcribing YouTube videos
A web interface for the ScAIbe speech-to-text transcription tool
Client for communication with Phonexia Speech To Text Whisper Enhanced microservice.
Jarvis - Voice Personal Assistant