Speech to Text command using IBM Watson API
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
The official Python SDK for the Deepgram automated speech recognition platform.
Desktop AI Assistant powered by models: OpenAI o1, GPT-4o, GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Llama 3, Mistral, Gemini, Claude, Bielik, and other models supported by Langchain, Llama Index, and Ollama. Features include chatbot, text completion, image generation, vision analysis, speech-to-text, internet access, file handling, command execution and more.
A python speech to text library
This code is for speech to text created by me
Client for communication with Phonexia Enhanced Speech To Text Built On Whisper microservice.
Leopard Speech-to-Text Engine.
تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai.
A unified interface for various AI model providers
A powerful yet lightweight Python package to calculate and analyze the Word Error Rate (WER).
Breton language speech-to-text tools
Voicegain Speech-to-Text Python SDK
Uses whisper AI to transcribe speach from video and audio files. Also accepts urls for youtube, rumble, bitchute, clear file, etc.
Cheetah Speech-to-Text Engine.
Google EMEA gTech Ads Data Science Team's solution to automatically translate and dub video ads into multiple languages using AI.
The official Python SDK for the Deepgram automated speech recognition platform.
Tatt creates a uniform API for multiple speech-to-text (STT) services.
Cheetah speech-to-text engine demos
A Speech-to-Text toolkit with VAD, punctuation, and emotion classification
A Python client library for the Aristech Speech-to-Text API
Fast GPT-3 client for Windows and Unix that supports both text and speech in any language.
A package for text-to-speech and speech-to-text tools
A simple speech-to-text application using Wit.ai
HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools.
An Optimized Speech-to-Text Pipeline for the Whisper Model.
Leopard speech-to-text engine demos
A library for sending Sinhala audio files to a Flask API and decoding the received text
Python SDK for Aurora
Transcription tool for audio files based on Whisper and Pyannote
framework for synchronous batch speech-to-text transcription using backends like AWS, Watson, etc.
A toolkit for whisper.cpp with audio processing and model management
openai/whisper speech to text model + extra features
framework for synchronous batch speech-to-text transcription using backends like AWS, Watson, etc.
A web interface for the ScAIbe speech-to-text transcription tool
at16k is a Python library to perform automatic speech recognition or speech to text conversion.
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
FrogBase simplifies the download-transcribe-embed-index workflow for multi-media content. It does so by linking content from various platforms with speech-to-text models, image & text encoders and embedding stores.
tpro processes transcripts from speech-to-text services and outputs to various formats.
Using Gladia's Whisper API for transcribing YouTube videos
Jarvis - Voice Personal Assistant
A Python SDK for video processing, providing functionalities like speech-to-text, summarization, transcription, and chaptering.
S.T.A.R.K - Speech and Text Algorithmic Recognition Kit. Modern framework for creating powerfull voice assistants.
Convert images or audio files to plain text on the command line
Unified Speech-to-text Client
high quality multi-lingual speech to text
This package is for extract text from audio/video file