Speech-to-Text

speech-to-text

Speech to Text command using IBM Watson API

jiwer

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

deepgram-sdk

The official Python SDK for the Deepgram automated speech recognition platform.

deepgram speech-to-text

airunner

Run local open-source AI models (Stable Diffusion, LLMs, TTS, STT, chatbots) in a lightweight Python GUI

stable diffusion

realtimestt

A fast Voice Activity Detection and Transcription System

voice-activity-detection

pygpt-net

Desktop AI Assistant powered by models: OpenAI o1, GPT-4o, GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Llama 3, Mistral, Gemini, Claude, DeepSeek, Bielik, and other models supported by Langchain, Llama Index, and Ollama. Features include chatbot, text completion, image generation, vision analysis, speech-to-text, internet access, file handling, command execution and more.

whisper-s2t

An Optimized Speech-to-Text Pipeline for the Whisper Model.

elevenlabs-mcp

ElevenLabs MCP Server

phonexia-enhanced-speech-to-text-built-on-whisper-client

Client for communication with Phonexia Enhanced Speech To Text Built On Whisper microservice.

werpy

A powerful yet lightweight Python package to calculate and analyze the Word Error Rate (WER).

word error rate

automatic speech recognition

livekit-plugins-gladia

Agent Framework plugin for services using Gladia's API.

livetranscriber

Real-time microphone transcription with Deepgram using Python.

werx

A high-performance Python package for calculating Word Error Rate (WER), powered by Rust.

word error rate

automatic speech recognition

whisperlivekit

Real-time, fully local speech-to-text and speaker diarization with Whisper

esperanto

A unified interface for various AI model providers

tafrigh

Transcribe audio and create SRT and VTT files using Whisper models and wit.ai technology.

vaani-speech-to-text

Vaani is an open-source, AI-powered speech-to-text desktop app. Vaani (वाणी) refers to "speech" or "voice" in Sanskrit.

anaouder

Breton language speech-to-text tools

live-translation

A real-time translation tool using Whisper & Opus-MT

real-time translation

faster-whisper-hotkey

Push-to-talk transcription using faster-whisper

whispercpp-kit

A toolkit for whisper.cpp with audio processing and model management

py-listener

Real-time speech to text

offline speech to text

pvleopard

Leopard Speech-to-Text Engine.

Speech Recognition

Voice Recognition

Automatic Speech Recognition

aiola-stt

Aiola Speech-To-Text Python SDK

gtech-ariel

Google EMEA gTech Ads Data Science Team's solution to automatically translate and dub video ads into multiple languages using AI.

python ai genai speech-to-text translation text-to-speech video dubbing youtube gcp

kaushik-speech-to-text

This code is for speech to text created by me

whispa-app

GUI for Whisper transcription & MarianMT translation

pvcheetah

Cheetah Speech-to-Text Engine.

Speech Recognition

Voice Recognition

Automatic Speech Recognition

phonexia-audio-manipulation-detection-client

Audio Manipulation Detection Client

audio-manipulation

pellipop

A graphical and command-line tool to extract key frames from videos along with their retranscription. It uses the Whisper API to transcribe the audio. It also generates a CSV file with the extracted key frames and their corresponding text.

my-mcp-casper

My MCP Server

phonexia-transcription-normalization-client

Transcription Normalization Client

allvoicelab-mcp

AllVoiceLab MCP Server

huggingsound

HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools.

automatic speech recognition

voice recognition

sinhala-speech-to-text

A library for sending Sinhala audio files to a Flask API and decoding the received text

auroraapi

Python SDK for Aurora

voice speech text speech-to-text text-to-speech stt tts aurora auroraapi

scraibe

Transcription tool for audio files based on Whisper and Pyannote

speech-recognition

python-speech-to-text

A Python speech-to-text library

gptalk

Fast GPT-3 client for Windows and Unix that supports both text and speech in any language.

speech-recognition

deepgram-unstable-sdk

The official Python SDK for the Deepgram automated speech recognition platform.

deepgram speech-to-text

scribe-cli

scribe is a local speech recognition tool that provides real-time transcription using vosk and whisper AI, with the goal of serving as a virtual keyboard on a computer

speech recognition

at16k

at16k is a Python library to perform automatic speech recognition or speech to text conversion.

automatic speech recognition

speech recognition

speech analysis

robot-speech-to-text

stum

Tool for detecting and extracting text from intertitles in Swedish newsreels.

automatic speech recognition

aristech-stt-client

A Python client library for the Aristech Speech-to-Text API

asrecognition

ASRecognition: just an easy-to-use library for Automatic Speech Recognition.

automatic speech recognition

voice recognition

speech recognition

sonata-asr

SONATA: SOund and Narrative Advanced Transcription Assistant

mseep-elevenlabs-mcp

ElevenLabs MCP Server

asr2clip

A real-time speech-to-text clipboard tool.

tatt

Tatt creates a uniform API for multiple speech-to-text (STT) services.
