Mastra Sarvam AI voice integration
Dictate Button (Web Component)
React Native Speech To Text for android
Node.js bindings for OpenAI's Whisper. Optimized for CPU.
A highly customizable React chatbot component with support for Gemini, OpenAI, Anthropic, and Groq APIs.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
n8n node for GetTranscribe API integration - transcribe videos from Instagram, TikTok, YouTube and more
The speech to text API provides two endpoints, transcriptions and translations, based on our state-of-the-art open source large-v2 Whisper model.
Real-time Speech-to-Text React hook and component using Deepgram via a secure WebSocket proxy.
A lightweight JavaScript SDK for real-time audio and text processing via Speaqr's public api.
SDK for building AI agents with seamless voice-text context switching
Pure browser PCM S16LE audio recorder via AudioWorklet for ASR (no MediaRecorder, no Opus).
`vosk-speech-to-text`, ha sido marcado como obsoleto y ha sido trasladado a [`cordova-offline-speech`](https://www.npmjs.com/package/cordova-plugin-offline-speech).
Wrapper around any <TextInput /> in React Native that provides support for speech to text functionality—powerful & ease to use
Use the AssemblyAI piece for Activepieces to use AssemblyAI's models to [transcribe audio with Speech-to-Text models](https://www.assemblyai.com/products/speech-to-text?utm_source=activepieces), analyze audio with [audio intelligence models](https://www.a
MCP Server for Yandex SpeechKit with STT v3 API, TTS, and advanced audio processing
TypeScript typings for Cloud Speech-to-Text API v1p1beta1
TypeScript typings for Cloud Speech-to-Text API v1
Speech-to-text interface for Directus using OpenAI Whisper API
Google cloud speech to text streaming GRPC module
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Cheetah Speech-to-Text engine for web browsers (via WebAssembly)
A binding for OpenAI's Whisper Speech-to-Text system, supporting async operations.
React hook for Cheetah Web SDK
NodeJs developers API for Vosk-api speech-to-text engine.
Turn speech into text.
Servicio Angular para manejar el plugin Cordova cordova-plugin-offline-speech ## install - cordova plugins add cordova-plugin-offline-speech - npm i angular-speech-to-text
A React Native module for speech-to-text input functionality
Custom React Native native module for speech-to-text functionality using Android native APIs.
A React Native package for Azure Speech to Text
A simple Audio recorder and player to record audio content.
'react-native-voice-to-text' is a React Native module facilitating real-time conversion of spoken words into text, enabling hands-free interaction in mobile applications for tasks like messaging, note-taking, and search functionalities.
TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.
Picovoice Leopard Node.js binding
Real-time audio transcription in the browser using OpenAI's Whisper model via WebAssembly
lib for audio transcription and diarization
adalo speech to text component
Ready-made UI components to build a reactive voice interface to a web site or app. Uses Speechly's real-time cloud API for speech-to-text and NLU.
A minimal CLI tool that gives you quick voice typing in every Mac app.
Real-time speech-to-text CLI tool using OpenAI Realtime API
`state-speech-synth` is a lightweight wrapper around the native speech-to-text API [`speechSynthesis`](https://developer.mozilla.org/en-US/docs/Web/API/SpeechSynthesis)+[`SpeechSynthesisUtterance`](https://developer.mozilla.org/en-US/docs/Web/API/SpeechSy
Node bindings for OpenAI's Whisper. Optimized for CPU. (Updated with latest whisper.cpp September 2023)
Polyfill Web Speech API with Cognitive Services Speech-to-Text service
SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.
Leopard Speech-to-Text engine for web browsers (via WebAssembly)
A cross-platform Unity plugin that uses the Google Cloud Speech-To-Text API to perform real-time, streaming transcription of microphone input.