Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
The AssemblyAI JavaScript SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, as well as the latest LeMUR models.
Simple cross-browser speech to text using react hooks.
Provides S2T support to any Textinput in react native.
speech to text with google speech api
Polyfill Web Speech API with Cognitive Services Speech-to-Text service
Node bindings for OpenAI's Whisper. Optimized for CPU.
Javascript client library for Soniox Speech-to-Text websocket API
Node.js bindings for OpenAI's Whisper. Runs local on CPU.
A speech to text module for the discord voice implementation
Nó n8n para integração com a API da ElevenLabs incluindo Speech-to-Text, Text-to-Speech e Conversational AI
An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
Wrapper around any <TextInput /> in React Native that provides support for speech to text functionality—powerful & ease to use
Basic utility package to convert speech input to text input
A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
A React Native module for speech-to-text input functionality
Servicio Angular para manejar el plugin Cordova cordova-plugin-offline-speech ## install - cordova plugins add cordova-plugin-offline-speech - npm i angular-speech-to-text
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
React Native module that allows an React Native application to call native speech recognition APIs and to get the recognized text in return. This is a work in progress since it only works on Android. Although the work on iOS is almost finished.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Google Speech-to-Text integration for IVR Tester
Mastra Sarvam AI voice integration
moved to speechless
A web component that a developer can use to enable accurate speech-to-text, either locally or in the cloud, in their web-based application, including a WebView.
Wrapper to transform any textinput into Speech-to-text Texinput
Record and recognize, and stream speech transcripts through the WebSpeech API.
AugnitoSDK lets you make use of the Speech Recognition AI. You can edit, format and complete reports at the speed of human speech, with the best-in-class accuracy
React Native speech recognition component for iOS 10+
TypeScript typings for Cloud Speech-to-Text API v1p1beta1
TypeScript typings for Cloud Speech-to-Text API v1
Mastra Sarvam AI voice integration
Real-time audio transcription in the browser using OpenAI's Whisper model via WebAssembly
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
A cross-platform Unity plugin that uses the Google Cloud Speech-To-Text API to perform real-time, streaming transcription of microphone input.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
This Angular Module (Component) that converts speech to text
A library for using Web Speech API with Angular
A cross-platform Unity plugin that uses the Google Cloud Speech-To-Text API to perform real-time, streaming transcription of microphone input.
The speech to text API provides two endpoints, transcriptions and translations, based on our state-of-the-art open source large-v2 Whisper model.
Cheetah Speech-to-Text engine for web browsers (via WebAssembly)
A powerful, fully configurable React component for real-time voice chat powered by OpenAI's Realtime API. Create natural conversations with AI using advanced voice recognition and synthesis.
TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.
Lightweight TypeScript library for transcribing audio files using Google Gemini 2.0 models. Supports local files, remote URLs, and Blobs.
Custom React Native native module for speech-to-text functionality using Android native APIs.
Node.js plugin for speech recognition that works with OpenAI's Whisper models using ONNX.