A speech to text module.
Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
The AssemblyAI JavaScript SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, as well as the latest LeMUR models.
Simple cross-browser speech-to-text using React hooks.
Polyfill Web Speech API with Cognitive Services Speech-to-Text service
A speech-to-text module for the Discord voice implementation
JavaScript client library for the Soniox Speech-to-Text WebSocket API
Provides speech-to-text (S2T) support for any TextInput in React Native.
Speech-to-text with the Google Speech API
Node bindings for OpenAI's Whisper. Optimized for CPU.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Node.js bindings for OpenAI's Whisper. Runs locally on CPU.
An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Google Speech-to-Text integration for IVR Tester
moved to speechless
This is a shared library for speech-to-text microservices
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text
aiOla JavaScript SDK
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
A sample browser app for Bluemix that uses the speech-to-text service, fetching a token via Node.js
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Mastra Sarvam AI voice integration
React Native Speech To Text for Android
Wrapper to transform any TextInput into a speech-to-text TextInput
A web component that a developer can use to enable accurate speech-to-text, either locally or in the cloud, in their web-based application, including a WebView.
Record and recognize speech, and stream transcripts through the Web Speech API.
AugnitoSDK lets you make use of Augnito's Speech Recognition AI. You can edit, format and complete reports at the speed of human speech, with best-in-class accuracy.
Mastra Sarvam AI voice integration
React Native module that allows a React Native application to call native speech recognition APIs and get the recognized text in return. This is a work in progress: it currently works only on Android, although the iOS work is almost finished.
Client SDK for Maestra AI transcription services
Mastra Gladia AI voice integration
Google Cloud Speech-to-Text streaming gRPC module
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
A powerful React hook for real-time voice streaming, designed for AI-powered applications. Perfect for real-time transcription, voice assistants, and audio processing with features like silence detection and configurable audio processing.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Google Cloud Speech-to-Text streaming gRPC module
This is a speech-to-text converter built in Vue 2. It provides speech translations in almost all languages.
TypeScript typings for Cloud Speech-to-Text API v1p1beta1
TypeScript typings for Cloud Speech-to-Text API v1
React Component For Web Speech API
React Native speech recognition component for iOS 10+
Node module to convert speech to text using Google services
A library for using Web Speech API with Angular
Node.js plugin for speech recognition that works with OpenAI's Whisper models using ONNX.