npm

Speech-to-Text

web-voice-kit

A fully decoupled browser voice SDK: independent wake-word detection plus speech transcription, with support for smart auto-stop.

published 0.5.0 • 2 months ago

@lakshmiprasanth/react-voice-to-text

A modern React package for voice-to-text conversion with real-time speech recognition and file upload support

voice-recognition

speech-recognition

lakshmiprasanth

published 1.0.2 • 9 months ago

sonioxsrt

Soniox async transcription helpers and SRT generation (TypeScript port).

lucferreira-27

published 0.1.1 • 6 months ago

@wiro-ai/n8n-nodes-wiroai

n8n community node for Wiro AI — 290+ AI models: video, image, audio, LLM, 3D, and more.

n8n-community-node-package

2.0.1 • last month

@monsterapi/whisper-playground

Welcome to the MonsterAPI Whisper Playground! This React template allows you to quickly set up a real-time speech-to-text transcription application using the Whisper model from MonsterAPI.

whisper-playground

whisper-react-app

monsterapi-whisper

dheerajk483

published 0.0.7 • 2 years ago

@usefulsensors/moonshine-js

On-device speech-to-text and voice control for web applications with Moonshine.

published 0.1.21 • 10 months ago

@daitanjs/speech

A library for text-to-speech (TTS) and speech-to-text (STT) operations using Google Cloud and OpenAI.

published 1.0.6 • 9 months ago

openai-whisper-js

openai-whisper-js is a Node.js wrapper for the OpenAI Whisper library, enabling seamless audio transcription using Whisper models. This package simplifies the process of interacting with Whisper by providing a JavaScript interface to execute transcription

1.0.7 • last year

@jordanburke/nodejs-whisper

Node bindings for OpenAI's Whisper. Optimized for CPU.

0.1.23 • last year

react-native-voice-recognition

This library is for those who use a functional structure and do not want to give up hooks, since the older one only supports classes.

voice recognition

allysonfield

published 0.1.7 • 2 years ago

@maestra-ai/live-sdk

Live SDK for Maestra AI transcription services

published 1.0.2 • 4 weeks ago

better-speech-recognition

An improved speech recognition library with TypeScript support

dibasdauliya

published 0.3.1 • last year

getusermedia-to-text

getUserMedia to Text via Google's Speech to Text API

bret

published 1.0.5 • 9 years ago

@restackio/integrations-deepgram

This package provides an integration for Deepgram's speech-to-text and text-to-speech services within the Restack AI framework.

aboutphilippe

published 0.0.13 • last year

bardie-ts

A powerful AI package (built using typescript) for interacting with the Google Bard API, without needing to set your own cookie!

published 1.3.4 • 2 years ago

react-voice-search

React voice search component with audio visualization, speech recognition, and cross-browser support for Web Speech API. SSR-compatible with Next.js.

published 1.1.1 • 9 months ago

use-stt

React hook for speech-to-text using multiple STT providers

published 0.1.9 • 11 months ago

@vocantai/n8n-nodes-translation-vocantai

Audio file transcription services. Your speech. Private.

n8n-community-node-package

n8n-nodes-translation-vocantai

1.0.3 • 9 months ago

n8n-nodes-dudoxx

n8n community node package for interacting with the AssemblyAI API for speech-to-text transcription

n8n-community-node-package

published 0.1.1 • last year

@chikistudio/block-assemblyai

Use the AssemblyAI block for Studio to use AssemblyAI's models to [transcribe audio with Speech-to-Text models](https://www.assemblyai.com/products/speech-to-text?utm_source=studio), analyze audio with [audio intelligence models](https://www.assemblyai.co

1.0.2 • last year

react-native-ai-kit

A React Native utility kit for AI-powered apps with chat, image generation, speech-to-text, and text-to-speech capabilities

image-generation

published 0.1.3 • 7 months ago

speech2type

A minimal CLI tool that gives you quick voice typing in every Mac app.

gergomiklos

published 0.1.0 • 8 months ago

hyperbutter-microphone

Microphone plugin for use of speech-to-text in Hyper Butter

endoplasmic

published 1.0.3 • 9 years ago

node-ai-ragbot

Node.js backend package for building AI chatbots and voicebots with Retrieval-Augmented Generation (RAG). It ingests website pages or local files (PDF, DOCX, TXT, MD), creates embeddings with LangChain + OpenAI, stores them in a fast in-memory vector data

published 1.0.2 • 7 months ago

ember-cli-microsoft-speech-shim

The default blueprint for ember-cli addons.

mattmazzola

published 1.0.2 • 9 years ago

@argmax/local-server

Library for managing Argmax Local Server on macOS

published 0.0.5 • 8 months ago

susurro-audio

🎙️ Real-time conversational audio with AI transcription. Build ChatGPT-style voice interfaces in minutes with <300ms latency

bernarduriza

published 2.1.1 • 9 months ago

@argmaxinc/local-server

Library for managing Argmax Local Server on macOS

published 0.0.5 • 8 months ago

ejoy-react-native-speech

A text-to-speech library for React Native (iOS version).

winglonelion

published 0.2.6 • 8 years ago

@raghavendra_kj/stt-js

A wrapper around the Web Speech API for speech-to-text functionality.

speech-recognition

voice-recognition

raghavendra_kj

published 1.0.7 • 12 months ago

react-native-bluemix

React Native module for IBM Bluemix services

react-component

published 1.4.5 • 8 years ago

d-ser-t-service

Dynamic Sentence Error Rate Testing: A Package for testing the CRIS speech-to-text model, quantifying the quality of the model with respect to its Word Error Rate

speech-recognition

sentence-error-rate

word-error-rate

published 1.3.0 • 7 years ago

testaugnitosdk

AugnitoSDK lets you make use of the Speech Recognition AI. You can edit, format and complete reports at the speed of human speech, with the best-in-class accuracy

1.0.33 • last year

@funfunfunco/react-native-speech

A text-to-speech library for React Native.

published 0.1.7 • 8 years ago

choo-stt

Simple speech-to-text in the browser for choo

published 1.0.0 • 8 years ago

node-red-watson-api-lite

For using the node `speech-to-text`, `sox` is needed for mac/windows users and `arecord` is needed for linux users.

published 1.1.2 • 6 years ago

voiceai-sdk

Official SDK for SLNG.AI Voice API - Text-to-Speech, Speech-to-Text, and LLM services

published 0.1.5 • 8 months ago

botbuilder-calling-speech

Dialog extensions for botbuilder-calling to manage speech-to-text and language understanding

published 1.0.21 • 9 years ago

stt-evaluation

Generic word-error-rate evaluation package

Word error rate

Speech recognition

macardoso95

published 1.4.0 • 5 years ago

@datafire/google_speech

DataFire integration for Cloud Speech-to-Text API

published 6.0.0 • 5 years ago

ngx-audiostream

This Angular library helps you record someone and put the output audio into a Google Speech-to-Text compatible format.

published 1.0.0 • 6 years ago

meeting-whisperer

CLI tool that transcribes audio/video files using AssemblyAI's API and outputs formatted markdown transcripts

published 0.1.0 • 11 months ago

text-to-speech-mp3

A Node.js package that converts text to speech (TTS) and generates MP3 audio files, using the Google Translate API. A simple and efficient solution for adding voice to your projects.

nodejs audio package

mrravipandee

published 2.2.2 • 12 months ago

solask-voice-sdk

You can query the blockchain using voice.

susmitasanti

published 1.0.10 • 11 months ago

infobot-yandex-stt

Node.JS library for Yandex Cloud Speech-to-Text with streaming recognition

speech recognition

published 1.1.3 • 5 years ago

@aiola/web-sdk-stt

Aiola Speech-To-Text JavaScript SDK

published 0.1.5 • 10 months ago

react-transcribe

React component for speech-to-text transcription with silence detection

speech-recognition

silence-detection

published 0.1.0 • last year

web-speech-cognitive-services-root

Web Speech API adapter to use Cognitive Services Speech Services for both speech-to-text and text-to-speech service.

published 8.1.1-hip.5 • last year

quill-stt

A Quill rich text editor module that adds speech-to-text input capabilities

published 1.0.0 • last year

@chinchillaenterprises/mcp-elevenlabs

Multi-tenant ElevenLabs MCP server for advanced speech-to-text transcription with speaker diarization

speaker-diarization

published 1.0.0 • 9 months ago


Made with ⚡️ by Socket Inc

U.S. Patent No. 12,346,443 & 12,314,394. Other pending.