
Research
/Security News
Critical Vulnerability in NestJS Devtools: Localhost RCE via Sandbox Escape
A flawed sandbox in @nestjs/devtools-integration lets attackers run code on your machine via CSRF, leading to full Remote Code Execution (RCE).
@speechmatics/browser-audio-input
Advanced tools
Manage audio input devices and persmissions across browsers
@speechmatics/browser-audio-input
This package provides Javascript utilities for managing audio input devices and permissions in the browser.
It also provides utilities for capturing PCM data from these devices, if desired.
npm i @speechmatics/browser-audio-input
import { getAudioDevicesStore } from "@speechmatics/browser-audio-input";
const audioDevices = getAudioDevicesStore();
audioDevices.addEventListener("changeDevices", (e) => {
if (audioDevices.permissionState === "granted") {
// This will print all available devices
console.log(audioDevices.devices)
}
});
See the README for @speechmatics/browser-audio-input-react
for a complete example.
We will add non-React examples soon. If you'd like to request a specific one, feel free to file an issue!
To capture PCM audio, you must supply an AudioContext
. From there, this library deals with dispatching audio events which can be subscribed to:
import {
type InputAudioEvent,
PCMRecorder,
} from '@speechmatics/browser-audio-input';
const PCMRecorder = new PCMRecorder("/path/to/pcm-audio-worklet.min.js"); // <- (see note below about this)
PCMRecorder.addEventListener('recordingStarted', () => {
console.log("Recording started!");
});
// Later in your app...
const audioContext = new AudioContext();
pcmRecorder.startRecording({ audioContext });
You can also pass a device ID like so:
import { getAudioDevicesStore } from "@speechmatics/browser-audio-input";
const audioContext = new AudioContext();
// This picks the first device ID (assuming permission has been granted)
const audioDevices = getAudioDevicesStore();
const deviceId = audioDevices.permissionState === "granted" ? audioDevices.devices[0] : undefined;
pcmRecorder.startRecording({ audioContext, deviceId });
You can pass whatever 'MediaTrackSettings' you want through the recordingOptions
property:
pcmRecorder.startRecording({
audioContext,
deviceId,
recordingOptions: {
noiseSuppression: false,
},
});
By default we enable the following to optimize for speech:
{
noiseSuppression: true,
echoCancellation: true,
autoGainControl: true,
}
Note that the last two may not be supported in Safari
AudioWorklet
script URLWhen recording audio in the browser, there are generally three approaches:
createScriptProcessor()
: Can capture PCM data on the main thread, but is deprecated and suffers from poor performance easily.MediaRecorder
: Provides a simple API, but cannot capture PCM data (only MPEG/OGG)AudioWorklet
: Captures/processes PCM on dedicated thread.This library leverages AudioWorklet
to capture PCM audio (specifically 32-bit Float PCM, which is the underlying representation in the browser).
Since AudioWorklets
run outside the main thread, their code must be run from an external source (i.e. a URL).
The code for this PCM audio processor is provided by this library at /dist/pcm-audio-worklet.min.js
. However, how this script is loaded depends on your bundler setup.
At the moment, Webpack doesn't have a great story for AudioWorklet
scripts (see Github issue). Instead, we recommend installing the copy-webpack-plugin
package to be able to copy our pcm-audio-worklet.min.js
directly into your /public
folder:
const CopyPlugin = require("copy-webpack-plugin");
module.exports = {
// ... rest of your Webpack config
plugins: [
new CopyWebpackPlugin({
patterns: [
{
from: path.resolve(
__dirname,
'node_modules/@speechmatics/browser-audio-input/dist/pcm-audio-worklet.min.js',
),
to: path.resolve(__dirname, 'public/js/[name][ext]'),
},
],
}),
]
};
See Webpack documentation for more details.
Vite supports referencing bundled code by URL for use in Workers. This can be used like so:
import {
type InputAudioEvent,
PCMRecorder,
} from '@speechmatics/browser-audio-input';
import PCMAudioWorkletUrl from "@speechmatics/browser-audio-input/pcm-audio-worklet.min.js?url";
const PCMRecorder = new PCMRecorder(PCMAudioWorkletUrl);
FAQs
Manage audio input devices and persmissions across browsers
The npm package @speechmatics/browser-audio-input receives a total of 1,108 weekly downloads. As such, @speechmatics/browser-audio-input popularity was classified as popular.
We found that @speechmatics/browser-audio-input demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 2 open source maintainers collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Research
/Security News
A flawed sandbox in @nestjs/devtools-integration lets attackers run code on your machine via CSRF, leading to full Remote Code Execution (RCE).
Product
Customize license detection with Socket’s new license overlays: gain control, reduce noise, and handle edge cases with precision.
Product
Socket now supports Rust and Cargo, offering package search for all users and experimental SBOM generation for enterprise projects.