github.com/stts-se/TillStudpub/audio_streaming

v0.0.0-20210519123831-7166b62c3186
Overview

audio_streaming is a simple proof-of-concept application for streaming a user's microphone audio from a client to a server. The audio is saved on the server as a wav file, along with a JSON file containing relevant metadata about the recording.

In this document, we describe the application itself and how it works.

Technical description of the application

The communication between the client and the server is performed using WebSockets.

There are two available clients for testing the application:

JavaScript for browser use
A Go command-line client

The client opens a WebSocket connection for each recording, and sends a "handshake" message to the server. If the server is up and running, and the handshake is correct and valid, the server responds with the same handshake message, adding a unique identifier (UUID). Once this handshake is received by the client, the audio stream capture is started. The audio input is processed in chunks of 2048 bytes, converted to 16-bit depth, and sent to the server using the open WebSocket.
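The conversion to 16-bit depth mentioned above can be sketched in Go. This is only an illustration under stated assumptions: the input samples are taken to be 32-bit floats in the range [-1, 1] (the format the Web Audio API delivers), the output is assumed to be signed little-endian PCM, and the function name floatToPCM16 is hypothetical. The project's own clients may do this differently.

```go
package main

import (
	"encoding/binary"
	"fmt"
)

// floatToPCM16 converts float samples in [-1, 1] to 16-bit signed
// little-endian PCM. Values outside that range are clamped first.
// Hypothetical helper, not taken from the project's source.
func floatToPCM16(samples []float32) []byte {
	out := make([]byte, 2*len(samples))
	for i, s := range samples {
		if s > 1 {
			s = 1
		} else if s < -1 {
			s = -1
		}
		v := int16(s * 32767) // scale to the 16-bit range, truncating toward zero
		binary.LittleEndian.PutUint16(out[2*i:], uint16(v))
	}
	return out
}

func main() {
	pcm := floatToPCM16([]float32{0, 0.5, -0.5, 1, -1})
	fmt.Printf("% x\n", pcm) // two bytes per input sample
}
```

Each input sample becomes two output bytes, so a buffer of N float samples yields 2N bytes to send over the WebSocket.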

On the receiving end, the server continuously writes received bytes to a file buffer.

When the user stops the recording, the audio capture is terminated and the WebSocket is closed. When the server receives the close message over the WebSocket, the buffered audio data is saved to disk as a wav file, along with a JSON file containing relevant metadata about the recording, including the audio parameters needed to play the file. The files are stored in the data directory on the server. Each file is named after the recording's unique identifier (UUID), with the extensions .wav and .json. The most recently created files are also copied to "latest.wav" and "latest.json", as a convenience for testing.
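The file layout described above can be sketched as follows. The Metadata struct and its JSON field names are assumptions made for illustration (the real schema is not shown in this document), and saveRecording and roundTrip are hypothetical names. The sketch writes the files into a temporary directory rather than the server's data directory.

```go
package main

import (
	"encoding/json"
	"fmt"
	"os"
	"path/filepath"
)

// Metadata is a hypothetical shape for the JSON file. The field names are
// assumptions, not the project's actual schema.
type Metadata struct {
	UUID       string `json:"uuid"`
	SampleRate int    `json:"sample_rate"`
	Channels   int    `json:"channels"`
	BitDepth   int    `json:"bit_depth"`
	Encoding   string `json:"encoding"`
}

// saveRecording writes <uuid>.wav and <uuid>.json into dir, then writes the
// same content to latest.wav and latest.json (equivalent to copying them).
func saveRecording(dir string, meta Metadata, wav []byte) error {
	metaJSON, err := json.MarshalIndent(meta, "", "  ")
	if err != nil {
		return err
	}
	files := map[string][]byte{
		meta.UUID + ".wav":  wav,
		meta.UUID + ".json": metaJSON,
		"latest.wav":        wav,
		"latest.json":       metaJSON,
	}
	for name, content := range files {
		if err := os.WriteFile(filepath.Join(dir, name), content, 0o644); err != nil {
			return err
		}
	}
	return nil
}

// roundTrip saves a dummy recording in a temporary directory and reads the
// metadata back from latest.json.
func roundTrip() (Metadata, error) {
	var got Metadata
	dir, err := os.MkdirTemp("", "audio_streaming_sketch")
	if err != nil {
		return got, err
	}
	defer os.RemoveAll(dir)

	meta := Metadata{UUID: "example-id", SampleRate: 48000, Channels: 1, BitDepth: 16, Encoding: "linear16"}
	if err := saveRecording(dir, meta, []byte("dummy audio")); err != nil {
		return got, err
	}
	raw, err := os.ReadFile(filepath.Join(dir, "latest.json"))
	if err != nil {
		return got, err
	}
	err = json.Unmarshal(raw, &got)
	return got, err
}

func main() {
	meta, err := roundTrip()
	fmt.Println(meta, err)
}
```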

Supported streaming technologies

More background on the different technologies can be found in the accompanying technical report.

ScriptProcessorNode

The ScriptProcessorNode was introduced to meet developers' need to process audio streams in the Web Audio API. Unlike other parts of the Web Audio API, the processing is run in the main thread, which can cause delays. It has since been deprecated and replaced by AudioWorklet (below).

AudioWorklet

The AudioWorklet has been developed to handle some critical design flaws in the ScriptProcessorNode.

By default, this application uses the AudioWorklet.

The implementation in this demo has been tested with the following browsers:

Google Chrome - version 81 - supported, working
Opera - version 68 - supported, working
Firefox - version 75 - not supported
Firefox - version 76 beta - not working; support is expected in the version 76 release

Saved audio format

The audio is saved as a wav file. The wav output is still under development, and may be faulty in some cases.

If the wav output seems faulty, the server can be started with an option to save the raw audio data in a .raw file. To play a recorded .raw file on Linux systems, run the SoX play command with the correct parameters, e.g.

play -e signed-integer -r 48000 -b 16 -c 1 <rawfile>

On Windows, you can for example use the Import function in Audacity.

Hints on what parameters to use can be found in the JSON files accompanying each .raw file.
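As an alternative to playing the .raw file directly, the raw data can be wrapped in a standard 44-byte PCM wav header. The sketch below is not the project's wav writer. It assumes headerless, signed 16-bit little-endian PCM, mono, at 48000 Hz (the parameters from the play example above; the byte order is an assumption), and the names wavHeader and raw2wav are hypothetical.

```go
package main

import (
	"encoding/binary"
	"fmt"
	"os"
)

// wavHeader builds the canonical 44-byte RIFF/WAVE header for uncompressed
// PCM data of dataLen bytes. Hypothetical helper for illustration.
func wavHeader(dataLen, sampleRate, channels, bitsPerSample int) []byte {
	h := make([]byte, 44)
	blockAlign := channels * bitsPerSample / 8 // bytes per sample frame
	byteRate := sampleRate * blockAlign        // bytes per second

	copy(h[0:4], "RIFF")
	binary.LittleEndian.PutUint32(h[4:8], uint32(36+dataLen)) // file size minus 8
	copy(h[8:12], "WAVE")
	copy(h[12:16], "fmt ")
	binary.LittleEndian.PutUint32(h[16:20], 16) // size of the fmt chunk
	binary.LittleEndian.PutUint16(h[20:22], 1)  // audio format 1 = PCM
	binary.LittleEndian.PutUint16(h[22:24], uint16(channels))
	binary.LittleEndian.PutUint32(h[24:28], uint32(sampleRate))
	binary.LittleEndian.PutUint32(h[28:32], uint32(byteRate))
	binary.LittleEndian.PutUint16(h[32:34], uint16(blockAlign))
	binary.LittleEndian.PutUint16(h[34:36], uint16(bitsPerSample))
	copy(h[36:40], "data")
	binary.LittleEndian.PutUint32(h[40:44], uint32(dataLen))
	return h
}

func main() {
	if len(os.Args) != 3 {
		fmt.Println("usage: raw2wav <rawfile> <wavfile>")
		return
	}
	raw, err := os.ReadFile(os.Args[1])
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	// Assumed parameters: 48000 Hz, 1 channel, 16 bits per sample.
	wav := append(wavHeader(len(raw), 48000, 1, 16), raw...)
	if err := os.WriteFile(os.Args[2], wav, 0o644); err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
}
```

With these parameters the stream takes 96000 bytes per second, so the duration of a recording in seconds is roughly the .raw file size divided by 96000. If the JSON file reports different parameters, adjust the arguments to wavHeader accordingly.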

Usage

To start the server, change directory to audio_streaming and run

go run cmd/audstr_server/main.go

If you prefer a precompiled executable, use the audstr_server command from a published release:

$ unzip audio_demo.zip
$ cd audio_streaming
$ ./audstr_server

Clients:

JavaScript: Point your browser to http://localhost:7651

To use the deprecated ScriptProcessorNode implementation, use http://localhost:7651?mode=scriptprocessornode
Go command line client: See folder cmd/audstr_client

You can use the Go client to stream microphone audio captured with the SoX rec command:

rec -r 48000 -t raw -c 1 - 2> /dev/null | go run cmd/audstr_client/main.go -channels 1 -sample_rate 48000 -encoding linear16 -host 127.0.0.1 -port 7651 -
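In the pipeline above, the client reads raw audio from standard input and forwards it to the server. A minimal sketch of that read loop is shown below. It is not the actual audstr_client code: the send callback stands in for a WebSocket write, and the names streamChunks and chunkLengths are hypothetical. The chunk size of 2048 bytes is borrowed from the technical description.

```go
package main

import (
	"bytes"
	"errors"
	"fmt"
	"io"
)

// streamChunks reads r in chunks of chunkSize bytes and passes each chunk to
// send. The final chunk may be shorter. In a real client, send would write a
// binary message to the open WebSocket. The slice passed to send is reused
// between calls, so send must not keep a reference to it.
func streamChunks(r io.Reader, chunkSize int, send func([]byte) error) error {
	if chunkSize <= 0 {
		return errors.New("chunkSize must be positive")
	}
	buf := make([]byte, chunkSize)
	for {
		n, err := io.ReadFull(r, buf)
		if n > 0 {
			if sendErr := send(buf[:n]); sendErr != nil {
				return sendErr
			}
		}
		// io.EOF: nothing left to read. io.ErrUnexpectedEOF: a short final chunk.
		if errors.Is(err, io.EOF) || errors.Is(err, io.ErrUnexpectedEOF) {
			return nil
		}
		if err != nil {
			return err
		}
	}
}

// chunkLengths runs streamChunks over data and records each chunk's size.
func chunkLengths(data []byte, chunkSize int) ([]int, error) {
	var sizes []int
	err := streamChunks(bytes.NewReader(data), chunkSize, func(chunk []byte) error {
		sizes = append(sizes, len(chunk))
		return nil
	})
	return sizes, err
}

func main() {
	sizes, err := chunkLengths(make([]byte, 5000), 2048)
	fmt.Println(sizes, err)
}
```

To read from a pipe as in the rec example, os.Stdin would be passed as the reader instead of an in-memory buffer.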

Instead of using go run, you can use the audstr_client command from a published release:

$ unzip audio_demo.zip
$ cd audio_streaming
$ ./audstr_client

End the recording with CTRL-c.

Recorded audio is saved in the data folder in the audio_streaming directory. The last recorded file is always saved in raw format as data/latest.raw, and with a wav header as data/latest.wav (the wav header is a work in progress).

Testing a browser's AudioWorklet compatibility

Not all browsers support AudioWorklet yet (see above). Here's how to do a quick test:

Start the server (see above)
Point your browser to http://localhost:7651/audioworklet
Make sure you have audio in (microphone) and audio out enabled and fully functioning (use headphones if you can, to avoid audio feedback)
Click the START button and start talking

If your voice echoes back, your browser supports AudioWorklet.

If you cannot hear anything, the AudioWorklet probably doesn't work (or there could be something wrong with your audio settings). More information can usually be found in the browser's Console output. In Firefox, for example, you might see the following error message: TypeError: context.audioWorklet is undefined.

If you have audio feedback issues, click STOP or reload the page.

Package last updated on 19 May 2021
