
Product
Introducing Tier 1 Reachability: Precision CVE Triage for Enterprise Teams
Socket’s new Tier 1 Reachability filters out up to 80% of irrelevant CVEs, so security teams can focus on the vulnerabilities that matter.
@picovoice/picovoice-node-demo
Advanced tools
Picovoice SDK for NodeJS file-based and microphone demos
This package provides two demonstration command-line applications for Picovoice: a file-based demo, which scans a compatible WAV file, and a microphone demo.
Made in Vancouver, Canada by Picovoice
Picovoice is an end-to-end platform for building voice products on your terms. It enables creating voice experiences similar to Alexa and Google, but it runs entirely on-device. Picovoice is:
These demos run Rhino on NodeJS 16+ on the following platforms:
These demos and the bindings upon which they are built are for NodeJS and do not work in a browser. Looking to run Picovoice in-browser? There are npm packages available for Web, and dedicated packages for Angular, React, and Vue.
Picovoice requires a valid Picovoice AccessKey
at initialization. AccessKey
acts as your credentials when using Picovoice SDKs.
You can get your AccessKey
for free. Make sure to keep your AccessKey
secret.
Signup or Login to Picovoice Console to get your AccessKey
.
To install the demos and make them available on the command line, use either of the following yarn
or npm
commands:
yarn global add @picovoice/picovoice-node-demo
(or)
npm install -g @picovoice/picovoice-node-demo
Here is an example which will understand commands from the "Smart Lighting" demo from the Rhino GitHub repository (note that context files are platform-dependent; choose the appropriate one for the platform you are using; this demo uses the "mac" version)
Using the 'global' install methods above should add pv-mic-demo
to your system path, which we can use to run the mic demo.
Use pv-mic-demo
to run the mic demo. First select an input audio device to start recording audio and provide your Picovoice AccessKey with --access_key
.
pv-mic-demo --access_key ${ACCESS_KEY} --show_audio_devices
This command prints a list of the available devices and its inputs:
index: 0, device name: USB Audio Device
index: 1, device name: MacBook Air Microphone
Specify the input audio device with --audio_device_index
(this may be empty if you
wish to use system default microphone). In this example we will use USB Audio Device.
Specify the Wake Word (.ppn) with --keyword_file_path
and the Speech-to-Intent context (.rhn file) with --context_file_path
.
pv-mic-demo \
--access_key ${ACCESS_KEY} \
--audio_device_index 0 \
--keyword bumblebee \
--context_file_path ../../resources/rhino/resources/contexts/mac/smart_lighting_mac.rhn
You can use custom Wake Word files (.ppn) with --keyword_file_path
:
pv-mic-demo \
--access_key ${ACCESS_KEY} \
--audio_device_index 0 \
--keyword_file_path ./hey_edison.ppn \
--context_file_path ../../resources/rhino/resources/contexts/mac/smart_lighting_mac.rhn
The Rhino context source in YAML format will be output to show you the grammar and options that the context supports. First, the demo will listen for the wake word (Porcupine engine). Upon the wake word detection, the demo will switch to follow-on command inference (Rhino engine). The demo will listen for a phrase that the context understands, and upon reaching a conclusion (or timeout), it will output the results.
Using device: sof-hda-dsp Digital Microphone...
Context info:
-------------
context:
expressions:
changeColor:
- (please) [change, set, switch] (the) $location:location (to) $color:color
- (please) [change, set, switch] (the) $location:location color (to) $color:color
- (please) [change, set, switch] (the) $location:location lights (to) $color:color
... (etc.) ...
Listening for speech within the context of 'smart_lighting_mac'. Please speak your phrase into the microphone.
# (say "bumblebee", or the custom Porcupine keyword you chose)
Wake word 'bumblebee' detected
# (say e.g. "please turn on the lights in the kitchen")
...
Inference result:
{
"isUnderstood": true,
"intent": "changeLightState",
"slots": {
"state": "on",
"location": "kitchen"
}
}
Now try again, but this time say something that the context is not designed to understand, like "tell me a joke":
pv-mic-demo --access_key ${ACCESS_KEY} --context_file_path ../../resources/contexts/mac/smart_lighting_mac.rhn
...
Listening for speech within the context of 'smart_lighting_mac'. Please speak your phrase into the microphone.
# (say "bumblebee", or the custom Porcupine keyword you chose)
Wake word 'bumblebee' detected
# (say e.g. "tell me a joke")
Inference result:
{
"isUnderstood": false
}
The file-based demo allows you to scan a compatible wave file with Rhino. Note: The demo requires 16KHz, 16-bit linear PCM, single-channel (mono) WAV files.
To run the file-based demo, we need to provide a Porcupine keyword and Rhino Speech-to-Intent context, along with a path to a compatible WAV file.
We can use the WAV file that are bundled in the Picovoice GitHub repository. This is intended to be used with the sample "Coffee Maker" context and the "Picovoice" keyword, also available in the Picovoice GitHub repository (note that keyword and context files are platform-dependent; choose the appropriate one for the platform you are using; this demo uses the "mac" version of each file)
Run the file demo and the successful inference with the intent "orderDrink" along with the specific details are returned:
pv-file-demo \
--access_key ${ACCESS_KEY} \
--input_audio_file_path ../../resources/audio_samples/picovoice-coffee.wav \
--keyword_file_path ../../resources/porcupine/resources/keyword_files/mac/picovoice_mac.ppn \
--context_file_path ../../resources/rhino/resources/contexts/mac/coffee_maker_mac.rhn
...
Wake word 'picovoice' detected
Listening for speech within the context of 'coffee'
Inference:
{
"isUnderstood": true,
"intent": "orderDrink",
"slots": {
"size": "large",
"coffeeDrink": "coffee"
}
}
The microphone and file demos both have additional options.
To see the full set of options, use --help
:
pv-mic-demo --help
pv-file-demo --help
Options:
-a, --access_key <string> AccessKey obtain from the Picovoice Console (https://console.picovoice.ai/)
-i, --input_audio_file_path <string> input audio wave file in 16-bit 16KHz linear PCM format (mono)
-k, --keyword_file_path <string> absolute path(s) to porcupine keyword files (.ppn extension)
-b, --keyword <string> built in keyword(s) (alexa,americano,blueberry,bumblebee,computer,grapefruit,grasshopper,hey google,hey siri,jarvis,ok
google,picovoice,porcupine,terminator)
-c, --context_file_path <string> absolute path to rhino context (.rhn extension)
-s, --sensitivity <number> sensitivity value between 0 and 1 (default: 0.5)
-e, --requires_endpoint If set, Rhino requires an endpoint (chunk of silence) before finishing inference
--porcupine_library_file_path <string> absolute path to porcupine dynamic library
--porcupine_model_file_path <string> absolute path to porcupine model
--rhino_library_file_path <string> absolute path to rhino dynamic library
--rhino_model_file_path <string> absolute path to rhino model
-h, --help display help for command
The sensitivity is a floating point value in the range [0,1] which specifies the tradeoff between miss rate and false alarm. The demo defaults to 0.5. You can override this with --sensitivity
:
pv-mic-demo \
--access_key ${ACCESS_KEY} \
--keyword GRASSHOPPER \
--context_file_path ../../resources/rhino/resources/contexts/mac/coffee_maker_mac.rhn \
--sensitivity 0.65
To create keywords and train into PPN files, and to design Speech-to-Intent contexts and train them into RHN files, see the Picovoice Console.
Files generated with the Picovoice Console carry restrictions including (but not limited to): training allowance, time limits, available platforms, and commercial usage.
If desired, you may override the Porcupine and Rhino model and dynamic libraries by specifying their absolute paths with --porcupine_model_file_path
and --porcupine_library_file_path
, respectively and the Rhino model and dynamic libraries with --rhino_model_file_path
and --rhino_library_file_path
. As with keyword and context files, the dynamic libraries are specific to the platform.
e.g. for macOS (x86_64):
pv-mic-demo \
--access_key ${ACCESS_KEY} \
--keyword_file_path ../../resources/porcupine/resources/keyword_files/mac/picovoice_mac.ppn \
--context_file_path ../../resources/rhino/resources/contexts/mac/coffee_maker_mac.rhn \
--porcupine_library_file_path ../../resources/porcupine/lib/mac/x86_64/libpv_porcupine.dylib \
--porcupine_model_file_path ../../resources/porcupine/lib/common/porcupine_params.pv
--rhino_library_file_path ../../resources/rhino/lib/mac/x86_64/libpv_rhino.dylib \
--rhino_model_file_path ../../resources/rhino/lib/common/rhino_params.pv
From the demo/nodejs
folder, use one of yarn
or npm
to install the package dependencies:
cd demo/nodejs
yarn
(or)
cd demo/nodejs
npm install
From the demo/nodejs
folder, use yarn mic
(or npm run mic
) to run the mic demo. For npm run
, note the extra --
needed before specifying commands. This is to disambiguate whether the options are intended for npm or for the demo script. As before, pick a context that matches the platform you are using (these examples use 'mac'):
yarn mic \
--access_key ${ACCESS_KEY} \
--keyword AMERICANO \
--context_file_path ../../resources/rhino/resources/contexts/mac/coffee_maker_mac.rhn
(or)
npm run mic -- --access_key ${ACCESS_KEY} --keyword AMERICANO --context_file_path ../../resources/contexts/mac/coffee_maker_mac.rhn
From the demo/nodejs
folder, use yarn file
or npm run file
. For npm run
, note the extra --
needed before specifying commands. This is to disambiguate whether the options are intended for npm itself, or for the demo script.
yarn file \
--access_key ${ACCESS_KEY} \
--input_audio_file_path ../../resources/audio_samples/test_within_context.wav \
--keyword_file_path ../../resources/porcupine/resources/keyword_files/mac/terminator_mac.ppn \
--context_file_path ../../resources/rhino/resources/contexts/mac/coffee_maker_mac.rhn
(or)
npm run file -- \
--access_key ${ACCESS_KEY} \
--input_audio_file_path ../../resources/audio_samples/test_within_context.wav \
--keyword_file_path ../../resources/porcupine/resources/keyword_files/mac/terminator_mac.ppn \
--context_file_path ../../resources/rhino/resources/contexts/mac/coffee_maker_mac.rhn
FAQs
Picovoice SDK for NodeJS file-based and microphone demos
We found that @picovoice/picovoice-node-demo demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 6 open source maintainers collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Product
Socket’s new Tier 1 Reachability filters out up to 80% of irrelevant CVEs, so security teams can focus on the vulnerabilities that matter.
Research
/Security News
Ongoing npm supply chain attack spreads to DuckDB: multiple packages compromised with the same wallet-drainer malware.
Security News
The MCP Steering Committee has launched the official MCP Registry in preview, a central hub for discovering and publishing MCP servers.