@codexstar/pi-voice

Package was removed

Sorry, it seems this package was removed from the registry

Hold-to-talk voice input for Pi CLI — Deepgram streaming STT with live transcription, voice commands, and cross-platform support

Version: 3.0.2

Version published: 3 months ago

Weekly downloads: 0

Maintainers: 1

Created: 3 months ago

pi-voice

Hold-to-talk voice input for Pi.

What It Does

pi-voice adds hands-free voice input to the Pi coding agent CLI. Hold SPACE to record, release to transcribe — text appears in the editor in real time via Deepgram streaming STT.

Features

| Feature | Description |
| --- | --- |
| Hold-to-talk | Hold `SPACE` to record, release to stop; text streams into the editor live |
| Streaming transcription | Deepgram Nova 3 over WebSocket; interim results appear as you speak |
| Voice commands | "hey pi, run tests", "undo", "submit", "new line", "period" |
| Continuous dictation | `/voice dictate` for long-form speaking without holding keys |
| Double-escape clear | Press `Escape` twice to clear the editor |
| Cross-platform | macOS, Windows, Linux; Kitty protocol with a non-Kitty fallback |

Install

pi install npm:@codexstar/pi-voice

Prerequisites

SoX: microphone recording. Install with `brew install sox` (macOS), `apt install sox` (Linux), or `choco install sox` (Windows).
Deepgram API key: set the `DEEPGRAM_API_KEY` environment variable (Deepgram offers $200 in free credit).

Quick Setup

brew install sox                           # macOS
export DEEPGRAM_API_KEY="your-key-here"    # add to ~/.zshrc

Then open Pi — the onboarding wizard handles the rest.

Usage

Voice Input

| Action | Keybinding | Notes |
| --- | --- | --- |
| Record to editor | Hold `SPACE` | Release to finalize transcription |
| Toggle recording | `Ctrl+Shift+V` | Works in all terminals |
| Clear editor | `Escape` × 2 | Double-tap within 500ms |
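
The double-tap rule for `Escape` can be pictured with a small timing helper. This is only a sketch, not pi-voice's actual code: `createDoubleTapDetector` is a hypothetical name, timestamps are passed in explicitly, and treating a gap of exactly 500ms as a double tap is an assumption.

```typescript
// Hypothetical helper: returns a function that reports true when it is
// called twice within `windowMs` milliseconds.
function createDoubleTapDetector(windowMs: number = 500): (nowMs: number) => boolean {
  let lastTapAt: number | null = null;
  return (nowMs: number): boolean => {
    const isDouble = lastTapAt !== null && nowMs - lastTapAt <= windowMs;
    // Reset after a double tap so that a third press starts a new sequence.
    lastTapAt = isDouble ? null : nowMs;
    return isDouble;
  };
}
```

A key handler would call the returned function with the current time on every `Escape` press and clear the editor when it returns `true`.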

Commands

/voice              # Toggle voice on/off
/voice on           # Enable voice
/voice off          # Disable voice
/voice setup        # Run onboarding wizard
/voice test         # Test microphone + Deepgram pipeline
/voice info         # Show current config and status
/voice dictate      # Continuous dictation mode
/voice stop         # Stop active recording or dictation
/voice history      # Show recent transcriptions

Voice Commands

Say these during recording — they're detected and executed automatically:

| Trigger | Action |
| --- | --- |
| "hey pi, run tests" | Inserts `bun run test` |
| "undo" / "undo that" | Removes last word |
| "clear" / "clear all" | Clears editor |
| "submit" / "send it" | Submits editor content |
| "new line" | Inserts `\n` |
| "period" / "comma" / "question mark" | Inserts punctuation |
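
One way to picture the trigger table is as a lookup from normalized phrases to editor actions. The sketch below is illustrative only: the `VoiceAction` type, `COMMANDS` table, and `matchVoiceCommand` function are hypothetical names, and matching the whole utterance exactly (rather than finding commands inside longer speech) is an assumption about how detection might work.

```typescript
// Hypothetical action type covering the behaviors listed in the table.
type VoiceAction =
  | { kind: "insert"; text: string }
  | { kind: "undoWord" }
  | { kind: "clear" }
  | { kind: "submit" }
  | { kind: "none" };

// Trigger phrases (already normalized) mapped to their actions.
const COMMANDS: Array<[string[], VoiceAction]> = [
  [["hey pi run tests"], { kind: "insert", text: "bun run test" }],
  [["undo", "undo that"], { kind: "undoWord" }],
  [["clear", "clear all"], { kind: "clear" }],
  [["submit", "send it"], { kind: "submit" }],
  [["new line"], { kind: "insert", text: "\n" }],
  [["period"], { kind: "insert", text: "." }],
  [["comma"], { kind: "insert", text: "," }],
  [["question mark"], { kind: "insert", text: "?" }],
];

// Lowercase, drop punctuation the STT engine may add, collapse whitespace.
function normalize(phrase: string): string {
  return phrase.toLowerCase().replace(/[^a-z\s]/g, " ").replace(/\s+/g, " ").trim();
}

// Assumption: a command only fires when it is the entire utterance.
function matchVoiceCommand(transcript: string): VoiceAction {
  const phrase = normalize(transcript);
  for (const [triggers, action] of COMMANDS) {
    if (triggers.includes(phrase)) return action;
  }
  return { kind: "none" };
}
```

With this sketch, `matchVoiceCommand("Hey Pi, run tests.")` yields an insert action carrying `bun run test`, while ordinary dictation falls through to `{ kind: "none" }` and would be typed as text.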

How It Works

User holds SPACE
    ↓
SoX captures PCM audio from microphone
    ↓
Audio streams to Deepgram Nova 3 via WebSocket
    ↓
Interim transcripts update editor in real time
    ↓
User releases SPACE → CloseStream → final transcript
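
The interim-then-final flow above can be sketched without any network code. The block below is an assumption-laden illustration, not the extension's implementation: `buildListenUrl` and `TranscriptBuffer` are hypothetical names, and the 16 kHz mono `linear16` audio settings are guesses at what SoX might be configured to produce. The `wss://api.deepgram.com/v1/listen` endpoint, the `Results` message shape with `is_final`, and the `CloseStream` control message follow Deepgram's streaming API.

```typescript
// Subset of a Deepgram streaming message that this sketch relies on.
interface DeepgramResult {
  type: string;
  is_final?: boolean;
  channel?: { alternatives: Array<{ transcript: string }> };
}

// Build the streaming URL. Sample rate and channel count are assumptions.
function buildListenUrl(language: string = "en", sampleRate: number = 16000): string {
  const params: Record<string, string> = {
    model: "nova-3",
    language,
    encoding: "linear16",
    sample_rate: String(sampleRate),
    channels: "1",
    interim_results: "true",
  };
  const query = Object.keys(params)
    .map((key) => `${key}=${encodeURIComponent(params[key])}`)
    .join("&");
  return `wss://api.deepgram.com/v1/listen?${query}`;
}

// Keeps finalized segments and replaces the interim tail as results arrive.
class TranscriptBuffer {
  private finalized: string[] = [];
  private interim = "";

  apply(message: DeepgramResult): string {
    if (message.type !== "Results") return this.text();
    const transcript = message.channel?.alternatives[0]?.transcript ?? "";
    if (message.is_final) {
      if (transcript) this.finalized.push(transcript);
      this.interim = "";
    } else {
      this.interim = transcript;
    }
    return this.text();
  }

  // Text to show in the editor: finalized segments plus the current interim.
  text(): string {
    return [...this.finalized, this.interim].filter(Boolean).join(" ");
  }
}

// Control message sent when SPACE is released, asking for the final transcript.
const CLOSE_STREAM = JSON.stringify({ type: "CloseStream" });
```

Each parsed WebSocket message would be passed to `apply`, and the returned string written to the editor, so interim guesses are overwritten until Deepgram marks a segment final.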

Hold Detection

Two terminal modes are supported:

Kitty protocol (Ghostty, Kitty, WezTerm, Windows Terminal 1.22+): True key-down/repeat/release events. First press enters warmup immediately.

Non-Kitty (macOS Terminal, older terminals): Gap-based detection. Counts rapid key-repeat events to distinguish hold from tap.

Both modes require holding for ≥800ms before recording activates. Quick taps type a normal space.
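
For the Kitty-protocol case, where real press and release events are available, the 800ms rule can be sketched as a tiny state machine. This is a simplified illustration under stated assumptions: `HoldDetector` and its outcome names are hypothetical, time is injected as plain millisecond values, and the gap-based non-Kitty path is not modeled.

```typescript
type HoldOutcome = "idle" | "warmup" | "recording" | "typed-space" | "stopped";

// Hypothetical detector: a press starts a warmup, recording begins once the
// key has been held for `thresholdMs`, and an early release types a space.
class HoldDetector {
  private readonly thresholdMs: number;
  private pressedAt: number | null = null;
  private recording = false;

  constructor(thresholdMs: number = 800) {
    this.thresholdMs = thresholdMs;
  }

  // Key-down or key-repeat event. Repeats do not restart the warmup.
  press(nowMs: number): HoldOutcome {
    if (this.pressedAt === null) this.pressedAt = nowMs;
    return this.update(nowMs);
  }

  // Called on a timer or on repeat events to check the threshold.
  update(nowMs: number): HoldOutcome {
    if (this.pressedAt === null) return "idle";
    if (!this.recording && nowMs - this.pressedAt >= this.thresholdMs) {
      this.recording = true;
    }
    return this.recording ? "recording" : "warmup";
  }

  // Key-up event: either stop the recording or treat the press as a tap.
  release(nowMs: number): HoldOutcome {
    if (this.pressedAt === null) return "idle";
    this.update(nowMs);
    const outcome: HoldOutcome = this.recording ? "stopped" : "typed-space";
    this.pressedAt = null;
    this.recording = false;
    return outcome;
  }
}
```

A quick tap (press at 0ms, release at 200ms) returns `"typed-space"`, while a press held past 800ms reports `"recording"` and then `"stopped"` on release.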

Architecture

extensions/voice.ts             Main extension: recording, UI, state machine
extensions/voice/config.ts      Config loading, saving, migration
extensions/voice/onboarding.ts  First-run setup wizard

Configuration

Settings are stored in Pi's settings files under the `voice` key:

| Scope | Path |
| --- | --- |
| Global | `~/.pi/agent/settings.json` |
| Project | `<project>/.pi/settings.json` |

{
  "voice": {
    "version": 2,
    "enabled": true,
    "language": "en",
    "scope": "global",
    "onboarding": {
      "completed": true,
      "schemaVersion": 2
    }
  }
}
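
The shape of this settings block can be written down as a TypeScript type with a tolerant reader. The following is a sketch only: `VoiceConfig`, `DEFAULT_VOICE_CONFIG`, and `readVoiceConfig` are hypothetical names, the default values are assumptions, and `"project"` as a second `scope` value is inferred from the scope table rather than confirmed by the README.

```typescript
// Mirrors the JSON example above. The "project" scope value is an assumption.
interface VoiceConfig {
  version: number;
  enabled: boolean;
  language: string;
  scope: "global" | "project";
  onboarding: { completed: boolean; schemaVersion: number };
}

// Hypothetical defaults used when the settings file has no `voice` key.
const DEFAULT_VOICE_CONFIG: VoiceConfig = {
  version: 2,
  enabled: false,
  language: "en",
  scope: "global",
  onboarding: { completed: false, schemaVersion: 2 },
};

// Parse a settings file's text and fill in any missing voice fields.
function readVoiceConfig(settingsJson: string): VoiceConfig {
  const settings = JSON.parse(settingsJson) as { voice?: Partial<VoiceConfig> };
  const voice = settings.voice ?? {};
  return {
    ...DEFAULT_VOICE_CONFIG,
    ...voice,
    onboarding: { ...DEFAULT_VOICE_CONFIG.onboarding, ...voice.onboarding },
  };
}
```

Reading the file contents from `~/.pi/agent/settings.json` or `<project>/.pi/settings.json` is left to the caller, which keeps the function easy to test with plain strings.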

Troubleshooting

/voice test     # Test full pipeline (mic + Deepgram)

| Problem | Solution |
| --- | --- |
| "DEEPGRAM_API_KEY not set" | `export DEEPGRAM_API_KEY="your-key"` in `~/.zshrc` |
| "SoX error" | `brew install sox` (macOS) or `apt install sox` (Linux) |
| Space doesn't activate | Check `/voice info`; voice may be disabled |
| Double space in editor | Increase typing cooldown or use `Ctrl+Shift+V` |

See docs/troubleshooting.md for more.

Security

Cloud STT: Audio is sent to Deepgram for transcription. No local fallback.
No telemetry: pi-voice does not collect or transmit usage data.
API key: Stored in env var or Pi settings file — never logged or exposed in errors.

See SECURITY.md for vulnerability reporting.

Package last updated on 14 Mar 2026
