Big News: Socket raises $60M Series C at a $1B valuation to secure software supply chains for AI-driven development.Announcement
Sign In

@askalf/hands

Package Overview
Dependencies
Maintainers
1
Versions
5
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

@askalf/hands

Computer-use agent. Your LLM on your mouse, keyboard, and screen. PowerShell-first, voice optional, safety guardrails. Routes through dario or any Anthropic-compat endpoint.

Source
npmnpm
Version
0.1.0
Version published
Weekly downloads
23
-86.86%
Maintainers
1
Weekly downloads
 
Created
Source

hands

Your LLM on your mouse, keyboard, and screen.

One npm install. Hands-on computer control — PowerShell-first for speed, screenshot tool for visual verification when needed, voice input optional. Routes through dario or any Anthropic-compat endpoint — keep using your Claude Max subscription and pay zero per-token on the happy path.

Status: seeded from @askalf/hands's v0.3.7 tree (commit bef177d), the last standalone computer-use state before that repo pivoted to a fleet-bridge role. This repo is the continuation of that work — pre-1.0, modernization in progress.

Install

npm i -g @askalf/hands

Requires Node.js 20+ and Claude CLI.

Quick Start

# 1. Install Claude CLI (if you don't have it)
npm i -g @anthropic-ai/claude-code
claude auth login

# 2. Authenticate
hands auth
# Select "Claude Login" (recommended)

# 3. Run
hands run "open notepad and type hello world"

# 4. Voice mode — talk to your computer
hands voice-setup          # one-time: downloads whisper.cpp
hands run "open notepad" --voice

That's it. Claude opens Notepad, types "Hello World", then asks "What next?" — type or speak your next command.

How It Works

$ hands run "open chrome and go to amazon.com"

✔ AskAlf Agent — Computer Control
ℹ Using Claude subscription (no per-token costs)
ℹ Type "exit" or Ctrl+C to quit

ℹ → open chrome and go to amazon.com

✔ Chrome is open with Amazon loaded.
ℹ (6 turns)

❯ What next? open notepad and type hello world

✔ Notepad now has "Hello World" in it.
ℹ (14 turns)

❯ What next?
🎙 Listening... (press Enter to stop)
Heard: "minimize everything and open spotify"

✔ Desktop minimized and Spotify is now open.
ℹ (4 turns)

❯ What next? exit
ℹ Session ended.

PowerShell-first — Claude runs PowerShell commands directly to open apps, browse the web, manage files, and automate tasks. No slow screenshot loops. A screenshot MCP tool is available when Claude needs to visually verify what's on screen, but most tasks complete entirely through PowerShell.

Voice control — Add --voice to speak commands instead of typing. Uses local whisper.cpp for transcription — free, private, completely offline. No cloud APIs, no data leaves your machine.

Authentication

Uses your existing Claude Pro/Max subscription. Zero extra API costs. This is the default.

npm i -g @anthropic-ai/claude-code
claude auth login
hands auth
# Select "Claude Login"

API Key (Fallback)

Paste your Anthropic API key. Pay per token. Uses the Anthropic SDK directly with the computer_20251124 tool.

hands auth
# Select "API Key" → paste your sk-ant-... key

Note: SDK mode uses computer-use API calls which cost per token. A simple task like "open notepad" can cost several dollars. Claude Login mode is strongly recommended.

Routing through dario (zero per-token cost in SDK mode)

If you're running dario locally, hands will auto-route SDK-mode calls through it — including SDK mode's computer-use calls, which can then bill against your Claude Max subscription instead of per-token API overage. The @anthropic-ai/sdk client defaults its baseURL and apiKey to the standard env vars, so this works with zero hands-side config:

# in whatever shell starts hands:
export ANTHROPIC_BASE_URL=http://localhost:3456
export ANTHROPIC_API_KEY=dario            # or your DARIO_API_KEY if you set one

dario proxy                                # keep this running
hands run "open notepad"

Verify the routing is live with hands check (reports the effective base URL) or by watching dario proxy -v while hands is running — a request should show up in dario's log. Claude Login mode (the default) spawns the claude CLI child process directly, so the env-var-routing flow only matters for SDK mode.

Commands

hands run "<prompt>"

Start an interactive computer control session.

hands run "resize all images in ./assets to 800px wide"
hands run "open VS Code and create a Flask hello world app"
hands run "go to github.com and star the askalf/hands repo"

Each task completes and prompts "What next?" for follow-up commands. Type exit or hit Ctrl+C to end the session.

Options:

  • -v, --voice — Use voice input (microphone → whisper transcription)
  • -m, --model <model> — Model to use (default: claude-sonnet-4-6)
  • -b, --budget <amount> — Max budget in USD for SDK mode (default: 5.00)
  • -t, --turns <count> — Max turns per task (default: 50)

hands auth

Configure authentication interactively.

  • hands auth --status — Show current auth status

hands voice-setup

Download whisper.cpp binary and speech model for voice control. One-time setup.

hands voice-setup                # default: base.en model (~148MB)
hands voice-setup --model tiny   # smaller/faster (~75MB)
hands voice-setup --model small  # more accurate (~466MB)

hands check

Verify platform dependencies are installed (including voice/whisper status).

hands config

View or update configuration.

hands config --model claude-opus-4-6 --turns 100

What It Can Do

CapabilityHow
Open appsStart-Process chrome, Start-Process notepad
Browse the webOpens Chrome, navigates sites, fills forms
Manage filesCreate, move, read, edit files anywhere on your system
Run commandsGit, npm, Docker, Python — any CLI tool
See your screenScreenshot tool for visual verification when needed
Voice controlSpeak commands via local whisper.cpp — offline, private
Chain tasksInteractive loop — complete a task, ask "What next?"
Session memoryRemembers what worked and what failed across the session
Self-correctionLearns from errors within the session and adapts approach

Safety Guardrails

Built-in command guardrails prevent catastrophic operations before they reach the shell:

  • Hard blocks — recursive root deletion, disk formatting, registry destruction, boot config changes, firewall disabling, user account creation, ransomware-like encryption patterns
  • System prompt injection — Claude is instructed to verify destructive operations and prefer safe alternatives
  • Voice pipeline hardening — input validation, temp file cleanup, process isolation

Platform Support

OSStatusComputer Control
WindowsFull supportPowerShell (pre-installed)
macOSFull supportcliclick (brew install cliclick)
Linux (X11)Full supportxdotool + scrot (apt install xdotool scrot)
Linux (Wayland)Full supportydotool + grim (apt install ydotool grim)

Voice control requires SoX (Windows/macOS) or arecord (Linux, pre-installed). Whisper binary is downloaded automatically by voice-setup.

Run hands check to verify your setup.

Architecture

hands run "open chrome" --voice
        │
        ├── Input ─────────────────────────────
        │       │
        │       ├── --voice OFF: readline (keyboard)
        │       └── --voice ON:  mic → whisper.cpp → text
        │
        ├── Claude Login (default)
        │       │
        │       ├── Spawns claude CLI
        │       ├── --append-system-prompt (computer control agent)
        │       ├── --mcp-config (screenshot tool)
        │       ├── Claude uses built-in bash → PowerShell
        │       └── Interactive loop: task → "What next?" → repeat
        │
        └── API Key (fallback)
                │
                ├── Anthropic SDK direct
                ├── computer_20251124 + bash + text_editor tools
                └── Single-run with cost summary

The MCP server exposes a single screenshot tool. All other computer control happens through Claude's built-in bash tool running PowerShell commands — this is dramatically faster than screenshot-based control loops.

Configuration

Config stored at ~/.hands/config.json:

{
  "authMode": "oauth",
  "model": "claude-sonnet-4-6",
  "maxBudgetUsd": 5.00,
  "maxTurns": 50,
  "voice": {
    "whisperModel": "base",
    "silenceThresholdDb": -40,
    "silenceDurationMs": 1500
  }
}

Full Platform

This CLI is a standalone computer control agent. For the full autonomous fleet — 7 core agents, persistent memory, 16 communication channels (including OpenClaw bridge), 28 marketplace packages, and a mission control dashboard:

curl -fsSL https://get.askalf.org | bash

askalf.org | GitHub | Architecture

License

MIT

Keywords

ai

FAQs

Package last updated on 24 Apr 2026

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts