🚀 Socket Launch Week Day 5:Introducing Repository Access Permissions and Custom Roles.Learn more →

hanzi-in-chrome

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

hanzi-in-chrome

Give your AI agent a real browser — click, type, fill forms, test workflows, post content, and read authenticated pages

latest

Source

npm

Version: 2.1.4

Version published: 3 months ago

Maintainers: 1

Created: 3 months ago

Source

Hanzi in Chrome — MCP Server

The MCP server exposes browser tools to MCP clients and forwards browser work to the Chrome extension over the local WebSocket relay.

Setup

cd mcp-server
npm install
npm run build

Add to your MCP config (e.g., ~/.claude/claude_desktop_config.json):

{
  "mcpServers": {
    "browser": {
      "command": "node",
      "args": ["/path/to/hanzi-in-chrome/mcp-server/dist/index.js"]
    }
  }
}

Prerequisites: The Chrome extension must be installed and running. See the main README for full setup.

How It Works

MCP client
  -> mcp-server (stdio)
  -> relay (WebSocket)
  -> Chrome extension
  -> browser agent

The extension is the browser executor. The MCP server should only manage MCP tool calls, local session bookkeeping, and blocking waits for completion.

Tools

`browser_start`

Start a browser task. Blocks until complete or timeout.

browser_start(
  task: "Search for flights to Tokyo on Google Flights",
  url: "https://flights.google.com",        // optional starting URL
  context: "Departing March 15, economy"     // optional extra info
)

→ {
  "session_id": "abc123",
  "status": "complete",
  "task": "Search for flights to Tokyo...",
  "answer": "Found 3 flights: JAL $850, ANA $920, United $780",
  "total_steps": 8,
  "recent_steps": ["Opened Google Flights", "Set destination to Tokyo", ...]
}

`browser_message`

Send follow-up instructions to an existing session. Also blocks until the agent finishes.

browser_message(session_id: "abc123", message: "Book the cheapest one")

`browser_status`

Check known sessions and their latest status.

browser_status()                    // all active sessions
browser_status(session_id: "abc123") // specific session

`browser_stop`

Stop a task.

browser_stop(session_id: "abc123")
browser_stop(session_id: "abc123", remove: true)  // also delete session

`browser_screenshot`

Capture the current browser state as an image.

browser_screenshot(session_id: "abc123")

Examples

Research:

browser_start("Find the top 3 competitors for Acme Corp and summarize their pricing")

Logged-in workflows:

browser_start("Go to Jira, find my open tickets, and summarize what needs attention this week")

Multi-turn:

s = browser_start("Go to LinkedIn and find AI Engineer jobs in Montreal")
→ { session_id: "x1", answer: "Found: Applied AI Engineer at Cohere" }

browser_message("x1", "Click into that job and tell me the requirements")
→ { answer: "Requirements: 3+ years Python, ML experience..." }

browser_message("x1", "Apply to this job using my profile")
→ { answer: "Application submitted successfully" }

Parallel execution:

browser_start("Check flight prices to Tokyo")
browser_start("Check hotel prices in Shibuya")
browser_start("Look up train pass costs")
// All three run simultaneously

Configuration

Environment Variable	Default	Description
`HANZI_IN_CHROME_MAX_SESSIONS`	`5`	Max concurrent browser tasks
`WS_RELAY_PORT`	`7862`	WebSocket relay port

Architecture

AI Tool (Claude Code, Cursor, etc.)
    ↓ MCP Protocol (stdio)
MCP Server
    ↓ WebSocket
Relay Server
    ↓ WebSocket
Chrome Extension
    ↓ Extension agent loop
Target Website

The relay server starts automatically when the MCP server connects. It routes messages between the MCP server and the Chrome extension and briefly queues messages while the extension service worker is asleep.

Principle: Hanzi is for real browser work in your signed-in Chrome. Agents should prefer code, logs, APIs, and existing tools first. Use Hanzi when the job needs a real browser session.

Prompts

The server exposes MCP prompts that clients auto-discover as slash commands:

Prompt	Description
`linkedin-prospector`	Goal-driven LinkedIn outreach — networking, sales, partnerships, or hiring
`e2e-tester`	Test your app in a real browser — reports bugs with screenshots and code references
`social-poster`	Post across LinkedIn, Twitter, Reddit, HN — drafts per-platform, posts from your browser

In Claude Code, use the built-in linkedin-prospector prompt from the MCP prompt list.

Skills CLI

hanzi-browser skills                              # list available skills
hanzi-browser skills install linkedin-prospector   # install SKILL.md to your project

Skills are portable SKILL.md files for agents that don't support MCP prompts (Cline, Codex). Each skill follows the same principle: use existing tools first, Hanzi only for real browser steps.

License

Polyform Noncommercial 1.0.0

Keywords

model-context-protocol

FAQs

What is hanzi-in-chrome?

Is hanzi-in-chrome well maintained?

Package last updated on 16 Mar 2026

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

hanzi-in-chrome

Hanzi in Chrome — MCP Server

Setup

How It Works

Tools

browser_start

browser_message

browser_status

browser_stop

browser_screenshot

Examples

Configuration

Architecture

Prompts

Skills CLI

License

Keywords

Related posts

Introducing Repository Access Permissions and Custom Roles

Socket MCP Adds Org Alerts, Threat Feed Review, and Package Inspection

`browser_start`

`browser_message`

`browser_status`

`browser_stop`

`browser_screenshot`