
Security News
CVE Volume Surges Past 48,000 in 2025 as WordPress Plugin Ecosystem Drives Growth
CVE disclosures hit a record 48,185 in 2025, driven largely by vulnerabilities in third-party WordPress plugins.
@runhuman/mcp-server
Advanced tools
Model Context Protocol (MCP) server for Runhuman - Human-powered QA testing for AI agents
A Model Context Protocol (MCP) server that allows AI agents to interact with the Runhuman QA testing service.
This MCP server provides tools for creating and managing human QA jobs through the Runhuman API. AI agents can use this server to:
Get your API key at: https://runhuman.com/dashboard/api-keys
Add to your Claude Desktop config (~/Library/Application Support/Claude/claude_desktop_config.json on Mac):
{
"mcpServers": {
"runhuman": {
"command": "npx",
"args": ["-y", "@runhuman/mcp-server", "--api-key=qa_live_xxxxxxxxxxxxx"]
}
}
}
That's it! The server will be automatically downloaded and run by Claude.
From the monorepo root:
npm install
npm run build --workspace=@runhuman/mcp-server
# Run with API key
node packages/mcp-server/dist/index.js --api-key=qa_live_xxxxx
create_jobCreate a new QA job with human testers.
Parameters:
url (string): The URL to testdescription (string): Instructions for the human tester describing what to testschema (object): Expected result schema that the tester response will be extracted intotargetDurationMinutes (number, optional): Time limit for tester (default: 5, range: 1-60)wait_for_resultCheck status, wait, and retrieve results for a QA job in a single convenient call.
Parameters:
jobId (string): The ID of the job to checkwaitSeconds (number, optional): How long to wait before checking again (default: 30, range: 1-300)Behavior:
Usage Pattern:
// After creating a job, call repeatedly with increasing wait times:
let result = await wait_for_result(jobId, { waitSeconds: 30 });
if (result.status !== 'completed') {
result = await wait_for_result(jobId, { waitSeconds: 45 });
}
if (result.status !== 'completed') {
result = await wait_for_result(jobId, { waitSeconds: 60 });
}
Returns:
result: Structured test results extracted from tester's responsestatus: Job status (completed, failed, timeout, pending, claimed, in_progress)costUsd: Exact cost in USD with full precision (e.g., 0.396)testDurationSeconds: Time spent by tester in seconds (rounded up)testerResponse: Raw natural language feedback from the human testertesterAlias: Anonymized tester name (e.g., "Tester Alpha")testerAvatarUrl: Avatar image URL for UI displaytesterColor: Hex color code for theming (e.g., "#4A90E2")Cost Calculation:
duration × $0.0018/second (general-use tier)The MCP server needs to be configured with your Runhuman API credentials.
Option A: Via Dashboard
npm run dev --workspace=@runhuman/apiqa_live_)Option B: Use Default Test Key
qa_live_test_key_123packages/api/data/api-keys.jsonCreate a .env file in the MCP server directory:
# For local development
RUNHUMAN_API_URL=http://localhost:3400
RUNHUMAN_API_KEY=qa_live_test_key_123
# For production
RUNHUMAN_API_URL=https://api.runhuman.com
RUNHUMAN_API_KEY=qa_live_xxxxxxxxxxxxxxxxxxxxx
Important: Never commit .env files to git! They're already in .gitignore.
Test your API key works:
curl http://localhost:3400/api/jobs \
-H "Authorization: Bearer qa_live_test_key_123" \
-H "Content-Type: application/json" \
-d '{"url":"https://example.com","description":"test","outputSchema":{}}'
Should return a job ID if authentication works.
For more details, see docs/API-AUTHENTICATION.md
The MCP server includes automated tests to verify it's working correctly:
# Build first
npm run build --workspace=@runhuman/mcp-server
# Run simple automated test
npm run test --workspace=@runhuman/mcp-server
# Or use the MCP Inspector (interactive testing)
npm run test:inspector --workspace=@runhuman/mcp-server
The test script will:
✅ Server initialized successfully
✅ Tools listed: create_job, wait_for_result
✅ create_job tool called successfully
# Watch mode (auto-rebuild on changes)
npm run dev --workspace=@runhuman/mcp-server
# Build
npm run build --workspace=@runhuman/mcp-server
# Test after building
npm run test --workspace=@runhuman/mcp-server
To use this MCP server with Claude Desktop, add it to your configuration:
{
"mcpServers": {
"runhuman": {
"command": "node",
"args": ["/path/to/qa-experiment/packages/mcp-server/dist/index.js"]
}
}
}
Once connected to an AI agent (like Claude), the agent can use these tools naturally:
User: "Can someone test my checkout page at https://myapp.com/checkout?"
Agent uses create_job:
✅ Job created successfully!
Job ID: job_abc123
Status: pending
...
Agent calls wait_for_result repeatedly until complete:
⏳ Job Status: in_progress
Waited 30s, job not complete yet.
💡 Suggestion: Call wait_for_result again with waitSeconds: 45
Finally:
✅ Test completed!
Results Summary:
- Checkout Flow: ✅ Working
- Payment Processing: ✅ Successful
...
For developers working on this MCP server:
FAQs
Model Context Protocol (MCP) server for Runhuman - Human-powered QA testing for AI agents
The npm package @runhuman/mcp-server receives a total of 566 weekly downloads. As such, @runhuman/mcp-server popularity was classified as not popular.
We found that @runhuman/mcp-server demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 2 open source maintainers collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Security News
CVE disclosures hit a record 48,185 in 2025, driven largely by vulnerabilities in third-party WordPress plugins.

Security News
Socket CEO Feross Aboukhadijeh joins Insecure Agents to discuss CVE remediation and why supply chain attacks require a different security approach.

Security News
Tailwind Labs laid off 75% of its engineering team after revenue dropped 80%, as LLMs redirect traffic away from documentation where developers discover paid products.