
Company News
Socket Named Top Sales Organization by RepVue
Socket won two 2026 Reppy Awards from RepVue, ranking in the top 5% of all sales orgs. AE Alexandra Lister shares what it's like to grow a sales career here.
@pleaseai/agent-eval-playground
Advanced tools
Web-based playground for browsing agent-eval experiment results
A web-based dashboard for browsing agent-eval experiment results. Ships as a standalone Next.js app
# Run from your eval project root (where results/ and evals/ live)
npx @pleaseai/agent-eval-playground
# With options
npx @pleaseai/agent-eval-playground --results-dir ./results --evals-dir ./evals --port 3001
| Flag | Default | Description |
|---|---|---|
--results-dir <dir> | ./results | Path to experiment results directory |
--evals-dir <dir> | ./evals | Path to eval fixtures directory |
--port, -p <port> | 3000 | HTTP server port |
--help, -h | Show help |
| Route | Description |
|---|---|
/ | Dashboard with stats, recent experiments, eval previews |
/experiments | Full experiment list |
/experiments/[name]/[timestamp] | Experiment detail with per-eval breakdown and run results |
/evals | All eval fixtures |
/evals/[name] | Eval detail with prompt, files, and content |
/compare | Side-by-side comparison of two experiment runs |
/transcript/[experiment]/[timestamp]/[evalName]/[run] | Full agent transcript viewer |
lib/data.ts reads fs directly)ComparePage dropdowns, TranscriptViewer collapsibles)force-dynamic on all pages — data can change between requestsThe CLI (bin.mjs) resolves the bundled next binary, sets RESULTS_DIR and EVALS_DIR as environment variables, and spawns next dev from the package directory. Server Components in lib/data.ts read directly from the filesystem.
Published alongside @pleaseai/agent-eval via changesets. Run npx changeset, select @pleaseai/agent-eval-playground, and merge to main. The release workflow handles the rest via OIDC.
FAQs
Web-based playground for browsing agent-eval experiment results
We found that @pleaseai/agent-eval-playground demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Company News
Socket won two 2026 Reppy Awards from RepVue, ranking in the top 5% of all sales orgs. AE Alexandra Lister shares what it's like to grow a sales career here.

Security News
NIST will stop enriching most CVEs under a new risk-based model, narrowing the NVD's scope as vulnerability submissions continue to surge.

Company News
/Security News
Socket is an initial recipient of OpenAI's Cybersecurity Grant Program, which commits $10M in API credits to defenders securing open source software.