Latest Threat ResearchGlassWorm Loader Hits Open VSX via Developer Account Compromise.Details
Socket
Book a DemoInstallSign in
Socket

@vercel/agent-eval-playground

Package Overview
Dependencies
Maintainers
373
Versions
7
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

@vercel/agent-eval-playground

Web-based playground for browsing agent-eval experiment results

latest
Source
npmnpm
Version
0.1.3
Version published
Weekly downloads
255
Maintainers
373
Weekly downloads
 
Created
Source

@vercel/agent-eval-playground

A web-based dashboard for browsing agent-eval experiment results. Ships as a standalone Next.js app

Usage

# Run from your eval project root (where results/ and evals/ live)
npx @vercel/agent-eval-playground

# With options
npx @vercel/agent-eval-playground --results-dir ./results --evals-dir ./evals --port 3001

CLI Options

FlagDefaultDescription
--results-dir <dir>./resultsPath to experiment results directory
--evals-dir <dir>./evalsPath to eval fixtures directory
--port, -p <port>3000HTTP server port
--help, -hShow help

Pages

RouteDescription
/Dashboard with stats, recent experiments, eval previews
/experimentsFull experiment list
/experiments/[name]/[timestamp]Experiment detail with per-eval breakdown and run results
/evalsAll eval fixtures
/evals/[name]Eval detail with prompt, files, and content
/compareSide-by-side comparison of two experiment runs
/transcript/[experiment]/[timestamp]/[evalName]/[run]Full agent transcript viewer

Architecture

  • Server Components for all data fetching (lib/data.ts reads fs directly)
  • Client Components only for interactivity (ComparePage dropdowns, TranscriptViewer collapsibles)
  • No API routes — all data is read server-side, no client-side fetching
  • No database — filesystem is the source of truth
  • force-dynamic on all pages — data can change between requests

How it works

The CLI (bin.mjs) resolves the bundled next binary, sets RESULTS_DIR and EVALS_DIR as environment variables, and spawns next dev from the package directory. Server Components in lib/data.ts read directly from the filesystem.

Publishing

Published alongside @vercel/agent-eval via changesets. Run npx changeset, select @vercel/agent-eval-playground, and merge to main. The release workflow handles the rest via OIDC.

FAQs

Package last updated on 09 Feb 2026

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts