
Security News
Frontier AI Is Now Critical Infrastructure
The Fable shutdown shows how quickly model access can become a business continuity risk for AI-dependent engineering teams.
@evalguard/sdk
Advanced tools
Official EvalGuard SDK — LLM evaluation, red-team security, runtime guardrails, observability, and FinOps. Note: a third-party package named `evalguard` exists on npm and is not affiliated with EvalGuard, Inc.
Official Node.js/TypeScript SDK for the EvalGuard API -- evaluate, red-team, and guard LLM applications programmatically.
npm install @evalguard/sdk
import { EvalGuard } from "@evalguard/sdk";
const client = new EvalGuard({ apiKey: "eg_live_..." });
// Run an evaluation
const evalResult = await client.eval({
name: "qa-check",
projectId: "my-project",
model: "gpt-4o",
prompt: "Answer: {{input}}",
cases: [
{ input: "What is 2+2?", expectedOutput: "4" },
],
scorers: ["exact-match", "contains"],
});
console.log(`Score: ${evalResult.score}/${evalResult.maxScore} · ${(evalResult.passRate * 100).toFixed(0)}% pass`);
// Run a security scan
const scan = await client.securityScan({
projectId: "my-project",
model: "gpt-4o",
prompt: "You are a helpful assistant.",
attackTypes: ["prompt-injection", "jailbreak", "data-extraction"],
});
console.log(`Scan ID: ${scan.id}`);
// Fetch a historical run by its id (run ids come from client.listEvals())
const run = await client.getEvalRun("evalrun_..." /* a run id from listEvals() */);
console.log(`Status: ${run.status}, Score: ${run.score}`);
// Send trace data
await client.trace({
projectId: "my-project",
sessionId: "session-123",
steps: [
{ type: "llm", input: "Hello", output: "Hi there!", duration: 450 },
],
});
const client = new EvalGuard({
apiKey: "eg_live_...",
baseUrl: "https://your-self-hosted-instance.com/api/v1", // optional
});
| Method | Description |
|---|---|
client.eval(params) | Run an evaluation with scorers and test cases |
client.getEvalRun(id) | Fetch results of a specific eval run |
client.securityScan(params) | Run a red-team security scan against a model |
client.trace(params) | Send agent/LLM trace data for monitoring |
The SDK exports all types from @evalguard/core for full type safety:
import type { EvalGuardConfig } from "@evalguard/sdk";
Full documentation at evalguard.ai/docs/sdk.
Apache-2.0 -- see LICENSE for details.
FAQs
Official EvalGuard SDK — LLM evaluation, red-team security, runtime guardrails, observability, and FinOps. Note: a third-party package named `evalguard` exists on npm and is not affiliated with EvalGuard, Inc.
The npm package @evalguard/sdk receives a total of 545 weekly downloads. As such, @evalguard/sdk popularity was classified as not popular.
We found that @evalguard/sdk demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Security News
The Fable shutdown shows how quickly model access can become a business continuity risk for AI-dependent engineering teams.

Security News
AI agents are pulling packages into environments no scanner is watching, creating exposure before security teams can see it.

Security News
GitHub Actions checkout now blocks risky pull_request_target checkouts by default to help prevent pwn request supply chain attacks.