
@plasius/ai-evals
Golden datasets, scorecards, and cost-quality evaluation contracts for Plasius AI workloads.
Golden datasets, scorecards, threshold, and regression comparison contracts for Plasius AI workloads.
This package is part of the layered @plasius/ai-* package family. It provides evaluator contracts, fixture definitions, and scorecard utilities used for AI quality and safety governance.
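To make "regression comparison" concrete, here is a minimal sketch of comparing a current run against a known-good baseline. Every name in it (`BaselineMetric`, `findRegressions`, `tolerance`, `higherIsBetter`) is hypothetical and defined locally for illustration; none of it is taken from the @plasius/ai-evals API, whose actual comparison contract may differ.

```typescript
// Hypothetical shapes for this sketch only; NOT part of @plasius/ai-evals.
interface BaselineMetric {
  metricId: string;
  value: number;
  higherIsBetter: boolean; // e.g. true for quality, false for latency
  tolerance: number; // allowed worsening before a regression is flagged
}

interface CurrentMetric {
  metricId: string;
  value: number;
}

interface Regression {
  metricId: string;
  baseline: number;
  current: number;
  delta: number; // current minus baseline
}

// Return every metric that got worse than its baseline by more than the
// per-metric tolerance. Metrics missing from the current run are skipped
// here; a stricter policy could treat them as failures instead.
function findRegressions(
  baseline: readonly BaselineMetric[],
  current: readonly CurrentMetric[],
): Regression[] {
  const currentById = new Map<string, number>();
  for (const metric of current) {
    currentById.set(metric.metricId, metric.value);
  }

  const regressions: Regression[] = [];
  for (const base of baseline) {
    const value = currentById.get(base.metricId);
    if (value === undefined) continue;
    const delta = value - base.value;
    // Normalise so that a positive number always means "worse".
    const worsening = base.higherIsBetter ? -delta : delta;
    if (worsening > base.tolerance) {
      regressions.push({
        metricId: base.metricId,
        baseline: base.value,
        current: value,
        delta,
      });
    }
  }
  return regressions;
}

const regressions = findRegressions(
  [
    { metricId: "quality", value: 0.9, higherIsBetter: true, tolerance: 0.02 },
    { metricId: "latency", value: 320, higherIsBetter: false, tolerance: 50 },
  ],
  [
    { metricId: "quality", value: 0.95 },
    { metricId: "latency", value: 400 },
  ],
);
console.log(regressions.map((r) => r.metricId));
```

The tolerance is kept per metric because a single absolute value rarely makes sense across metrics with different units, such as a 0 to 1 quality score and a latency in milliseconds.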
ai.evals-scorecards.enabled
AI_EVALS_SCORECARDS_ENABLED
AI_EVALS
The package supports:
npm install @plasius/ai-evals
import {
  AiEvalFixtureAdapter,
  AI_EVALS_FEATURE_FLAG_ID,
  AiEvalMetricExpectation,
  defineAiEvalGoldenDataset,
  evaluateAiEvalScorecard,
  isAiEvalsScorecardsEnabled,
} from "@plasius/ai-evals";

const expectations: readonly AiEvalMetricExpectation[] = [
  { metricId: "quality", threshold: { min: 0.8 } },
  { metricId: "latency", threshold: { max: 500 } },
];

const dataset = defineAiEvalGoldenDataset({
  datasetId: "example-1",
  version: "1.0.0",
  name: "Example moderation fixtures",
  taskType: "moderation",
  baselineExpectations: expectations,
  fixtureCases: [
    { fixtureId: "case-1", input: { prompt: "flag-check" } },
    { fixtureId: "case-2", input: { prompt: "safe-response" } },
  ],
});

const adapter: AiEvalFixtureAdapter<{ prompt: string }> = {
  adapterId: "fake-golden-adapter",
  tier: "development",
  async runFixture(fixture) {
    return {
      fixtureId: fixture.fixtureId,
      metrics: [
        { metricId: "quality", value: 0.93 },
        { metricId: "latency", value: 320 },
      ],
    };
  },
};

if (isAiEvalsScorecardsEnabled({ AI_EVALS_SCORECARDS_ENABLED: "true" })) {
  const scorecard = await evaluateAiEvalScorecard({
    runId: "manual-smoke",
    dataset,
    adapter,
    featureEnabled: true,
  });
  console.log(AI_EVALS_FEATURE_FLAG_ID, scorecard.status);
}
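The expectations in the example pair each metric with a `min` or `max` bound. As an illustration of how such bounds could be checked against reported metrics, here is a minimal standalone sketch. The types and functions (`Threshold`, `checkThreshold`, `checkExpectations`) are hypothetical and defined locally; they only mirror the shapes shown above and are not the package's implementation, which may treat missing metrics or status values differently.

```typescript
// Hypothetical local types that mirror the shapes in the example above.
// They are NOT imported from @plasius/ai-evals.
interface Threshold {
  min?: number;
  max?: number;
}

interface MetricExpectation {
  metricId: string;
  threshold: Threshold;
}

interface MetricValue {
  metricId: string;
  value: number;
}

interface MetricVerdict {
  metricId: string;
  passed: boolean;
  reason: string;
}

// Check one observed value against an optional lower and upper bound.
function checkThreshold(value: number, threshold: Threshold): boolean {
  if (threshold.min !== undefined && value < threshold.min) return false;
  if (threshold.max !== undefined && value > threshold.max) return false;
  return true;
}

// Compare every expectation with the reported metrics. A metric that was
// not reported counts as a failure (an assumption of this sketch).
function checkExpectations(
  expectations: readonly MetricExpectation[],
  metrics: readonly MetricValue[],
): MetricVerdict[] {
  const valueById = new Map<string, number>();
  for (const metric of metrics) {
    valueById.set(metric.metricId, metric.value);
  }

  return expectations.map(({ metricId, threshold }) => {
    const value = valueById.get(metricId);
    if (value === undefined) {
      return { metricId, passed: false, reason: "metric not reported" };
    }
    const passed = checkThreshold(value, threshold);
    return {
      metricId,
      passed,
      reason: passed ? "within threshold" : `value ${value} outside threshold`,
    };
  });
}

const verdicts = checkExpectations(
  [
    { metricId: "quality", threshold: { min: 0.8 } },
    { metricId: "latency", threshold: { max: 500 } },
  ],
  [
    { metricId: "quality", value: 0.93 },
    { metricId: "latency", value: 320 },
  ],
);
const allPassed = verdicts.every((verdict) => verdict.passed);
console.log(allPassed ? "pass" : "fail");
```

With the values from the example adapter, both metrics fall inside their bounds, so this sketch reports a passing result.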
npm install
npm run build
npm test
npm run test:coverage
npm run pack:check
Disable ai.evals-scorecards.enabled to avoid automatic production-grade evaluation runs, and rerun with known-good baseline scorecards.
Apache-2.0
FAQs
The npm package @plasius/ai-evals receives a total of 149 weekly downloads. As such, @plasius/ai-evals popularity was classified as not popular.
We found that @plasius/ai-evals demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.