Big News: Socket raises $60M Series C at a $1B valuation to secure software supply chains for AI-driven development.Announcement
Sign In

@agentsystemlabs/core

Package Overview
Dependencies
Maintainers
1
Versions
6
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

@agentsystemlabs/core

Install AgentSystem skills into any project — works with Claude Code, Codex, Cursor, OpenCode, or any tool that reads SKILL.md files.

latest
Source
npmnpm
Version
0.50.2
Version published
Maintainers
1
Created
Source

AgentSystem

Website: agentsystem.dev · Mission Control Pro: agentsystem.dev

npx @agentsystemlabs/core init --harness claude

A Claude Code plugin marketplace providing agentsystem-core — 37 skills focused on forcing AI to ship production-ready code: feature delivery, debugging, code-quality enforcement, audits, release management. The core plugin also ships 16 read-only reviewer subagents that the skills fan out to for security, perf, contracts, concurrency, data-integrity, accessibility, and bundle audits.

Install

Claude Code (plugin marketplace)

/plugin marketplace add https://github.com/AgentSystemLabs/core
/plugin install agentsystem-core@agentsystem

To update later: /plugin marketplace update agentsystem. To remove: /plugin uninstall agentsystem-core@agentsystem.

Manual install (any tool that reads SKILL.md)

For users not on Claude Code — Claude Agent SDK, Cursor, custom agents, or anything that loads Anthropic-format skills from disk — install via the npm CLI:

# One-shot via npx
npx @agentsystemlabs/core init                                    # → ./.claude/{skills,agents}/
npx @agentsystemlabs/core init --harness codex                    # → ./.codex/{skills,agents}/
npx @agentsystemlabs/core init --harness cursor                   # → ./.cursor/{skills,agents}/
npx @agentsystemlabs/core init --harness opencode                 # → ./.opencode/{skills,agents}/
npx @agentsystemlabs/core init --global                           # → ~/.claude/{skills,agents}/
npx @agentsystemlabs/core init --harness codex --global           # → ~/.codex/{skills,agents}/
npx @agentsystemlabs/core init --harness cursor --global          # → ~/.cursor/{skills,agents}/
npx @agentsystemlabs/core init --harness opencode --global        # → ~/.config/opencode/{skills,agents}/
npx @agentsystemlabs/core init --plugin core                      # subset (short name)
npx @agentsystemlabs/core init --dest ./my-skills                 # custom skills path
npx @agentsystemlabs/core init --skip-agents                      # skills only, no subagents
npx @agentsystemlabs/core list                                    # show available skills + subagents
npx @agentsystemlabs/core uninstall                               # remove installed skills + subagents

# Or install globally
npm install -g @agentsystemlabs/core
agentsystem init

Each skill lands as <harness-root>/skills/<skill-name>/SKILL.md (with its references/ folder if present), so any agent that reads SKILL.md format picks them up directly. Subagents land in parallel at <harness-root>/agents/<agent-name>.<ext>.md for Claude Code and Cursor (which read the same format), .toml for Codex (auto-converted from the source .md with descriptions truncated to Codex's 1024-char limit), and OpenCode-native .md for OpenCode (adds mode: subagent and read-only permissions). The supported harnesses are claude (alias: claude-code), codex, cursor (alias: cursor-cli), and opencode; --dest overrides the skills path and --agents-dest overrides the subagents path. By default init skips files that already exist — pass --force to overwrite. The --plugin flag accepts short names (e.g., core for agentsystem-core), can be repeated, or comma-separated. Pass --skip-agents to install only the skills.

OpenCode

OpenCode can run alongside Codex, Copilot, and OpenCode Go plan in the same project. Install AgentSystem skills and subagents into OpenCode's native layout:

npx @agentsystemlabs/core init --harness opencode                 # project: .opencode/{skills,agents}/
npx @agentsystemlabs/core init --harness opencode --global          # global: ~/.config/opencode/{skills,agents}/

Skills are discovered from .opencode/skills/*/SKILL.md and loaded on demand via OpenCode's skill tool. Subagents are written as markdown agents with mode: subagent so primary agents (Build, Plan) can invoke them via @mention or the Task tool. See the OpenCode agents, skills, and plugins docs for configuration beyond the install step.

Autopilot — start here

/ship <describe your goal>

/ship is the front door. Hand it any engineering goal and it does three things you'd otherwise have to do yourself:

  • Classifies intent — CREATE / EVOLVE / POLISH / REMOVE / FIX / AUDIT — from your phrasing.
  • Picks a depth modefast / balanced / production — from risk and complexity signals (auth, payments, migrations, destructive deletions, jobs/webhooks, external APIs, cache invalidation, concurrency-sensitive mutations, and multi-subsystem changes auto-bump to production).
  • Routes to the matching core skill (add-feature, modify-feature, polish-ui, remove-feature, fix-bug, audit) with the resolved mode, announces the pipeline, and runs it.

It deliberately stops at "code is production-ready" — never commits, pushes, or opens PRs. After it finishes, choose how to publish: /commit (grouped commits only), /commit-and-push, or /open-pr.

# auto-infer everything
/ship "add stripe webhook handler"            # → CREATE + production (touches payments)
/ship "tweak the navbar color"                # → EVOLVE + fast (cosmetic)
/ship "the login button doesn't redirect"     # → FIX + balanced
/ship "delete the old beta-flags page"        # → REMOVE + balanced

# explicit overrides
/ship "scaffold a settings page" mode=fast
/ship "add OAuth flow" mode=production skip=7c

# call the underlying core skill directly when you don't want orchestration
/add-feature "scaffold a settings page" mode=fast

The core skills (add-feature, modify-feature, fix-bug, remove-feature) all accept mode=fast|balanced|production and include= / skip= overrides — /ship is sugar over them, not a replacement.

Skills in agentsystem-core

Workflow & Planning

  • /ship — Autopilot orchestrator. Classifies intent, picks a depth mode, routes to the right core skill. Stops at code-ready.
  • /add-feature — End-to-end feature delivery in an existing codebase: clarify → explore → design → plan-approval gate → implement → verify → gated reviews → tests. Enforces UI-convention parity (modals/forms/drawers/dialogs must match siblings).
  • /modify-feature — Adds a small extension to an existing feature. Pre-flight interrogates the proposed shape, surfaces alternative seams, audits shifted contracts, then runs gated checks.
  • /remove-feature — Leaf-first deletion with persisted-data awareness (existing rows holding values from removed enums/columns are first-class concerns). Honors mode=fast|balanced|production.
  • /testing-plan — Produce a concrete manual-QA testing plan (actor / route / expected result) for a branch or feature. Refuses to invent credentials; append-not-overwrite.

Debugging & Fixing

  • /fix-bug — Diagnose silent integration failures by leading with the runtime contract before hypotheses. Trigger / dispatch / receive / observe trace. Modes: fast / balanced / production.
  • /fix-pr-tests — Classify a failing PR test as real-bug / stale-test / flake (3× re-run rule), then fix the right side. Never edits CI config to make a test pass.
  • /add-regression-test — Pin a bug fix with a test that fails without the fix and passes with it. Modes A/B/C/D for staged / committed / squashed states.
  • /handoff-codex — Hand off a stalled task to the OpenAI Codex CLI for a second pass. Builds a handoff packet (original goal, what was tried, live diff), fires codex exec headless, verifies what changed.
  • /resolve-conflict — Walk a merge/rebase conflict file by file. Identical/complementary/incompatible block typology, lockfile-regenerate vs hand-merge, never --theirs on a source file without reading both.

Code Quality & Refactor

  • /audit — Whole-codebase tech-debt sweep. Maps architecture + data flow, then orchestrates every audit-* skill and reviewer-* subagent across the repo. Gates mechanical vs structural fixes.
  • /simplify — Diff-scoped refactor pass with parallel agent fan-out (Reuse / Quality / Efficiency). Magic numbers, naming, oversized files, parallel-enum drift, repeated literals. Per-fix safety-net rule.
  • /realign — Update a code model (types, state machines, vocabulary, persisted data) to match changed business requirements. Migrates persisted-data values as a first-class step.
  • /harden-types — Strip any, dangerous casts, missing return types, missing boundary validation. Mechanical / structural / legitimate classification.
  • /add-migration — Schema migration with three-class taxonomy (additive / mutating / destructive), explicit multi-step rollout plan, CONCURRENTLY awareness, NOT-NULL-on-populated-table trap, DEFAULT now() per-row gotcha.
  • /add-observability — Add structured logging, metrics, traces at the right boundaries. Replaces silent failure paths with explicit logs + telemetry.
  • /audit-authz — Identity vs ownership separation, IDOR detection, webhook-signature awareness on every server-side entry point.
  • /write-tests — Write tests for an existing code path. Smoke-test hard gate; real test DB over mocks; never mock the layer the bug lives in.
  • /add-e2e-test — Playwright workflow with explicit smoke gate, selector preference order, 3× flake check.
  • /reorganize-files — Reorganize a directory into a clear grouped layout, then update every reference (static imports, ?url, import.meta.glob, CSS url(), OG metadata). Five reference styles tracked separately.

UX & UI

  • /polish-ui — Apply a UX polish checklist to existing UI without changing behavior. Auto-fixes mechanical gaps inline.
  • /propagate-ui-pattern — Sibling sweep with per-instance APPLY/SKIP/ASK. Rule-of-three extraction gate; behavior+affordance pairing.
  • /add-empty-error-states — Three-state UI model (loading / empty / error) with concrete tsx examples. Handles useSuspenseQuery error boundary correctly.
  • /audit-a11y — Whole-app accessibility audit with WCAG thresholds. Sensible auto-fix vs structural split.
  • /audit-responsive — Whole-app responsive audit. Inline failure-mode catalog with Tailwind snippets, 4 canonical viewports, 100vh vs 100dvh vs 100svh distinction.
  • /audit-perf — Whole-app performance audit. Evidence-anchored — no potential findings.
  • /audit-seo-meta — Tag-by-tag SEO/meta severity table. Distinguishes per-route vs repo-level.
  • /audit-analytics — Analytics-event coverage audit. SDK detection table, PII classification with severity, MUST/SHOULD/OK rubric.

Release & Docs

  • /commit — Split the working tree into one or more logically-grouped commits (schema → backend → frontend; deps before consumers; types before usages). Detects conventional-commits style; never pushes.
  • /commit-and-push — Run /commit, then push the current branch to its remote (sets upstream with -u when missing). Never force-pushes.
  • /open-pr — Open a GitHub PR from an already-committed branch. Three-dot diff, draft detection, mandatory confirmation gate, integrates release-risk briefing.
  • /release — Cut a versioned release. Multi-manifest detection (package.json, pyproject.toml, Cargo.toml, VERSION), plugin-marketplace diff-driven bump, dirty-tree refusal, push-confirm gate.
  • /check-pr-readiness — Fail-fast gauntlet before pushing: typecheck, lint, format-only auto-fix, lockfile-aware residue sweep on added lines only.
  • /check-release-risk — Pre-publish risk briefing invoked by /commit-and-push, /open-pr, /release. Seven categorizers (public API, persistence, auth, env vars, manual QA, doc/changelog drift, rollback). Informs; never blocks.
  • /address-pr-comments — Resolve unresolved GitHub PR review threads end-to-end. Fans out one pr-comment-resolver subagent per thread; sequential commits (one per thread); replies + resolves via the GitHub API.
  • /update-changelog — Append a categorized entry to CHANGELOG.md. Canonical emoji set; dedup check; "would a user a year from now care".
  • /sync-docs — Update existing project documentation in-place after code changes (Swagger/OpenAPI, README, ADRs, .env.example, setup guides). Never creates new doc files.

Reviewer subagents (read-only, in agentsystem-core/agents/)

The skills above fan out to these read-only subagents for diff-scoped audits. They never edit files; they return severity-ranked findings with file:line refs and concrete fix snippets. They are invoked indirectly by the orchestrating skills (add-feature, modify-feature, fix-bug, audit, etc.); you generally don't call them directly.

SubagentWhat it catches
reviewer-authzMissing/wrong access checks on server entry points; IDOR; role-without-scope; unsigned webhooks.
reviewer-security-regressionLogged/leaked secrets, SSRF, unsafe uploads, dangerous HTML, open redirects, missing rate limits.
reviewer-data-integrityNOT NULL without backfill, orphan-creating deletes, missing DB constraints, unsafe migrations, stale fixtures.
reviewer-contractsProducer/consumer drift — server fn ↔ client, zod ↔ DB, route params ↔ links, stale generated clients.
reviewer-concurrencyDouble-submit duplicates, missing idempotency on webhooks, read-modify-write races, missing transactions, stale async writes.
reviewer-error-boundariesPromise rejections leaking to blank screens, server errors becoming generic toasts, double-submit on failure, loader without errorComponent.
reviewer-loading-statesSubmit buttons not disabled, spinner-vs-skeleton inconsistency, optimistic updates without rollback, missing pendingComponent.
reviewer-accessibility-regressionIcon buttons missing names, dialogs missing focus trap, custom clickable divs, unbound labels, missing alt. (Changed-files only — /audit-a11y is the whole-app sibling.)
reviewer-client-bundleServer-only modules in client bundles, non-public env vars in client files, heavy deps added to first-load, lodash full-imports, oversized assets.
reviewer-observability-coverageCritical paths without structured logs, swallowed errors, missing correlation ids on jobs/webhooks, PII in log lines.
reviewer-perfN+1 queries, missing indexes, oversized SELECT *, sequential awaits, unmemoized hot paths, unbounded fetches, missing virtualization.
runtime-contract-tracerTrace an integration end-to-end: trigger → dispatch → receive → observe — with file:line refs. Used by /fix-bug to map silent integration failures.
crud-surface-mapperInventory every place the user creates / edits / configures / imports / duplicates an artifact (Task, Project, etc.) so new fields ship to every CRUD touchpoint.
ui-pattern-inspectorReturns a sibling-instance inventory for recurring UI surfaces (Modal, Dialog, Drawer, Form, Card, Toast) with convention summary (hotkeys, focus, footer chrome).
utility-finderPre-write lookup for existing helpers — returns ranked candidates with reuse/extend/write-new verdict. Prevents duplicate utilities at the source.
pr-comment-resolverPer-thread subagent used by /address-pr-comments. Reads, classifies, applies the fix, replies via the GitHub API, resolves the thread.

Notes on conventions

Every skill in this marketplace follows a few shared conventions:

  • AskUserQuestion protocol — Skills never print numbered options for the user to type a number; they invoke the AskUserQuestion tool for an interactive picker. Free-form questions stay in prose.
  • NEVER sections — Each skill ends with a NEVER section. Every entry follows the pattern **NEVER X** / **Instead:** Y / **Why:** Z so the model can judge edge cases, not just follow rules.
  • Mode tiers — Core delivery skills (add-feature, modify-feature, fix-bug, remove-feature, audit, ship) accept mode=fast|balanced|production plus include= / skip= overrides.
  • Subagent dispatch — Where reviewers exist as both a skill (whole-repo) and a subagent (diff-only), the skill is the user-invocable entry; the subagent is the read-only report-only variant the skills fan out to.

License

See LICENSE in the repository root.

Keywords

claude

FAQs

Package last updated on 27 May 2026

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts