You're Invited:Meet the Socket Team at RSAC and BSidesSF 2026, March 23–26.RSVP β†’
Socket
Book a DemoSign in
Socket

agentic-qe

Package Overview
Dependencies
Maintainers
1
Versions
153
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

agentic-qe

Agentic Quality Engineering V3 - Domain-Driven Design Architecture with 13 Bounded Contexts, O(log n) coverage analysis, ReasoningBank learning, 60 specialized QE agents, mathematical Coherence verification, deep Claude Flow integration

Source
npmnpm
Version
3.7.9
Version published
Weekly downloads
911
-35.07%
Maintainers
1
Weekly downloads
Β 
Created
Source

Agentic Quality Engineering Fleet

npm version License: MIT TypeScript Node.js NPM Downloads

Release Notes | Changelog | Contributors | Issues | Discussions

V3 brings Domain-Driven Design architecture, 13 bounded contexts, 60 specialized QE agents, TinyDancer intelligent model routing, ReasoningBank learning with Dream cycles, HNSW vector search, mathematical Coherence verification, full MinCut/Consensus integration across all 13 domains, RVF cognitive container integration with portable brain export/import, 11 coding agent platform integrations (Claude Code, OpenCode, Kiro, GitHub Copilot, Cursor, Cline, Kilo Code, Roo Code, OpenAI Codex CLI, Windsurf, Continue.dev), and deep integration with Claude Flow and Agentic Flow.

πŸ—οΈ DDD Architecture | 🧠 ReasoningBank + Dream Cycles | 🎯 TinyDancer Model Routing | πŸ” HNSW Vector Search | πŸ‘‘ Queen Coordinator | πŸ“Š O(log n) Coverage | πŸ”— Claude Flow Integration | 🎯 13 Bounded Contexts | πŸ“š 78 QE Skills | 🧬 Coherence Verification | βœ… Trust Tiers | πŸ›‘οΈ Governance | 🌐 11 Platform Integrations

⚑ Quick Start

Install & Initialize

# Install globally
npm install -g agentic-qe

# Initialize your project (interactive mode)
cd your-project
aqe init

# Or with auto-configuration (no prompts, configures MCP automatically)
aqe init --auto

# Include OpenCode assets (agents, skills, tools, permissions)
aqe init --auto --with-opencode

# Include AWS Kiro IDE assets (agents, skills, hooks, steering)
aqe init --auto --with-kiro

# Include any coding agent platform (Copilot, Cursor, Cline, etc.)
aqe init --auto --with-copilot --with-cursor --with-cline

# Or include all 8 additional platforms at once
aqe init --auto --with-all-platforms

Note: aqe init automatically configures the MCP server in .mcp.json β€” Claude Code will auto-start it when connecting. For standalone MCP server usage (non-Claude-Code clients), run aqe-mcp or npx agentic-qe mcp.

Use from MCP-compatible agent clients (11 platforms supported)

AQE is exposed as an MCP server and can be used from any client that supports MCP tool connections.

# For Claude Code: aqe init --auto configures .mcp.json automatically
# Claude Code will auto-start the MCP server on connection

# For OpenCode: provision assets automatically during init
aqe init --auto --with-opencode   # installs agents, skills, tools, permissions, opencode.json

# For AWS Kiro: provision Kiro-native assets during init
aqe init --auto --with-kiro       # installs .kiro/ agents, skills, hooks, steering, MCP config

# For GitHub Copilot, Cursor, Cline, Kilo Code, Roo Code, Codex, Windsurf, Continue.dev:
aqe init --auto --with-copilot    # .vscode/mcp.json + copilot-instructions.md
aqe init --auto --with-cursor     # .cursor/mcp.json + .cursorrules
aqe init --auto --with-cline      # cline_mcp_settings.json + custom QE mode
aqe init --auto --with-all-platforms  # all 8 platforms at once

# Or set up a specific platform after init:
aqe platform setup copilot        # configure a single platform
aqe platform list                 # show all platforms with install status
aqe platform verify cursor        # validate config format and content

# For other MCP clients: start the server manually
aqe-mcp                  # if installed globally
npx agentic-qe mcp       # without global install

For client-specific setup examples, see Platform Setup Guide.

What V3 provides:

  • βœ… 13 DDD Bounded Contexts: Organized by business domain (test-generation, coverage-analysis, security-compliance, enterprise-integration, etc.)
  • βœ… 60 QE Agents: Including Queen Coordinator for hierarchical orchestration (53 main + 7 TDD subagents)
  • βœ… TinyDancer Model Routing: 3-tier intelligent routing (Haiku/Sonnet/Opus) for cost optimization
  • βœ… ReasoningBank Learning: HNSW-indexed pattern storage with experience replay
  • βœ… O(log n) Coverage Analysis: Sublinear algorithms for efficient gap detection
  • βœ… Claude Flow Integration: Deep integration with MCP tools and swarm orchestration
  • βœ… Memory Coordination: Cross-agent communication via aqe/* namespaces
  • βœ… Coherence Verification (v3.3.0): Mathematical proof of belief consistency using WASM engines
  • βœ… RVF Cognitive Containers (v3.7.0): MinCut task routing, witness chain audit trail, portable brain export/import, unified HNSW search, production dual-write to native RVF
  • βœ… OpenCode Support (v3.7.1): 59 agent configs, 86 skill configs (78 QE + 8 general dev), 5 tool wrappers, SSE/WS/HTTP transport, output compaction, graceful degradation, aqe init --with-opencode auto-provisioning
  • βœ… AWS Kiro Support (v3.7.2): 87 agent configs, 86 skill configs, 5 event-driven hooks, 2 steering files, MCP config, aqe init --with-kiro auto-provisioning
  • βœ… Multi-Platform Support (v3.7.4): 8 new platform integrations β€” GitHub Copilot, Cursor, Cline, Kilo Code, Roo Code, OpenAI Codex CLI, Windsurf, Continue.dev β€” with JSON/TOML/YAML config generation, behavioral rules, and aqe platform list/setup/verify CLI
  • βœ… Loki-Mode Quality Gates (v3.7.7): Anti-sycophancy scoring, test quality gates, blind review, EMA calibration, edge-case injection, complexity-driven composition, auto-escalation β€” enabled by default (opt-out)
  • βœ… V2 Backward Compatibility: All V2 agents map to V3 equivalents
  • βœ… 78 QE Skills: 46 Tier 3 verified + 32 additional QE skills (QCSD swarms, n8n testing, enterprise integration, qe-* domains)

πŸš€ Get Value in 60 Seconds

# 1. Install
npm install -g agentic-qe

# 2. Initialize (auto-detects your project, enables all 13 domains, configures MCP)
cd your-project && aqe init --auto

# 3. Use from Claude Code β€” MCP tools are available immediately (prefix: mcp__agentic-qe__)
# Or start MCP server manually for other clients: aqe-mcp

What happens:

  • Auto-configuration detects your tech stack (TypeScript/JS, testing framework, CI setup)
  • All 13 DDD domains enabled automatically - no "No factory registered" errors
  • MCP server configured in .mcp.json β€” Claude Code auto-connects on next session
  • Pattern learning kicks in - your project's test patterns are learned and reused
  • AI agents generate tests, analyze coverage, and provide actionable recommendations

🎯 Why AQE?

ProblemAQE Solution
Writing comprehensive tests is tedious and time-consumingAI agents generate tests automatically with pattern reuse across projects
Test suites become slow and expensive at scaleSublinear O(log n) algorithms for coverage analysis and intelligent test selection
Flaky tests waste developer time debugging false failuresML-powered detection with root cause analysis and fix recommendations
AI testing tools are expensiveTinyDancer 3-tier model routing reduces costs by matching task complexity to appropriate model
No memory between test runsβ€”every analysis starts from scratchReasoningBank remembers patterns, strategies, and what works for your codebase
Agents waste tokens reading irrelevant codeCode Intelligence provides token reduction with semantic search and knowledge graphs
Quality engineering requires complex coordinationQueen Coordinator orchestrates 60 agents across 13 domains with consensus and MinCut topology
Tools don't understand your testing frameworksWorks with Jest, Cypress, Playwright, Vitest, Mocha, Jasmine, AVA

✨ V3 Features

πŸ—οΈ Domain-Driven Design Architecture

V3 is built on 13 DDD Bounded Contexts, each with dedicated agents and clear responsibilities:

DomainPurposeKey Agents
test-generationAI-powered test creationqe-test-architect, qe-tdd-specialist
test-executionParallel execution & retryqe-parallel-executor, qe-retry-handler
coverage-analysisO(log n) gap detectionqe-coverage-specialist, qe-gap-detector
quality-assessmentQuality gates & decisionsqe-quality-gate, qe-risk-assessor
defect-intelligencePrediction & root causeqe-defect-predictor, qe-root-cause-analyzer
requirements-validationBDD & testabilityqe-requirements-validator, qe-bdd-generator
code-intelligenceKnowledge graph & searchqe-code-intelligence, qe-kg-builder
security-complianceSAST/DAST & auditqe-security-scanner, qe-security-auditor
contract-testingAPI contracts & GraphQLqe-contract-validator, qe-graphql-tester
visual-accessibilityVisual regression & a11yqe-visual-tester, qe-accessibility-auditor
chaos-resilienceChaos engineering & loadqe-chaos-engineer, qe-load-tester
learning-optimizationCross-domain learningqe-learning-coordinator, qe-pattern-learner
enterprise-integrationSOAP, SAP, ESB, ODataqe-soap-tester, qe-sap-rfc-tester, qe-sod-analyzer

βœ… Skill Trust Tiers (v3.4.2)

AQE includes 78 QE skills (46 Tier 3 verified + 32 additional). Trust tiers apply to core QE skills:

TierBadgeCountDescription
Tier 3 - VerifiedTier 346Full evaluation test suite
Tier 2 - ValidatedTier 27Has executable validator
Tier 1 - StructuredTier 15Has JSON output schema
Tier 0 - AdvisoryTier 05SKILL.md guidance only

Tier 3 Skills are recommended for production use - they have:

  • JSON Schema validation for output structure
  • Executable validator scripts for correctness
  • Evaluation test suites with multi-model testing
# Check skill trust tier
aqe eval status --skill security-testing

# Run skill evaluation
aqe eval run --skill security-testing --model claude-sonnet-4

# View all trust tiers
cat .claude/skills/TRUST-TIERS.md

[Full documentation: docs/guides/skill-validation.md]

🌐 Browser Automation Integration (v3.1.0)

V3.1.0 adds full browser automation support via @claude-flow/browser integration:

ComponentDescription
BrowserSwarmCoordinatorParallel multi-viewport testing (4x faster)
BrowserSecurityScannerURL validation, PII detection with auto-masking
9 Workflow TemplatesYAML-based reusable browser workflows
TrajectoryAdapterSONA learning integration with HNSW indexing

Available Workflow Templates:

  • login-flow, oauth-flow - Authentication testing
  • form-validation, navigation-flow - User journey testing
  • visual-regression, accessibility-audit - Quality validation
  • performance-audit, api-integration, scraping-workflow - Advanced workflows
# Use browser automation from Claude Code
claude "Use security-visual-testing skill to test https://example.com across mobile, tablet, desktop viewports"

# Load and execute a workflow template
aqe workflow run login-flow --vars '{"username": "test", "password": "secret"}'

πŸ‘‘ Queen Coordinator & Hierarchical Orchestration

The qe-queen-coordinator manages the entire fleet with intelligent task distribution:

                    qe-queen-coordinator
                           (Queen)
                             |
        +--------------------+--------------------+
        |                    |                    |
   TEST DOMAIN          QUALITY DOMAIN       LEARNING DOMAIN
   (test-generation)    (quality-assessment) (learning-optimization)
        |                    |                    |
   - test-architect     - quality-gate       - learning-coordinator
   - tdd-specialist     - risk-assessor      - pattern-learner
   - integration-tester - deployment-advisor - transfer-specialist

Capabilities:

  • Orchestrate 60 QE agents concurrently across 13 domains
  • TinyDancer 3-tier model routing (Haiku/Sonnet/Opus) with confidence-based decisions
  • Byzantine fault-tolerant consensus for critical quality decisions
  • MinCut graph-based topology optimization for self-healing coordination
  • Memory-backed cross-agent communication with HNSW vector search
  • Work stealing with adaptive load balancing (3-5x throughput improvement)
claude "Use qe-queen-coordinator to orchestrate release validation for v2.1.0 with 90% coverage target"

🀝 Agent Teams & Fleet Coordination

The Queen Coordinator is extended with Agent Teams (ADR-064) for hybrid fleet communication:

FeatureDescription
Mailbox MessagingDirect agent-to-agent and domain-scoped broadcast messaging
Distributed TracingTraceContext propagation across messages for end-to-end task visibility
Dynamic ScalingWorkload-based auto-scaling with configurable policies and cooldowns
Competing HypothesesMulti-agent root cause investigation with evidence scoring, auto-triggered on critical failures
FederationCross-service routing with health monitoring and service discovery
Circuit BreakersPer-domain fault isolation with automatic recovery
Task DAGTopological ordering with cycle detection for multi-step workflows

Fleet Tiers β€” Activate the level of coordination your project needs:

TierAgentsBest For
Lite1-4Small projects, focused tasks
Standard5-10Team projects, multi-domain coordination
Full11-15Enterprise, cross-fleet federation
claude "Use qe-queen-coordinator with agent teams to investigate flaky test failures across test-execution and defect-intelligence domains"

🧠 ReasoningBank Learning System

V3 agents learn and improve through the ReasoningBank pattern storage:

ComponentDescription
Experience StorageStore successful patterns with confidence scores
HNSW IndexingFast O(log n) similarity search for pattern matching
Experience ReplayLearn from past successes and failures
Cross-Project TransferShare patterns between projects
# Search learned patterns
aqe hooks search --query "test patterns"

# View learning statistics
aqe learning stats

πŸ›‘οΈ Loki-Mode Quality Gates (v3.7.7)

V3.7.7 adds 7 adversarial quality features inspired by loki-mode β€” designed to catch sycophantic AI outputs, hollow tests, and routing drift. All features are enabled by default (opt-out via config flags).

FeatureConfig FlagDescription
Anti-Sycophancy ScorerenableSycophancyCheckDetects rubber-stamp consensus via Jaccard similarity, confidence uniformity, and reasoning overlap
Test Quality GatesenableTestQualityGateCatches tautological assertions (expect(true).toBe(true)), empty test bodies, and missing source imports
Blind ReviewN/A (API option)Runs N parallel test generators with varied temperatures, deduplicates via Jaccard
EMA CalibrationenableEMACalibrationExponential moving average tracks per-agent accuracy, derives dynamic voting weights
Edge-Case InjectionenableEdgeCaseInjectionQueries historical patterns and injects proven edge cases into test generation prompts
Complexity CompositionenableComplexityCompositionMaps 8-dimension complexity (AST + security + concurrency) to agent team composition
Auto-EscalationenableAutoEscalationConsecutive failures auto-promote agent tier; consecutive successes auto-demote
// All features are ON by default. To disable specific features:
const config: Partial<RoutingConfig> = {
  enableEMACalibration: false,    // Disable EMA voting weights
  enableAutoEscalation: false,    // Disable auto tier promotion
};

const consensusConfig: Partial<ConsensusEngineConfig> = {
  enableSycophancyCheck: false,   // Disable rubber-stamp detection
};

const testConfig: Partial<TestGeneratorConfig> = {
  enableTestQualityGate: false,   // Disable tautology detection
  enableEdgeCaseInjection: false, // Disable pattern injection
};

See docs/loki-mode-features.md for detailed usage examples and configuration reference.

πŸŒ™ Dream Cycles & Neural Learning

V3 introduces Dream cycles for neural consolidation and continuous improvement:

FeatureDescription
Dream CyclesBackground neural consolidation (30s max) with spreading activation
9 RL AlgorithmsQ-Learning, SARSA, DQN, PPO, A2C, DDPG, Actor-Critic, Policy Gradient, Decision Transformer
SONA IntegrationSelf-Optimizing Neural Architecture with <0.05ms adaptation
Novelty ScoringPrioritize learning from novel patterns
Concept GraphsBuild semantic connections between quality patterns
# Trigger dream cycle for pattern consolidation
aqe learning dream

# View learning dashboard and trajectory
aqe learning dashboard

🧬 RVF Cognitive Container Integration (v3.7.0)

V3.7 integrates RVF (RuVector Format) cognitive container capabilities across 4 workstreams:

FeatureDescription
MinCut Task RoutingModels task complexity as a graph problem using vertex connectivity (lambda) for intelligent 3-tier routing
RVCOW Dream BranchingCopy-on-write branches for safe dream cycle experimentation β€” speculative insights are isolated until merged
Cryptographic Witness ChainSHA-256 hash-chained audit trail for quality gate decisions, pattern promotions, and test completions
Unified HNSW SearchConsolidated 3 fragmented implementations behind a single progressive adapter (flat scan for small, full HNSW for large collections)
Brain Export/ImportPortable QE intelligence containers β€” export patterns, Q-values, and insights for sharing across environments
MinCut Test OptimizerIdentifies critical vs skippable tests using graph-based coverage analysis for faster test suites
RVF Dual-WriterBest-effort dual-write to SQLite + RVF containers, preparing for future native RVF storage
# Export your QE brain as a portable container
aqe brain export --output ./my-brain

# Import a brain into another environment
aqe brain import --input ./my-brain --dry-run

# View brain statistics
aqe brain info --input ./my-brain

πŸ—οΈ AWS Kiro IDE Support (v3.7.2)

V3.7.2 adds full AWS Kiro IDE integration, converting AQE agents, skills, hooks, and steering files to Kiro-native formats via aqe init --with-kiro.

AssetCountDescription
Agents87.kiro/agents/*.json β€” 62 QE + 15 n8n + 10 infra/testing agents
Skills86.kiro/skills/*/SKILL.md β€” full instructional content from Claude Code sources
Hooks5.kiro/hooks/*.kiro.hook β€” event-driven automation (test update, coverage, security, quality gate, pre-commit)
Steering2.kiro/steering/*.md β€” QE standards and testing conventions
MCP Config1.kiro/settings/mcp.json β€” auto-approved AQE tools

Setup:

# Provision Kiro assets during init
aqe init --auto --with-kiro

# Or add Kiro to an existing project
aqe init --with-kiro

What gets generated:

.kiro/
β”œβ”€β”€ agents/              # 87 agent definitions (Kiro JSON format)
β”œβ”€β”€ skills/*/SKILL.md    # 86 skill definitions (full markdown content)
β”œβ”€β”€ hooks/               # 5 event-driven hooks
β”‚   β”œβ”€β”€ aqe-test-updater.kiro.hook       # Auto-update tests on source changes
β”‚   β”œβ”€β”€ aqe-coverage-check.kiro.hook     # Coverage analysis on test edits
β”‚   β”œβ”€β”€ aqe-security-scan.kiro.hook      # Security scan on auth file changes
β”‚   β”œβ”€β”€ aqe-spec-quality-gate.kiro.hook  # Quality gate after spec tasks
β”‚   └── aqe-pre-commit-quality.kiro.hook # Quality check before agent stops
β”œβ”€β”€ steering/            # 2 project-wide guidance files
β”‚   β”œβ”€β”€ qe-standards.md              # QE standards (auto-triggered)
β”‚   └── testing-conventions.md       # Test file conventions (fileMatch)
└── settings/mcp.json    # MCP server config with auto-approved tools

Key features:

  • Converts OpenCode YAML agents and Claude Code markdown agents to Kiro JSON format
  • Reads full SKILL.md content from Claude Code sources (not thin YAML stubs)
  • Maps mcp:agentic-qe: references to Kiro's @agentic-qe/ format
  • Maps Claude Code tool names to Kiro builtins (bashβ†’shell, editβ†’write)
  • Auto-detects existing .kiro/ directory in --auto mode
  • Safe by default: won't overwrite existing files unless --upgrade is used

🌐 Multi-Platform Coding Agent Support (v3.7.4)

V3.7.4 adds support for 8 additional coding agent platforms, bringing the total to 11 supported platforms. A single AQE MCP server works across all of them.

PlatformUsersConfig FormatSetup Flag
Claude CodeNative.mcp.jsonBuilt-in
OpenCodeGrowingopencode.json--with-opencode
AWS KiroGrowing.kiro/settings/mcp.json--with-kiro
GitHub Copilot20M.vscode/mcp.json--with-copilot
Cursor360K paid.cursor/mcp.json--with-cursor
Cline5M installscline_mcp_settings.json--with-cline
Kilo Code1.5M.kilocode/mcp.json--with-kilocode
Roo CodeGrowing.roo/mcp.json--with-roocode
OpenAI Codex CLI1M+.codex/config.toml--with-codex
WindsurfLarge.windsurf/mcp_config.json--with-windsurf
Continue.devEnterprise.continue/config.yaml--with-continuedev

Each installer generates:

  • MCP config in the platform's native format (JSON, TOML, or YAML)
  • Behavioral rules (e.g., .cursorrules, copilot-instructions.md, AGENTS.md)
  • Auto-approve lists for platforms that support them (Cline, Kilo Code, Roo Code)
# Set up all platforms at once
aqe init --auto --with-all-platforms

# Or individual platforms
aqe platform setup copilot
aqe platform setup cursor

# Check what's installed
aqe platform list

# Validate a platform's config
aqe platform verify cursor

For detailed per-platform setup instructions, see Platform Setup Guide.

🎯 TinyDancer Intelligent Model Routing

TinyDancer (ADR-026) provides 3-tier intelligent model routing for cost optimization:

Complexity ScoreModelUse Cases
0-20 (Simple)HaikuSyntax fixes, type additions, simple refactors
20-70 (Moderate)SonnetBug fixes, test generation, code review
70+ (Critical)OpusArchitecture, security, complex reasoning

Routing Features:

  • Confidence-based decisions: Routes based on task complexity analysis
  • Automatic escalation: Escalates to higher-tier model if confidence is low
  • Learning from outcomes: Improves routing based on success/failure patterns
  • Token budget optimization: Minimizes cost while maintaining quality
# Route a task through TinyDancer
aqe llm route --task "fix type errors in user-service.ts"

# View routing cost analysis
aqe llm cost

πŸ”„ Cross-Phase Memory Unification (v3.3.5)

V3.3.5 unifies cross-phase feedback loops with UnifiedMemoryManager:

  • Single SQLite Backend: All QCSD signals stored in .agentic-qe/memory.db
  • Namespace-Based Storage: qcsd/strategic, qcsd/tactical, qcsd/operational, qcsd/quality-criteria
  • Automatic TTL: 30-90 day expiration per signal type
  • No File-Based Storage: Eliminated JSON file storage for cross-phase memory
  • Full Hook Integration: Pre/post hooks for cross-phase signal injection

🌐 AG-UI, A2A & A2UI Protocols (v3.4.0)

V3.4.0 adds support for industry-standard agent communication protocols:

ProtocolStandardPurpose
AG-UIAnthropicAgent-to-User streaming interface with lifecycle events
A2AGoogleAgent-to-Agent interoperability with task/artifact exchange
A2UIHybridUnified UI components combining streaming + events

Programmatic usage:

import { AGUIAdapter, A2AAdapter } from 'agentic-qe';

// AG-UI: Stream test generation progress to UI
const agui = new AGUIAdapter();
await agui.streamTask({
  type: 'test-generation',
  onProgress: (event) => updateProgressBar(event.progress),
  onArtifact: (test) => displayGeneratedTest(test),
});

// A2A: Inter-agent task delegation
const a2a = new A2AAdapter();
await a2a.sendTask({
  from: 'qe-test-architect',
  to: 'qe-security-scanner',
  task: { type: 'review-tests', files: generatedTests },
});

Benefits:

  • Streaming feedback - Real-time progress instead of waiting for completion
  • Agent interoperability - Standard protocols for multi-agent coordination
  • Framework integration - Works with React, Vue, or any UI framework

πŸ” Consensus & MinCut Coordination (v3.3.3)

V3.3.3 achieves full MinCut/Consensus integration across all 13 domains:

FeatureDescription
Byzantine ConsensusFault-tolerant voting for critical quality decisions
MinCut TopologyGraph-based self-healing agent coordination
Multi-Model VotingAggregate decisions from multiple model tiers
Claim VerificationCryptographic verification of agent work claims
13/13 Domain IntegrationAll domains use verifyFinding() for consensus
Topology-Aware RoutingRoutes tasks avoiding weak network vertices
Self-Healing TriggersshouldPauseOperations() for automatic recovery
# Check fleet health (includes consensus and topology status)
aqe fleet status

# Via MCP tools (from Claude Code)
# mcp__agentic-qe__fleet_health({ includeTopology: true })

🧬 Coherence-Gated Quality Engineering (v3.3.0)

V3.3.0 introduces mathematical coherence verification using Prime Radiant WASM engines:

FeatureDescription
Contradiction DetectionSheaf cohomology identifies conflicting requirements before test generation
Collapse PredictionSpectral analysis predicts swarm failures before they happen
Causal VerificationDistinguishes true causation from spurious correlations
Auto-Tuning ThresholdsEMA-based calibration adapts to your codebase

Compute Lanes - Automatic routing based on coherence energy:

Coherence EnergyActionLatency
< 0.1 (Reflex)Execute immediately<1ms
0.1-0.4 (Retrieval)Fetch more context~10ms
0.4-0.7 (Heavy)Deep analysis~100ms
> 0.7 (Human)Escalate to QueenAsync

Benefits:

  • Prevents contradictory test generation
  • Detects swarm drift 10x faster
  • Mathematical proof instead of statistical confidence
  • "Coherence Verified" CI/CD badges
# Coherence verification is available via MCP tools and programmatic API:
# mcp__agentic-qe__quality_assess({ scope: "coherence", includeMetrics: true })

# Verify learned patterns for consistency
aqe learning verify

πŸ” O(log n) Coverage Analysis

Efficient coverage gap detection using Johnson-Lindenstrauss algorithms:

  • Sublinear complexity: Analyze large codebases in logarithmic time
  • Risk-weighted gaps: Prioritize coverage by business impact
  • Intelligent test selection: Minimal tests for maximum coverage
  • Trend tracking: Monitor coverage changes over time
claude "Use qe-coverage-specialist to analyze gaps in src/ with risk scoring"

πŸ”— Claude Flow Integration

V3 deeply integrates with Claude Flow for:

  • MCP Server: All V3 tools available via Model Context Protocol
  • Swarm Orchestration: Multi-agent coordination with hierarchical topology
  • Memory Sharing: Cross-agent state via aqe/* namespaces
  • Hooks System: Pre/post task learning and optimization
  • Session Management: Persistent state across conversations
# Initialize swarm with Claude Flow
npx @claude-flow/cli@latest swarm init --topology hierarchical-mesh

# Spawn V3 agents
npx @claude-flow/cli@latest agent spawn -t qe-test-architect --name test-gen

πŸ“Š 60 Specialized QE Agents

CategoryCountHighlights
Main QE Agents53Test generation, coverage, security, performance, accessibility, enterprise integration, pentest validation
TDD Subagents7RED/GREEN/REFACTOR with code review

V2 Backward Compatibility: All V2 agents map to V3 equivalents automatically.

πŸ“‹ View All Main QE Agents (53)
AgentDomainPurpose
qe-queen-coordinatorcoordinationHierarchical fleet orchestration
qe-test-architecttest-generationAI-powered test creation
qe-tdd-specialisttest-generationTDD workflow coordination
qe-parallel-executortest-executionMulti-worker test execution
qe-retry-handlertest-executionIntelligent retry with backoff
qe-coverage-specialistcoverage-analysisO(log n) coverage analysis
qe-gap-detectorcoverage-analysisRisk-weighted gap detection
qe-quality-gatequality-assessmentQuality threshold validation
qe-risk-assessorquality-assessmentMulti-factor risk scoring
qe-deployment-advisorquality-assessmentGo/no-go deployment decisions
qe-defect-predictordefect-intelligenceML-powered defect prediction
qe-root-cause-analyzerdefect-intelligenceSystematic root cause analysis
qe-flaky-hunterdefect-intelligenceFlaky test detection & fix
qe-requirements-validatorrequirements-validationTestability analysis
qe-bdd-generatorrequirements-validationGherkin scenario generation
qe-code-intelligencecode-intelligenceSemantic code search
qe-kg-buildercode-intelligenceKnowledge graph construction
qe-dependency-mappercode-intelligenceDependency analysis
qe-security-scannersecurity-complianceSAST/DAST scanning
qe-security-auditorsecurity-complianceSecurity audit & compliance
qe-contract-validatorcontract-testingAPI contract validation
qe-graphql-testercontract-testingGraphQL testing
qe-visual-testervisual-accessibilityVisual regression testing
qe-accessibility-auditorvisual-accessibilityWCAG compliance testing
qe-responsive-testervisual-accessibilityCross-viewport testing
qe-chaos-engineerchaos-resilienceControlled fault injection
qe-load-testerchaos-resilienceLoad & performance testing
qe-performance-testerchaos-resiliencePerformance validation
qe-learning-coordinatorlearning-optimizationFleet-wide learning
qe-pattern-learnerlearning-optimizationPattern discovery
qe-transfer-specialistlearning-optimizationCross-project transfer
qe-metrics-optimizerlearning-optimizationHyperparameter tuning
qe-integration-testertest-executionComponent integration
qe-mutation-testertest-generationTest effectiveness validation
qe-property-testertest-generationProperty-based testing
qe-regression-analyzerdefect-intelligenceRegression risk analysis
qe-impact-analyzercode-intelligenceChange impact assessment
qe-code-complexitycode-intelligenceComplexity metrics
qe-qx-partnerquality-assessmentQA + UX collaboration
qe-fleet-commandercoordinationLarge-scale orchestration
qe-integration-architectcode-intelligenceV3 integration design
qe-product-factors-assessorquality-assessmentSFDIPOT product factors analysis
qe-test-idea-rewritertest-generationTransform passive tests to active actions
qe-quality-criteria-recommenderquality-assessmentHTSM v6.3 Quality Criteria analysis
qe-devils-advocatequality-assessmentAdversarial review of agent outputs
πŸ”§ TDD Subagents (7)
SubagentPhasePurpose
qe-tdd-redREDWrite failing tests
qe-tdd-greenGREENImplement minimal code
qe-tdd-refactorREFACTORImprove code quality
qe-code-reviewerREVIEWCode quality validation
qe-integration-reviewerREVIEWIntegration review
qe-performance-reviewerREVIEWPerformance review
qe-security-reviewerREVIEWSecurity review

πŸ’» V3 Usage Examples

Example 1: Queen-Coordinated Quality Pipeline

claude "Use qe-queen-coordinator to run full quality assessment:
1. Generate tests for src/services/*.ts
2. Execute tests with parallel workers
3. Analyze coverage gaps with risk scoring
4. Run security scan
5. Validate quality gate at 90% threshold
6. Provide deployment recommendation"

What happens:

  • Queen spawns domain coordinators for each task
  • Agents execute in parallel across 5 domains
  • Results aggregate through memory coordination
  • Queen synthesizes final recommendation

Example 2: Learning-Enhanced Test Generation

claude "Use qe-test-architect to create tests for PaymentService with:
- Property-based testing for validation
- 95% coverage target
- Apply learned patterns from similar services"

Output includes:

Generated 48 tests across 4 files
- unit/PaymentService.test.ts (32 unit tests)
- property/PaymentValidation.property.test.ts (8 property tests)
- integration/PaymentFlow.integration.test.ts (8 integration tests)
Coverage: 96.2%
Pattern reuse: 78% from learned patterns
Learning stored: "payment-validation-patterns" (confidence: 0.94)

Example 3: TDD Workflow with Subagents

claude "Use qe-tdd-specialist to implement UserAuthentication with full RED-GREEN-REFACTOR cycle"

Workflow:

  • qe-tdd-red: Writes failing tests defining behavior
  • qe-tdd-green: Implements minimal code to pass
  • qe-tdd-refactor: Improves code quality
  • qe-code-reviewer: Validates standards
  • qe-security-reviewer: Checks security concerns

Example 4: Cross-Domain Coordination

claude "Coordinate security audit across the monorepo:
- qe-security-scanner for SAST/DAST
- qe-dependency-mapper for vulnerability scanning
- qe-contract-validator for API security
- qe-chaos-engineer for resilience testing"

πŸŽ“ 78 QE Skills

V3 agents automatically apply relevant skills from the comprehensive QE skill library.

View All 78 QE Skills

Core Testing & Methodologies (12)

  • agentic-quality-engineering - Core PACT principles for AI-powered QE
  • holistic-testing-pact - Evolved testing model with PACT integration
  • context-driven-testing - Practices chosen based on project context
  • tdd-london-chicago - Test-driven development with both school approaches
  • xp-practices - Extreme programming practices for quality
  • risk-based-testing - Focus testing effort on highest-risk areas
  • test-automation-strategy - Strategic approach to automation
  • refactoring-patterns - Safe code improvement patterns
  • shift-left-testing - Early testing in development lifecycle
  • shift-right-testing - Production testing and observability
  • regression-testing - Strategic regression management
  • verification-quality - Quality verification practices

Specialized Testing (13)

  • accessibility-testing - WCAG 2.2 compliance and inclusive design
  • mobile-testing - iOS and Android platform testing
  • database-testing - Schema validation and data integrity
  • contract-testing - Consumer-driven contract testing
  • chaos-engineering-resilience - Fault injection and resilience testing
  • visual-testing-advanced - Visual regression and UI testing
  • security-visual-testing - Security-first visual testing with PII detection
  • compliance-testing - Regulatory compliance (GDPR, HIPAA, SOC2)
  • compatibility-testing - Cross-browser and platform testing
  • localization-testing - i18n and l10n testing
  • mutation-testing - Test suite effectiveness evaluation
  • performance-testing - Load, stress, and scalability testing
  • security-testing - OWASP and security vulnerability testing

V3 Domain Skills (14)

  • qe-test-generation - AI-powered test synthesis
  • qe-test-execution - Parallel execution and retry logic
  • qe-coverage-analysis - O(log n) sublinear coverage
  • qe-quality-assessment - Quality gates and deployment readiness
  • qe-defect-intelligence - ML defect prediction and root cause
  • qe-requirements-validation - BDD scenarios and acceptance criteria
  • qe-code-intelligence - Knowledge graphs and token reduction
  • qe-security-compliance - OWASP and CVE detection
  • qe-contract-testing - Pact and schema validation
  • qe-visual-accessibility - Visual regression and WCAG
  • qe-chaos-resilience - Fault injection and resilience
  • qe-learning-optimization - Transfer learning and self-improvement
  • qe-iterative-loop - QE iteration patterns
  • aqe-v2-v3-migration - Migration guide from v2 to v3

Strategic & Communication (8)

  • six-thinking-hats - Edward de Bono's methodology for QE
  • brutal-honesty-review - Unvarnished technical criticism
  • sherlock-review - Evidence-based investigative code review
  • cicd-pipeline-qe-orchestrator - CI/CD quality orchestration
  • bug-reporting-excellence - High-quality bug reports
  • consultancy-practices - QE consultancy workflows
  • quality-metrics - Effective quality measurement
  • pair-programming - AI-assisted pair programming

Testing Techniques & Management (9)

  • exploratory-testing-advanced - SBTM and RST heuristics
  • test-design-techniques - Test design methodologies
  • test-data-management - Test data strategies
  • test-environment-management - Environment configuration
  • test-reporting-analytics - Quality dashboards and KPIs
  • testability-scoring - Score code testability
  • technical-writing - Documentation practices
  • code-review-quality - Context-driven code reviews
  • api-testing-patterns - REST and GraphQL testing

n8n Workflow Testing (5) (contributed by @fndlalit)

  • n8n-workflow-testing-fundamentals - Execution lifecycle and data flow
  • n8n-expression-testing - Expression validation and testing
  • n8n-security-testing - Workflow security scanning
  • n8n-trigger-testing-strategies - Webhook and event testing
  • n8n-integration-testing-patterns - API contract testing for n8n

QCSD Swarms (4) - Quality Conscious Software Delivery lifecycle

  • qcsd-ideation-swarm - Phase 1: HTSM v6.3, Risk Storming, Testability analysis
  • qcsd-refinement-swarm - Phase 2: SFDIPOT analysis, BDD scenario generation
  • qcsd-development-swarm - Phase 3: TDD, coverage, code quality gates (SHIP/CONDITIONAL/HOLD)
  • qcsd-cicd-swarm - Phase 4: Pipeline quality gates (RELEASE/REMEDIATE/BLOCK)

Accessibility (2)

  • a11y-ally - Comprehensive WCAG auditing with video captions and EU compliance
  • accessibility-testing - WCAG 2.2 compliance and screen reader validation

πŸ”„ V2 to V3 Migration

V3 provides automatic backward compatibility with V2:

# Check migration status
aqe migrate status

# Run migration with backup
aqe migrate run --backup

# Verify migration
aqe migrate verify

What gets migrated:

  • βœ… Memory data (SQLite β†’ AgentDB with HNSW indexing)
  • βœ… Configuration files
  • βœ… Learned patterns (β†’ ReasoningBank)
  • βœ… Agent mappings (V2 names β†’ V3 equivalents)
V2 AgentV3 Agent
qe-test-generatorqe-test-architect
qe-coverage-analyzerqe-coverage-specialist
qe-quality-gateqe-quality-gate
qe-security-scannerqe-security-scanner
qe-coordinatorqe-queen-coordinator

πŸ€– LLM Provider Configuration

AQE V3 supports multiple LLM providers for maximum flexibility:

ProviderTypeCostBest For
OllamaLocalFREEPrivacy, offline
OpenRouterCloudVaries300+ models
GroqCloudFREEHigh-speed
Claude APICloudPaidHighest quality
Google AICloudFREEGemini models
# Configure provider
export GROQ_API_KEY="gsk_..."
aqe init --auto

πŸ“– Documentation

V3 Guides

V2 Documentation (Legacy)

Feature Guides

Testing Guides

πŸ“Š Project Architecture

agentic-qe/
β”œβ”€β”€ src/
β”‚   β”œβ”€β”€ kernel/              # Shared kernel
β”‚   β”œβ”€β”€ domains/             # 13 bounded contexts
β”‚   β”‚   β”œβ”€β”€ test-generation/
β”‚   β”‚   β”œβ”€β”€ coverage-analysis/
β”‚   β”‚   β”œβ”€β”€ quality-assessment/
β”‚   β”‚   └── ...
β”‚   β”œβ”€β”€ routing/             # Agent routing & registry
β”‚   β”œβ”€β”€ mcp/                 # MCP server
β”‚   └── cli/                 # CLI
β”œβ”€β”€ tests/                   # 17,000+ tests
β”œβ”€β”€ assets/agents/           # 60 QE agent definitions (53 main + 7 subagents)
β”œβ”€β”€ .claude/
β”‚   β”œβ”€β”€ agents/v3/           # Agent definitions (source)
β”‚   └── skills/              # QE-specific skills
β”œβ”€β”€ .kiro/                   # AWS Kiro IDE integration
β”‚   β”œβ”€β”€ agents/              # 87 Kiro agent definitions (JSON)
β”‚   β”œβ”€β”€ skills/              # 86 Kiro skill definitions (SKILL.md)
β”‚   β”œβ”€β”€ hooks/               # 5 event-driven hooks
β”‚   β”œβ”€β”€ steering/            # 2 QE steering files
β”‚   └── settings/            # MCP server configuration
β”œβ”€β”€ docs/                    # Documentation
β”‚   β”œβ”€β”€ plans/               # Migration plans
β”‚   β”œβ”€β”€ policies/            # Project policies
β”‚   └── implementation/      # ADRs and DDD docs
β”œβ”€β”€ package.json
└── README.md                # This file

πŸš€ Development

Setup

# Clone repository
git clone https://github.com/proffesor-for-testing/agentic-qe.git
cd agentic-qe

# Install V3 dependencies
cd v3
npm install

# Build
npm run build

# Run tests
npm test -- --run

V3 Scripts

ScriptDescription
npm run buildCompile TypeScript
npm test -- --runRun all tests
npm run cliRun CLI in dev mode
npm run mcpStart MCP server

🀝 Contributing

We welcome contributions! Please see CONTRIBUTING.md for details.

πŸ“ž Support

πŸ“ License

This project is licensed under the MIT License - see the LICENSE file for details.

πŸ‘₯ Contributors

Thanks to all the amazing people who have contributed to Agentic QE Fleet!


@proffesor-for-testing
Project Lead

@fndlalit
QX Partner, Testability

@shaal
Core Development

@mondweep
Architecture

View all contributors | Become a contributor

πŸ’– Support the Project

If you find Agentic QE Fleet valuable, consider supporting its development:

MonthlyAnnual (Save $10)
Price$5/month$50/year
BenefitsSponsor recognition, Priority supportAll monthly + Featured in README, Roadmap input
SubscribeMonthlyAnnual

View sponsorship details

πŸ™ Acknowledgments

V3 is built on the shoulders of giants:

  • Claude Flow by @ruvnet - Multi-agent orchestration, MCP integration, swarm coordination
  • Agentic Flow by @ruvnet - Agent patterns, learning systems, neural coordination
  • Built with TypeScript, Node.js, and better-sqlite3
  • HNSW indexing via hnswlib-node
  • Inspired by Domain-Driven Design and swarm intelligence
  • Integrates with Jest, Cypress, Playwright, k6, SonarQube, and more
  • Compatible with Claude Code via Model Context Protocol (MCP)

Made with ❀️ by the Agentic QE Team

⭐ Star us on GitHub | πŸ’– Sponsor | πŸ‘₯ Contributors

Keywords

quality-engineering

FAQs

Package last updated on 05 Mar 2026

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts