Latest Threat Research:SANDWORM_MODE: Shai-Hulud-Style npm Worm Hijacks CI Workflows and Poisons AI Toolchains.Details โ†’
Socket
Book a DemoInstallSign in
Socket

@techwavedev/agi-agent-kit

Package Overview
Dependencies
Maintainers
1
Versions
35
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

@techwavedev/agi-agent-kit

Enterprise-Grade Agentic Framework - Modular skill-based AI assistant toolkit with deterministic execution, semantic memory, and platform-adaptive orchestration.

Source
npmnpm
Version
1.3.5
Version published
Weekly downloads
710
-0.84%
Maintainers
1
Weekly downloads
ย 
Created
Source

๐Ÿš€ AGI Agent Kit

Stop hallucinating. Start executing.

npm version npm downloads License: Apache-2.0 Claude Code Gemini CLI Codex CLI Cursor GitHub Copilot OpenCode Antigravity IDE AdaL CLI Buy me a coffee

AGI Agent Kit is the enterprise-grade scaffolding that turns any AI coding assistant into a deterministic production machine. While LLMs are probabilistic (90% accuracy per step = 59% over 5 steps), this framework forces them through a 3-Layer Architecture โ€” Intent โ†’ Orchestration โ†’ Execution โ€” where business logic lives in tested scripts, not hallucinated code.

Why this exists

Most AI coding setups give you a prompt and hope for the best. AGI Agent Kit gives you:

  • ๐Ÿง  Semantic Memory โ€” Qdrant-powered caching that eliminates redundant LLM calls (90-100% token savings)
  • ๐ŸŽฏ 19 Specialist Agents โ€” Domain-bounded experts (Frontend, Backend, Security, Mobile, Game Dev...) with enforced file ownership
  • โšก 861 Curated Skills โ€” 4 core + 75 professional + 782 community skills across 16 domain categories
  • ๐Ÿ”’ Verification Gates โ€” No task completes without evidence. TDD enforcement. Two-stage code review.
  • ๐ŸŒ 8 Platforms, One Config โ€” Write once, run on Claude Code, Gemini CLI, Codex CLI, Cursor, Copilot, OpenCode, AdaL CLI, Antigravity IDE
npx @techwavedev/agi-agent-kit init

If this project helps you, consider supporting it here or simply โญ the repo.

๐Ÿš€ Quick Start

Scaffold a new agent workspace in seconds:

npx @techwavedev/agi-agent-kit init

You'll be prompted to choose a pack:

  • core โ€” 4 essential skills (webcrawler, pdf-reader, qdrant-memory, documentation)
  • medium โ€” Core + 75 specialized skills in 16 categories + .agent/ structure (API, Security, Design, Architecture)
  • full โ€” Complete suite: Medium + 782 community skills from antigravity-awesome-skills (861 total)

After installation, run the one-shot setup wizard to auto-configure your environment:

python3 skills/plugin-discovery/scripts/platform_setup.py --project-dir .

This detects your platform, scans the project stack, and configures everything with a single confirmation.

Then boot the memory system for automatic token savings:

python3 execution/session_boot.py --auto-fix

This checks Qdrant, Ollama, embedding models, and collections โ€” auto-fixing any issues.

โœจ Key Features

FeatureDescription
Deterministic ExecutionSeparates business logic (Python scripts) from AI reasoning (Directives)
Modular Skill System861 plug-and-play skills across 3 tiers, organized in 16 domain categories
Structured Plan ExecutionBatch or subagent-driven execution with two-stage review (spec + quality)
TDD EnforcementIron-law RED-GREEN-REFACTOR cycle โ€” no production code without failing test
Verification GatesEvidence before claims โ€” no completion without fresh verification output
Platform-AdaptiveAuto-detects Claude Code, Gemini CLI, Codex CLI, Cursor, Copilot, OpenCode, AdaL, Antigravity
Multi-Agent OrchestrationAgent Teams, subagents, Powers, or sequential personas โ€” adapts to platform
Semantic MemoryBuilt-in Qdrant-powered memory with 95% token savings via caching
Self-Healing WorkflowsAgents read error logs, patch scripts, and update directives automatically
One-Shot SetupPlatform detection + project stack scan + auto-configuration in one command

๐Ÿ†š How This Compares to Superpowers

The agi framework adopts all best patterns from obra/superpowers and extends them with capabilities superpowers does not have:

Capabilityobra/superpowersagi Framework
TDD Enforcementโœ…โœ… Adapted
Plan Execution + Reviewโœ…โœ… Adapted + platform-adaptive
Systematic Debuggingโœ…โœ… Adapted + debugger agent
Verification Gatesโœ…โœ… Adapted + 12 audit scripts
Two-Stage Code Reviewโœ…โœ… Adapted into orchestrator
Multi-Platform OrchestrationโŒ Claude onlyโœ… 4 platforms
Semantic Memory (Qdrant)โŒโœ… 90-100% token savings
19 Specialist AgentsโŒโœ… Domain boundaries
Agent Boundary EnforcementโŒโœ… File-type ownership
Dynamic Question GenerationโŒโœ… Trade-offs + priorities
Memory-First ProtocolโŒโœ… Auto cache-hit
Skill Creator + CatalogโŒโœ… 861 composable skills
Platform Setup WizardโŒโœ… One-shot config
Multi-Platform SymlinksโŒ Claude onlyโœ… 8 platforms

๐Ÿงช Real Benchmark: Subagents vs Agent Teams

The framework supports two orchestration modes. Here are real test results from execution/benchmark_modes.py running on local infrastructure (Qdrant + Ollama nomic-embed-text, zero cloud API calls):

MODE A: SUBAGENTS โ€” Independent, fire-and-forget
  ๐Ÿ“ค Explore Auth Patterns    โ†’ โœ… stored in cache + memory (127ms)
  ๐Ÿ“ค Query Performance        โ†’ โŒ FAILED (timeout โ€” fault tolerant)
  ๐Ÿ“ค Scan CVEs                โ†’ โœ… stored in cache + memory (14ms)
  Summary: 2/3 completed, 1 failed, 0 cross-references

MODE B: AGENT TEAMS โ€” Shared context, coordinated
  ๐Ÿ‘ค Backend Specialist       โ†’ โœ… stored in shared memory (14ms)
  ๐Ÿ‘ค Database Specialist      โ†’ โœ… stored in shared memory (13ms)
  ๐Ÿ‘ค Frontend Specialist      โ†’ ๐Ÿ”— Read Backend + Database output first
     โœ… Got context from team-backend: "API contract: POST /api/messages..."
     โœ… Got context from team-database: "Schema: users(id UUID PK, name..."
     โ†’ โœ… stored in shared memory (14ms)
  Summary: 3/3 completed, 0 failed, 2 cross-references

2nd run (cache warm): All queries hit cache at score 1.000, reducing total time from 314ms โ†’ 76ms (Subagents) and 292ms โ†’ 130ms (Agent Teams).

MetricSubagentsAgent Teams
Execution modelFire-and-forget (isolated)Shared context (coordinated)
Tasks completed2/3 (fault tolerant)3/3
Cross-references0 (not supported)2 (peers read each other's work)
Context sharingโŒ Each agent isolatedโœ… Peer-to-peer via Qdrant
Two-stage reviewโŒโœ… Spec + Quality
Cache hits (2nd run)5/55/5
Embedding providerOllama local (nomic-embed-text 137M)Ollama local (nomic-embed-text 137M)

Try it yourself:

# 1. Start infrastructure
docker run -d -p 6333:6333 -v qdrant_storage:/qdrant/storage qdrant/qdrant
ollama serve & ollama pull nomic-embed-text

# 2. Boot memory system
python3 execution/session_boot.py --auto-fix
# โœ… Memory system ready โ€” 5 memories, 1 cached responses

# 3. Run the full benchmark (both modes)
python3 execution/benchmark_modes.py --verbose

# 4. Or test individual operations:

# Store a decision (embedding generated locally via Ollama)
python3 execution/memory_manager.py store \
  --content "Chose PostgreSQL for relational data" \
  --type decision --project myapp
# โ†’ {"status": "stored", "point_id": "...", "token_count": 5}

# Auto-query: checks cache first, then retrieves context
python3 execution/memory_manager.py auto \
  --query "what database did we choose?"
# โ†’ {"source": "memory", "cache_hit": false, "context_chunks": [...]}

# Cache an LLM response for future reuse
python3 execution/memory_manager.py cache-store \
  --query "how to set up auth?" \
  --response "Use JWT with 24h expiry, refresh tokens in httpOnly cookies"

# Re-query โ†’ instant cache hit (score 1.000, zero re-computation)
python3 execution/memory_manager.py auto \
  --query "how to set up auth?"
# โ†’ {"source": "cache", "cache_hit": true, "tokens_saved_estimate": 12}

๐ŸŒ Platform Support

The framework automatically detects your AI coding environment and activates the best available features.

Skills are installed to the canonical skills/ directory and symlinked to each platform's expected path:

PlatformSkills PathInstruction FileOrchestration Strategy
Claude Code.claude/skills/CLAUDE.mdAgent Teams (parallel) or Subagents
Gemini CLI.gemini/skills/GEMINI.mdSequential personas via @agent
Codex CLI.codex/skills/AGENTS.mdSequential via prompts
Antigravity IDE.agent/skills/AGENTS.mdFull agentic orchestration
Cursor.cursor/skills/AGENTS.mdChat-based via @skill
GitHub CopilotN/A (paste)COPILOT.mdManual paste into context
OpenCode.agent/skills/OPENCODE.mdSequential personas via @agent
AdaL CLI.adal/skills/AGENTS.mdAuto-load on demand

Run /setup to auto-detect and configure your platform, or use the setup script directly:

# Interactive (one Y/n question)
python3 skills/plugin-discovery/scripts/platform_setup.py --project-dir .

# Auto-apply everything
python3 skills/plugin-discovery/scripts/platform_setup.py --project-dir . --auto

# Preview without changes
python3 skills/plugin-discovery/scripts/platform_setup.py --project-dir . --dry-run

๐Ÿ“ฆ What You Get

your-project/
โ”œโ”€โ”€ AGENTS.md              # Master instruction file
โ”œโ”€โ”€ GEMINI.md โ†’ AGENTS.md  # Platform symlinks
โ”œโ”€โ”€ CLAUDE.md โ†’ AGENTS.md
โ”œโ”€โ”€ OPENCODE.md โ†’ AGENTS.md
โ”œโ”€โ”€ COPILOT.md โ†’ AGENTS.md
โ”œโ”€โ”€ skills/                # Up to 861 skills (depends on pack)
โ”‚   โ”œโ”€โ”€ webcrawler/        # Documentation harvesting
โ”‚   โ”œโ”€โ”€ qdrant-memory/     # Semantic caching & memory
โ”‚   โ””โ”€โ”€ ...                # 860 more skills in full pack
โ”œโ”€โ”€ .claude/skills โ†’ skills/   # Platform-specific symlinks
โ”œโ”€โ”€ .gemini/skills โ†’ skills/
โ”œโ”€โ”€ .codex/skills โ†’ skills/
โ”œโ”€โ”€ .cursor/skills โ†’ skills/
โ”œโ”€โ”€ .adal/skills โ†’ skills/
โ”œโ”€โ”€ directives/            # SOPs in Markdown
โ”œโ”€โ”€ execution/             # Deterministic Python scripts
โ”‚   โ”œโ”€โ”€ session_boot.py    # Session startup (Qdrant + Ollama check)
โ”‚   โ””โ”€โ”€ memory_manager.py  # Store/retrieve/cache operations
โ”œโ”€โ”€ skill-creator/         # Tools to create new skills
โ””โ”€โ”€ .agent/                # (medium/full) Agents, workflows, rules
    โ””โ”€โ”€ workflows/         # /setup, /deploy, /test, /debug, etc.

๐Ÿ“– Architecture

The system operates on three layers:

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  Layer 1: DIRECTIVES (Intent)                           โ”‚
โ”‚  โ””โ”€ SOPs written in Markdown (directives/)              โ”‚
โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
โ”‚  Layer 2: ORCHESTRATION (Agent)                         โ”‚
โ”‚  โ””โ”€ LLM reads directive, decides which tool to call     โ”‚
โ”‚  โ””โ”€ Platform-adaptive: Teams, Subagents, or Personas    โ”‚
โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
โ”‚  Layer 3: EXECUTION (Code)                              โ”‚
โ”‚  โ””โ”€ Pure Python scripts (execution/) do the actual work โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

Why? LLMs are probabilistic. 90% accuracy per step = 59% success over 5 steps. By pushing complexity into deterministic scripts, we achieve reliable execution.

๐Ÿง  Semantic Memory

Built-in Qdrant-powered memory with automatic token savings:

ScenarioWithout MemoryWith MemorySavings
Repeated question~2000 tokens0 tokens100%
Similar architecture~5000 tokens~500 tokens90%
Past error resolution~3000 tokens~300 tokens90%

Setup (requires Qdrant + Ollama):

# Start Qdrant
docker run -d -p 6333:6333 -v qdrant_storage:/qdrant/storage qdrant/qdrant

# Start Ollama + pull embedding model
ollama serve &
ollama pull nomic-embed-text

# Boot memory system (auto-creates collections)
python3 execution/session_boot.py --auto-fix

Agents automatically run session_boot.py at session start (first instruction in AGENTS.md). Memory operations:

# Auto-query (check cache + retrieve context)
python3 execution/memory_manager.py auto --query "your task summary"

# Store a decision
python3 execution/memory_manager.py store --content "what was decided" --type decision

# Health check
python3 execution/memory_manager.py health

โšก Prerequisites

The npx init command automatically creates a .venv and installs all dependencies. Just activate it:

source .venv/bin/activate   # macOS/Linux
# .venv\Scripts\activate    # Windows

If you need to reinstall or update dependencies:

.venv/bin/pip install -r requirements.txt

๐Ÿ”ง Commands

Initialize a new project

npx @techwavedev/agi-agent-kit init --pack=full

Auto-detect platform and configure environment

python3 skills/plugin-discovery/scripts/platform_setup.py --project-dir .

Update to latest version

npx @techwavedev/agi-agent-kit@latest init --pack=full
# or use the built-in skill:
python3 skills/self-update/scripts/update_kit.py

Boot memory system

python3 execution/session_boot.py --auto-fix

System health check

python3 execution/system_checkup.py --verbose

Create a new skill

python3 skill-creator/scripts/init_skill.py my-skill --path skills/

Update skills catalog

python3 skill-creator/scripts/update_catalog.py --skills-dir skills/

๐ŸŽฏ Activation Reference

Use these keywords, commands, and phrases to trigger specific capabilities:

Slash Commands (Workflows)

CommandWhat It Does
/setupAuto-detect platform and configure environment
/setup-memoryInitialize Qdrant + Ollama memory system
/createStart interactive app builder dialogue
/planCreate a structured project plan (no code)
/enhanceAdd or update features in existing app
/debugActivate systematic debugging mode
/testGenerate and run tests
/deployPre-flight checks + deployment
/orchestrateMulti-agent coordination for complex tasks
/brainstormStructured brainstorming with multiple options
/previewStart/stop local dev server
/statusShow project progress and status board
/updateUpdate AGI Agent Kit to latest version
/checkupVerify agents, workflows, skills, and core files

Agent Mentions (@agent)

MentionSpecialistWhen To Use
@orchestratorMulti-agent coordinatorComplex multi-domain tasks
@project-plannerPlanning specialistRoadmaps, task breakdowns, phase planning
@frontend-specialistUI/UX architectWeb interfaces, React, Next.js
@mobile-developerMobile specialistiOS, Android, React Native, Flutter
@backend-specialistAPI/DB engineerServer-side, databases, APIs
@security-auditorSecurity expertVulnerability scanning, audits, hardening
@debuggerDebug specialistComplex bug investigation
@game-developerGame dev specialist2D/3D games, multiplayer, VR/AR

Skill Trigger Keywords (Natural Language)

CategoryTrigger Words / PhrasesSkill Activated
Memory"don't use cache", "no cache", "skip memory", "fresh"Memory opt-out
Research"research my docs", "check my notebooks", "deep search", "@notebooklm"notebooklm-rag
Documentation"update docs", "regenerate catalog", "sync documentation"documentation
Quality"lint", "format", "check", "validate", "static analysis"lint-and-validate
Testing"write tests", "run tests", "TDD", "test coverage"testing-patterns / tdd-workflow
TDD"test first", "red green refactor", "failing test"test-driven-development
Plan Execution"execute plan", "run the plan", "batch execution"executing-plans
Verification"verify", "prove it works", "evidence", "show me the output"verification-before-completion
Debugging"debug", "root cause", "investigate", "why is this failing"systematic-debugging
Architecture"design system", "architecture decision", "ADR", "trade-off"architecture
Security"security scan", "vulnerability", "audit", "OWASP"red-team-tactics
Performance"lighthouse", "bundle size", "core web vitals", "profiling"performance-profiling
Design"design UI", "color scheme", "typography", "layout"frontend-design
Deployment"deploy", "rollback", "release", "CI/CD"deployment-procedures
API"REST API", "GraphQL", "tRPC", "API design"api-patterns
Database"schema design", "migration", "query optimization"database-design
Planning"plan this", "break down", "task list", "requirements"plan-writing
Brainstorming"explore options", "what are the approaches", "pros and cons"brainstorming
Code Review"review this", "code quality", "best practices"code-review-checklist
i18n"translate", "localization", "RTL", "locale"i18n-localization
AWS"terraform", "EKS", "Lambda", "S3", "CloudFront"aws / aws-terraform
Infrastructure"Consul", "service mesh", "OpenSearch"consul / opensearch

Memory System Commands

What You WantCommand / Phrase
Boot memorypython3 execution/session_boot.py --auto-fix
Check before a taskpython3 execution/memory_manager.py auto --query "..."
Store a decisionpython3 execution/memory_manager.py store --content "..." --type decision
Cache a responsepython3 execution/memory_manager.py cache-store --query "..." --response "..."
Health checkpython3 execution/memory_manager.py health
Skip cache for this taskSay "fresh", "no cache", or "skip memory" in your prompt

๐Ÿ“š Documentation

๐Ÿค Community Skills & Credits

The Full tier includes 782 community skills adapted from the Antigravity Awesome Skills project (v5.4.0) by @sickn33, distributed under the MIT License.

This collection aggregates skills from 50+ open-source contributors and organizations including Anthropic, Microsoft, Vercel Labs, Supabase, Trail of Bits, Expo, Sentry, Neon, fal.ai, and many more. For the complete attribution ledger, see SOURCES.md.

Each community skill has been adapted for the AGI framework with:

  • Qdrant Memory Integration โ€” Semantic caching and context retrieval
  • Agent Team Collaboration โ€” Orchestrator-driven invocation and shared memory
  • Local LLM Support โ€” Ollama-based embeddings for local-first operation

If these community skills help you, consider starring the original repo or supporting the author.

๐Ÿ›ก๏ธ Security

This package includes a pre-flight security scanner that checks for private terms before publishing. All templates are sanitized for public use.

โ˜• Support

If the AGI Agent Kit helps you build better AI-powered workflows, consider supporting the project:

๐Ÿ“„ License

Apache-2.0 ยฉ Elton Machado@TechWaveDev

Community skills in the Full tier are licensed under the MIT License. See THIRD-PARTY-LICENSES.md for details.

Keywords

ai

FAQs

Package last updated on 16 Feb 2026

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts