
@techwavedev/agi-agent-kit
Enterprise-Grade Agentic Framework - Modular skill-based AI assistant toolkit with deterministic execution, semantic memory, and platform-adaptive orchestration.
🌐 Português (BR) | English
Stop hallucinating. Start executing.
AGI Agent Kit is the enterprise-grade scaffolding that turns any AI coding assistant into a deterministic production machine. LLMs are probabilistic (90% accuracy per step compounds to only 59% over 5 steps), so the framework forces them through a 3-Layer Architecture (Intent → Orchestration → Execution) where business logic lives in tested scripts, not hallucinated code.
Most AI coding setups give you a prompt and hope for the best. AGI Agent Kit gives you deterministic execution, semantic memory, and platform-adaptive orchestration out of the box.
If this project helps you, consider supporting it here or simply ⭐ the repo.
Scaffold a new agent workspace in seconds:
npx @techwavedev/agi-agent-kit init
# Or install globally to ~/.agent to share skills across projects
npx @techwavedev/agi-agent-kit init --global
You'll be guided through an interactive wizard:
- choose the install location (project-local or a global `~/.agent` shared across projects)
- scaffold the `.agent/` structure
- apply platform settings (`.claude/settings.json`)

After installation the wizard shows your next steps, including:
# Boot the memory system (verifies Qdrant + Ollama, auto-fixes issues)
python3 execution/session_boot.py --auto-fix
# Run the platform setup wizard (auto-configures your AI platform)
python3 skills/plugin-discovery/scripts/platform_setup.py --project-dir .
| Feature | Description |
|---|---|
| Deterministic Execution | Separates business logic (Python scripts) from AI reasoning (Directives) |
| Modular Skill System | 878 plug-and-play skills across 3 tiers, organized in 16 domain categories |
| Structured Plan Execution | Batch or subagent-driven execution with two-stage review (spec + quality) |
| TDD Enforcement | Iron-law RED-GREEN-REFACTOR cycle: no production code without a failing test |
| Verification Gates | Evidence before claims: no completion without fresh verification output |
| Platform-Adaptive | Auto-detects Claude Code, Gemini CLI, Codex CLI, Cursor, Copilot, OpenCode, AdaL, Antigravity |
| Multi-Agent Orchestration | Agent Teams, subagents, Powers, or sequential personas β adapts to platform |
| Hybrid Memory | Qdrant vectors + BM25 keywords with weighted score merge (95% token savings) |
| Self-Healing Workflows | Agents read error logs, patch scripts, and update directives automatically |
| One-Shot Setup | Platform detection + project stack scan + auto-configuration in one command |
The agi framework adopts the best patterns from obra/superpowers and extends them with capabilities superpowers does not have:
| Capability | obra/superpowers | agi Framework |
|---|---|---|
| TDD Enforcement | ✅ | ✅ Adapted |
| Plan Execution + Review | ✅ | ✅ Adapted + platform-adaptive |
| Systematic Debugging | ✅ | ✅ Adapted + debugger agent |
| Verification Gates | ✅ | ✅ Adapted + 12 audit scripts |
| Two-Stage Code Review | ✅ | ✅ Adapted into orchestrator |
| Multi-Platform Orchestration | ⚠️ Claude only | ✅ 9 platforms |
| Semantic Memory (Qdrant) | ❌ | ✅ 90-100% token savings |
| 19 Specialist Agents | ❌ | ✅ Domain boundaries |
| Agent Boundary Enforcement | ❌ | ✅ File-type ownership |
| Dynamic Question Generation | ❌ | ✅ Trade-offs + priorities |
| Memory-First Protocol | ❌ | ✅ Auto cache-hit |
| Skill Creator + Catalog | ❌ | ✅ 878 composable skills |
| Platform Setup Wizard | ❌ | ✅ One-shot config |
| Multi-Platform Symlinks | ⚠️ Claude only | ✅ 9 platforms |
The framework supports two orchestration modes. Here are real test results from execution/benchmark_modes.py running on local infrastructure (Qdrant + Ollama nomic-embed-text, zero cloud API calls):
MODE A: SUBAGENTS (independent, fire-and-forget)
🤖 Explore Auth Patterns → ✅ stored in cache + memory (127ms)
🤖 Query Performance → ❌ FAILED (timeout, fault tolerant)
🤖 Scan CVEs → ✅ stored in cache + memory (14ms)
Summary: 2/3 completed, 1 failed, 0 cross-references

MODE B: AGENT TEAMS (shared context, coordinated)
🤖 Backend Specialist → ✅ stored in shared memory (14ms)
🤖 Database Specialist → ✅ stored in shared memory (13ms)
🤖 Frontend Specialist → 📖 Read Backend + Database output first
   ✅ Got context from team-backend: "API contract: POST /api/messages..."
   ✅ Got context from team-database: "Schema: users(id UUID PK, name..."
   → ✅ stored in shared memory (14ms)
Summary: 3/3 completed, 0 failed, 2 cross-references
2nd run (cache warm): all queries hit the cache at score 1.000, reducing total time from 314ms → 76ms (Subagents) and 292ms → 130ms (Agent Teams).
| Metric | Subagents | Agent Teams |
|---|---|---|
| Execution model | Fire-and-forget (isolated) | Shared context (coordinated) |
| Tasks completed | 2/3 (fault tolerant) | 3/3 |
| Cross-references | 0 (not supported) | 2 (peers read each other's work) |
| Context sharing | ❌ Each agent isolated | ✅ Peer-to-peer via Qdrant |
| Two-stage review | ❌ | ✅ Spec + Quality |
| Cache hits (2nd run) | 5/5 | 5/5 |
| Embedding provider | Ollama local (nomic-embed-text 137M) | Ollama local (nomic-embed-text 137M) |
Try it yourself:
# 1. Start infrastructure
docker run -d -p 6333:6333 -v qdrant_storage:/qdrant/storage qdrant/qdrant
ollama serve & ollama pull nomic-embed-text
# 2. Boot memory system
python3 execution/session_boot.py --auto-fix
# ✅ Memory system ready: 5 memories, 1 cached response
# 3. Run the full benchmark (both modes)
python3 execution/benchmark_modes.py --verbose
# 4. Or test individual operations:
# Store a decision (embedding generated locally via Ollama)
python3 execution/memory_manager.py store \
--content "Chose PostgreSQL for relational data" \
--type decision --project myapp
# β {"status": "stored", "point_id": "...", "token_count": 5}
# Auto-query: checks cache first, then retrieves context
python3 execution/memory_manager.py auto \
--query "what database did we choose?"
# β {"source": "memory", "cache_hit": false, "context_chunks": [...]}
# Cache an LLM response for future reuse
python3 execution/memory_manager.py cache-store \
--query "how to set up auth?" \
--response "Use JWT with 24h expiry, refresh tokens in httpOnly cookies"
# Re-query β instant cache hit (score 1.000, zero re-computation)
python3 execution/memory_manager.py auto \
--query "how to set up auth?"
# β {"source": "cache", "cache_hit": true, "tokens_saved_estimate": 12}
The framework automatically detects your AI coding environment and activates the best available features.
Skills are installed to the canonical skills/ directory and symlinked to each platform's expected path:
| Platform | Skills Path | Instruction File | Orchestration Strategy |
|---|---|---|---|
| Claude Code | .claude/skills/ | CLAUDE.md | Agent Teams (parallel) or Subagents |
| Gemini CLI | .gemini/skills/ | GEMINI.md | Sequential personas via @agent |
| Codex CLI | .codex/skills/ | AGENTS.md | Sequential via prompts |
| Antigravity IDE | .agent/skills/ | AGENTS.md | Full agentic orchestration |
| Cursor | .cursor/skills/ | AGENTS.md | Chat-based via @skill |
| GitHub Copilot | N/A (paste) | COPILOT.md | Manual paste into context |
| OpenCode | .agent/skills/ | OPENCODE.md | Sequential personas via @agent |
| AdaL CLI | .adal/skills/ | AGENTS.md | Auto-load on demand |
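The canonical-directory-plus-symlinks scheme is easy to reproduce by hand. A minimal sketch (the `demo/` paths are illustrative; the setup wizard does this for you):

```shell
# Hypothetical layout: one canonical skills/ directory,
# with per-platform symlinks pointing back at it.
mkdir -p demo/skills demo/.claude demo/.gemini
ln -s ../skills demo/.claude/skills
ln -s ../skills demo/.gemini/skills
ls -l demo/.claude    # shows: skills -> ../skills
```

Because every platform path is a symlink to the same directory, a skill added once under `skills/` is immediately visible to every configured platform.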
Run /setup to auto-detect and configure your platform, or use the setup script directly:
# Interactive (one Y/n question)
python3 skills/plugin-discovery/scripts/platform_setup.py --project-dir .
# Auto-apply everything
python3 skills/plugin-discovery/scripts/platform_setup.py --project-dir . --auto
# Preview without changes
python3 skills/plugin-discovery/scripts/platform_setup.py --project-dir . --dry-run
your-project/
├── AGENTS.md                   # Master instruction file
├── GEMINI.md → AGENTS.md       # Platform symlinks
├── CLAUDE.md → AGENTS.md
├── OPENCODE.md → AGENTS.md
├── COPILOT.md → AGENTS.md
├── skills/                     # Up to 878 skills (depends on pack)
│   ├── webcrawler/             # Documentation harvesting
│   ├── qdrant-memory/          # Semantic caching & memory
│   └── ...                     # 877 more skills in full pack
├── .claude/skills → skills/    # Platform-specific symlinks
├── .gemini/skills → skills/
├── .codex/skills → skills/
├── .cursor/skills → skills/
├── .adal/skills → skills/
├── directives/                 # SOPs in Markdown
├── execution/                  # Deterministic Python scripts
│   ├── session_boot.py         # Session startup (Qdrant + Ollama check)
│   └── memory_manager.py       # Store/retrieve/cache operations
├── skill-creator/              # Tools to create new skills
├── .agent/                     # (medium/full) Agents, workflows, rules
└── workflows/                  # /setup, /deploy, /test, /debug, etc.
The system operates on three layers:
┌──────────────────────────────────────────────────────────┐
│ Layer 1: DIRECTIVES (Intent)                             │
│   └─ SOPs written in Markdown (directives/)              │
├──────────────────────────────────────────────────────────┤
│ Layer 2: ORCHESTRATION (Agent)                           │
│   ├─ LLM reads directive, decides which tool to call     │
│   └─ Platform-adaptive: Teams, Subagents, or Personas    │
├──────────────────────────────────────────────────────────┤
│ Layer 3: EXECUTION (Code)                                │
│   └─ Pure Python scripts (execution/) do the actual work │
└──────────────────────────────────────────────────────────┘
Why? LLMs are probabilistic: 90% accuracy per step compounds to roughly 59% success over 5 steps. By pushing complexity into deterministic scripts, the framework achieves reliable execution.
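The compounding claim is simple probability; a quick sketch with illustrative numbers:

```python
# Illustrative math only: per-step accuracy compounds multiplicatively
# across a multi-step agent workflow.
def chain_success(per_step: float, steps: int) -> float:
    """Probability that every step in an n-step chain succeeds."""
    return per_step ** steps

print(f"{chain_success(0.90, 5):.0%}")   # 59% end-to-end at 90% per step
print(f"{chain_success(0.99, 5):.0%}")   # 95% at 99% per step
```

Small per-step gains compound dramatically, which is why moving logic into scripts that succeed deterministically pays off over long workflows.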
Dual-engine retrieval: Qdrant vector similarity for semantic concepts + SQLite FTS5 BM25 for exact keyword matching. Automatically merges results with configurable weights.
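The merge step can be sketched as a weighted sum over normalized per-engine scores. This is a hypothetical illustration: the function name and the 0.7/0.3 default weights are assumptions, not the framework's actual API.

```python
# Hypothetical sketch of weighted score merging across two retrieval engines.
def merge_scores(vector_hits, bm25_hits, w_vector=0.7, w_bm25=0.3):
    """vector_hits / bm25_hits: dicts of {doc_id: normalized score in [0, 1]}."""
    merged = {}
    for doc_id in set(vector_hits) | set(bm25_hits):
        merged[doc_id] = (w_vector * vector_hits.get(doc_id, 0.0)
                          + w_bm25 * bm25_hits.get(doc_id, 0.0))
    # Highest combined score first
    return sorted(merged.items(), key=lambda kv: kv[1], reverse=True)

ranked = merge_scores({"a": 0.9, "b": 0.4}, {"b": 1.0, "c": 0.8})
# "a" still ranks first, but "b" gets credit from both engines
```

The design point is that a document found by only one engine still surfaces, while a document found by both is boosted above either individual ranking signal.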
| Scenario | Without Memory | With Memory | Savings |
|---|---|---|---|
| Repeated question | ~2000 tokens | 0 tokens | 100% |
| Similar architecture | ~5000 tokens | ~500 tokens | 90% |
| Past error resolution | ~3000 tokens | ~300 tokens | 90% |
| Exact ID/code lookup | ~3000 tokens | ~200 tokens | 93% |
Setup (requires Qdrant + Ollama):
# Start Qdrant
docker run -d -p 6333:6333 -v qdrant_storage:/qdrant/storage qdrant/qdrant
# Start Ollama + pull embedding model
ollama serve &
ollama pull nomic-embed-text
# Boot memory system (auto-creates collections)
python3 execution/session_boot.py --auto-fix
Agents automatically run session_boot.py at session start (first instruction in AGENTS.md). Memory operations:
# Auto-query (check cache + retrieve context)
python3 execution/memory_manager.py auto --query "your task summary"
# Store a decision (auto-indexes into BM25)
python3 execution/memory_manager.py store --content "what was decided" --type decision
# Health check (includes BM25 index status)
python3 execution/memory_manager.py health
# Rebuild BM25 index from existing Qdrant data
python3 execution/memory_manager.py bm25-sync
Hybrid search modes (via hybrid_search.py):
# True hybrid (default): vector + BM25 merged
python3 skills/qdrant-memory/scripts/hybrid_search.py --query "ImagePullBackOff error" --mode hybrid
# Vector only (pure semantic)
python3 skills/qdrant-memory/scripts/hybrid_search.py --query "database architecture" --mode vector
# Keyword only (exact BM25 match)
python3 skills/qdrant-memory/scripts/hybrid_search.py --query "sg-018f20ea63e82eeb5" --mode keyword
The npx init command automatically creates a .venv and installs all dependencies. Just activate it:
source .venv/bin/activate # macOS/Linux
# .venv\Scripts\activate # Windows
If you need to reinstall or update dependencies:
.venv/bin/pip install -r requirements.txt
# Install the full skill pack
npx @techwavedev/agi-agent-kit init --pack=full
# To install globally instead of per-project:
npx @techwavedev/agi-agent-kit init --pack=full --global
# Re-run the platform setup wizard
python3 skills/plugin-discovery/scripts/platform_setup.py --project-dir .
# Update to the latest version
npx @techwavedev/agi-agent-kit@latest init --pack=full
# or use the built-in skill:
python3 skills/self-update/scripts/update_kit.py
# Verify the installation
python3 execution/session_boot.py --auto-fix
python3 execution/system_checkup.py --verbose
# Create a new skill and refresh the catalog
python3 skill-creator/scripts/init_skill.py my-skill --path skills/
python3 skill-creator/scripts/update_catalog.py --skills-dir skills/
Use these keywords, commands, and phrases to trigger specific capabilities:
| Command | What It Does |
|---|---|
/setup | Auto-detect platform and configure environment |
/setup-memory | Initialize Qdrant + Ollama memory system |
/create | Start interactive app builder dialogue |
/plan | Create a structured project plan (no code) |
/enhance | Add or update features in existing app |
/debug | Activate systematic debugging mode |
/test | Generate and run tests |
/deploy | Pre-flight checks + deployment |
/orchestrate | Multi-agent coordination for complex tasks |
/brainstorm | Structured brainstorming with multiple options |
/preview | Start/stop local dev server |
/status | Show project progress and status board |
/update | Update AGI Agent Kit to latest version |
/checkup | Verify agents, workflows, skills, and core files |
Specialist agents (invoke with @agent):
| Mention | Specialist | When To Use |
|---|---|---|
@orchestrator | Multi-agent coordinator | Complex multi-domain tasks |
@project-planner | Planning specialist | Roadmaps, task breakdowns, phase planning |
@frontend-specialist | UI/UX architect | Web interfaces, React, Next.js |
@backend-specialist | API/DB engineer | Server-side, databases, APIs |
@mobile-developer | Mobile specialist | iOS, Android, React Native, Flutter |
@security-auditor | Security expert | Vulnerability scanning, audits, hardening |
@debugger | Debug specialist | Complex bug investigation |
@game-developer | Game dev specialist | 2D/3D games, multiplayer, VR/AR |
@devops-engineer | DevOps specialist | CI/CD, containers, cloud infrastructure |
@database-architect | Database specialist | Schema design, migrations, optimization |
@documentation-writer | Docs specialist | Technical writing, API docs, READMEs |
@test-engineer | Testing specialist | Test strategy, automation, coverage |
@qa-automation-engineer | QA specialist | E2E testing, regression, quality gates |
@performance-optimizer | Performance specialist | Profiling, bottlenecks, optimization |
@seo-specialist | SEO specialist | Search optimization, meta tags, rankings |
@penetration-tester | Pen testing specialist | Red team exercises, exploit verification |
@product-manager | Product specialist | Requirements, user stories, prioritization |
@code-archaeologist | Legacy code specialist | Understanding old codebases, migrations |
@explorer-agent | Discovery specialist | Codebase exploration, dependency mapping |
| Category | Trigger Words / Phrases | Skill Activated |
|---|---|---|
| Memory | "don't use cache", "no cache", "skip memory", "fresh" | Memory opt-out |
| Research | "research my docs", "check my notebooks", "deep search", "@notebooklm" | notebooklm-rag |
| Documentation | "update docs", "regenerate catalog", "sync documentation" | documentation |
| Quality | "lint", "format", "check", "validate", "static analysis" | lint-and-validate |
| Testing | "write tests", "run tests", "TDD", "test coverage" | testing-patterns / tdd-workflow |
| TDD | "test first", "red green refactor", "failing test" | test-driven-development |
| Plan Execution | "execute plan", "run the plan", "batch execution" | executing-plans |
| Verification | "verify", "prove it works", "evidence", "show me the output" | verification-before-completion |
| Debugging | "debug", "root cause", "investigate", "why is this failing" | systematic-debugging |
| Architecture | "design system", "architecture decision", "ADR", "trade-off" | architecture |
| Security | "security scan", "vulnerability", "audit", "OWASP" | red-team-tactics |
| Performance | "lighthouse", "bundle size", "core web vitals", "profiling" | performance-profiling |
| Design | "design UI", "color scheme", "typography", "layout" | frontend-design |
| Deployment | "deploy", "rollback", "release", "CI/CD" | deployment-procedures |
| API | "REST API", "GraphQL", "tRPC", "API design" | api-patterns |
| Database | "schema design", "migration", "query optimization" | database-design |
| Planning | "plan this", "break down", "task list", "requirements" | plan-writing |
| Brainstorming | "explore options", "what are the approaches", "pros and cons" | brainstorming |
| Code Review | "review this", "code quality", "best practices" | code-review-checklist |
| i18n | "translate", "localization", "RTL", "locale" | i18n-localization |
| AWS | "terraform", "EKS", "Lambda", "S3", "CloudFront" | aws-skills / terraform-skill |
| Infrastructure | "service mesh", "Kubernetes", "Helm" | docker-expert / server-management |
| What You Want | Command / Phrase |
|---|---|
| Boot memory | python3 execution/session_boot.py --auto-fix |
| Check before a task | python3 execution/memory_manager.py auto --query "..." |
| Store a decision | python3 execution/memory_manager.py store --content "..." --type decision |
| Cache a response | python3 execution/memory_manager.py cache-store --query "..." --response "..." |
| Health check | python3 execution/memory_manager.py health |
| Skip cache for this task | Say "fresh", "no cache", or "skip memory" in your prompt |
The Full tier includes 774 community skills adapted from the Antigravity Awesome Skills project (v5.4.0) by @sickn33, distributed under the MIT License.
This collection aggregates skills from 50+ open-source contributors and organizations including Anthropic, Microsoft, Vercel Labs, Supabase, Trail of Bits, Expo, Sentry, Neon, fal.ai, and many more. For the complete attribution ledger, see SOURCES.md.
Each community skill has been adapted to the AGI framework's conventions and skill format.
If these community skills help you, consider starring the original repo or supporting the author.
| Feature | Status | Description |
|---|---|---|
| Federated Agent Memory | In design | Cross-agent knowledge sharing via project-scoped Qdrant collections. Agents working on the same project read each other's decisions, errors, and patterns, building collective intelligence across sessions and platforms. |
| Blockchain-Authenticated Memory | In design | Cryptographic trust layer for shared memory using enterprise blockchains (Hyperledger Fabric, MultiChain, or Quorum): self-hosted, no fees, no cryptocurrency. Agent writes are signed, content hashes are anchored on-chain, and access is token-gated per project. |
| Event-Driven Agent Streaming | In design | Real-time agent communication via Kafka/Flink. Agents publish decisions and observations to topics, enabling reactive workflows, e.g. a security agent triggers remediation when a vulnerability scan agent publishes findings. |
| Workflow Engine | Planned | Execute data/workflows.json playbooks as guided multi-skill sequences with progress tracking and branching logic. |
This package includes a pre-flight security scanner that checks for private terms before publishing. All templates are sanitized for public use.
If the AGI Agent Kit helps you build better AI-powered workflows, consider supporting the project:
Apache-2.0 © Elton Machado (@TechWaveDev)
Community skills in the Full tier are licensed under the MIT License. See THIRD-PARTY-LICENSES.md for details.