
Security News
Frontier AI Is Now Critical Infrastructure
The Fable shutdown shows how quickly model access can become a business continuity risk for AI-dependent engineering teams.
lyrie-agent
Advanced tools
Affected versions:
Pentests apps. Defends agents. Researches binaries. Trains itself. One daemon.
Install · Quick Start · Capabilities · ATP · Architecture · Shield Doctrine · Research
🌐 Localized: العربية · Deutsch · Español · Français · 日本語 · Português · 简体中文
OTT Cybersecurity LLC builds the security infrastructure of the AI era. We're not a startup adding a security feature to an AI product. We are a cybersecurity company that builds AI — the distinction matters.
Every AI agent platform treats security as an afterthought. We treat it as the foundation. Every advisory we publish on lyrie.ai/research is backed by a reproducible exploit lab and detection rules in this repo.
The internet needed TLS before e-commerce could exist. AI agents need a trust protocol before the agentic economy can.
We are building it.
The Agent Trust Protocol (ATP) — the first open cryptographic standard for AI agent identity, scope, and action verification — is authored by Lyrie and implemented here as the reference. Five primitives define what it means for an AI agent to be trustworthy: who it is, what it's authorized to do, what it did, whether it was tampered with, and how trust flows when it spawns children.
Lyrie is the reference implementation. The spec goes to IETF. Every organization running agents on ATP gets a verifiable attestation — the padlock for the AI era.
Beyond ATP, Lyrie is the only stack that fuses:
lyrie hack <target> runs end-to-end autonomous pentesting with GPU-accelerated adversarial attacks (GCG on H200, 140GB HBM3e)All of this in one daemon. Zero required cloud dependencies.
pip install lyrie-agent
# or
curl -fsSL https://lyrie.ai/install.sh | sh
lyrie init # one-time setup wizard
lyrie hack ./myapp # run your first pentest
lyrie daemon --threat-watch # start always-on threat detection
lyrie migrate --from openclaw # import from another agent platform
lyrie hack <target> — one command, end-to-end autonomous pentest.
lyrie hack ./myapp # local source tree
lyrie hack https://staging.example.com # live target
lyrie hack ./myapp --stage-e --approve # generate + approve runnable PoCs
lyrie hack ./myapp --output sarif # SARIF for GitHub Code Scanning
Example output:
[lyrie] Stage A: Attack Surface Mapping
→ 14 entry points, 3 trust boundaries, 2 tainted data flows
→ Risk hotspots: /api/upload (critical), /admin/exec (critical)
[lyrie] Stage B: Scanning
→ Nuclei: 3 critical, 7 high
→ Semgrep: 11 findings (2 CWE-89, 4 CWE-79, 5 CWE-22)
→ TruffleHog: 1 secret found in git history
→ Trivy: 0 known-vulnerable binaries (hash-verified)
[lyrie] Stage C: Validation
→ Confirmed: 3 critical, 5 high (false-positive rate: 12%)
[lyrie] Stage D: AAV (Adversarial Validation)
→ GCG suffix attack: BYPASS on /api/chat (H200, 140GB HBM3e)
→ AutoDAN black-box: 2/4 jailbreaks successful
→ Crescendo escalation: content policy bypassed in 4 turns
[lyrie] Stage E: Exploit
→ SQLi PoC generated: exploits/sqli-001.py [PENDING APPROVAL]
→ XSS PoC generated: exploits/xss-001.js [PENDING APPROVAL]
[lyrie] Stage F: Remediation
→ Patches generated: 5 files, unified diff in report/patches/
[lyrie] Self-scan complete. Lyrie's own output passed Shield gate.
What each stage validates:
GPU red-team:
lyrie redteam <endpoint> --strategy gcg # gradient-based suffix attack (H200, 140GB HBM3e)
lyrie redteam <endpoint> --strategy autodan # genetic algorithm black-box (H100, 80GB HBM3)
lyrie redteam <endpoint> --preset state-actor --dry-run # nation-state attack corpus
lyrie redteam <endpoint> --preset entra # Microsoft Entra AI priv-esc (4 vectors)
Multi-turn jailbreaks (no GPU required):
lyrie redteam <endpoint> --strategy crescendo # 4 escalation styles, HarmBench baselines
lyrie redteam <endpoint> --strategy tap # tree-of-attacks with pruning
Shield Doctrine — every layer of Lyrie that touches untrusted text passes a Shield gate. Input, output, memory, patches, skills, tool calls. Zero exceptions. See docs/shield-doctrine.md.
Agent Trust Protocol (ATP) — the cryptographic padlock for the AI era. Ed25519-signed agent identity, action receipts, scope declarations, trust chain rules. Every agent spawned by Lyrie carries a verifiable ATP badge. See ATP section.
Microsoft AGT Bridge — @lyrie/agt-bridge sits on top of Microsoft's Agent Governance Toolkit. 10/10 OWASP ASI 2026 with AGT. 7/10 standalone. Graceful degradation when AGT unavailable.
MCP Security Scanner — 8 pre-connection checks on every MCP server before Lyrie connects:
Memory Integrity — SHA-256 drift detection on every memory block. OWASP ASI-06 defense.
lyrie memory integrity-check # detect tampered memories
lyrie memory integrity-check --fix # quarantine and repair
Tools-Catalog Enforcement — every tool call passes a risk-policy gate:
critical → block + require operator approvalhigh → audit log mandatorymedium → rate-limited + allowlist-checkedlyrie tools audit # full risk assessment of installed tools
A2A Message Bus — agents query each other mid-flight via Shield-filtered pub/sub. No unverified cross-agent calls.
Daemon mode — always-on, self-healing, KAIROS tick loop:
lyrie daemon # basic daemon
lyrie daemon --threat-watch # continuous threat detection
lyrie daemon --threat-watch --self-heal # auto-recover from detected threats
lyrie daemon --threat-watch --self-heal --provider hermes # local-first, zero cloud
lyrie daemon --interval 5m # custom KAIROS tick interval
Cron management:
lyrie cron list # all scheduled jobs
lyrie cron add "0 2 * * *" "lyrie evolve dream" # nightly self-improvement
lyrie cron add "*/5 * * * *" "lyrie daemon --threat-watch"
lyrie cron disable <id> # pause without removing
lyrie cron logs <id> # execution history
Skills library — 126+ skills:
lyrie skills list # browse full library
lyrie skills search "web scraping" # find relevant skills
lyrie skills install <skill-id> # add to agent
lyrie skills run <skill-id> # execute directly
Lyrie's skill loader imports existing skill libraries (OpenClaw, Claude Code, AutoGPT) on lyrie migrate. Skills are Shield-gated before execution.
Sub-agent orchestration:
# Spawn isolated child agent (programmatic)
spawn_subagent({
task: "Scan this endpoint for SQLi and return findings as JSON",
model: "anthropic/claude-sonnet-4-6",
context: "isolated" # or "fork" to inherit parent context
})
Child agents carry ATP badges from their parent. Trust chain is cryptographically verifiable.
WorkspaceContext — every agent turn loads SOUL.md, AGENTS.md, MEMORY.md for persistent identity and rules across sessions. No cold starts.
Model routing — 15 models, task-aware:
lyrie models list # all registered models + health
lyrie models health # live status check
lyrie models route # show current routing table
| Task | Routes to |
|---|---|
| Code implement/refactor | GPT-5.4-Codex |
| Bulk/parallel ops | MiniMax-M2.5-HS |
| Strategy/architecture | Grok (reasoning) |
| Local/privacy-first | Hermes-3 (NousResearch) |
| Default | claude-sonnet-4-6 |
| Free tier fallback | NVIDIA NIM (134 models) |
Migration from any agent platform:
lyrie migrate --from openclaw # ports memory, skills, config
lyrie migrate --from claude-code # imports MCP servers + provider keys
lyrie migrate --from cursor # imports model config + extensions
lyrie migrate --from hermes # ports skills + trajectory
lyrie migrate --from autogpt # ports goals + memory
lyrie migrate --from all # auto-detect all installed platforms
lyrie migrate --from claude-code --secure # import + CVE check MCP servers
lyrie migrate --detect --dry-run # preview what would be imported
Supported: openclaw, claude-code, cursor, hermes, autogpt, nanoclaw, zeroclaw, dify, superagi, nanobot, grip-ai
All 9 tools are Shield-gated. Every call passes input/output validation before execution.
| Tool | What it does |
|---|---|
| exec | Unified shell + process manager. Auto risk detection — critical commands require approval. Supports TTY, background sessions, stdin/stdout streaming. |
| browser | Full CDP automation. Connects to a running Chrome instance via 127.0.0.1:9223. Zero timeout bugs that plagued previous adapters. Screenshot, click, type, evaluate, multi-tab. |
| web_search | Brave Search API. 1-hour result cache, domain deduplication, region/language support. |
| web_fetch | HTML → markdown extraction via readability. 30-minute cache. Handles SPAs, paywalls, JS-heavy pages. |
| message | Proactive sends to Telegram, Discord, Slack, Matrix, Feishu, IRC, and 7 more channels. Supports inline buttons, reactions, thread replies. |
| memory_store | Persistent memory with auto-categorization (preference/fact/decision/entity). Deduplication, TTL, importance scoring. |
| memory_recall | BM25-ranked semantic search over stored memories. Returns top-k by relevance. |
| image_generate | H200 local Stable Diffusion first, OpenAI fallback. Transparent backgrounds, multiple aspect ratios, edit mode. |
| tts | OpenAI TTS. Default voice: nova (warm, clear). Onyx for dramatic narration. |
| spawn_subagent | Spawn child agents in isolated or fork mode. ATP-badged. Results auto-announced to parent. |
Service installation:
lyrie service install # launchd (macOS) or systemd (Linux) — starts on boot
lyrie service status # health + uptime
lyrie service logs # tail daemon logs
lyrie service uninstall # clean removal
LyrieEvolve — self-improving agent:
lyrie evolve status # skill library stats + last dream cycle
lyrie evolve dream # score → extract → prune → train
lyrie evolve extract # pull reusable skills from recent sessions
lyrie evolve stats # domain breakdown: cyber / seo / trading / code
lyrie evolve train --export atropos # GRPO fine-tuning on H200
See docs/evolve.md and docs/h200-training.md.
AI Governance:
lyrie governance assess --interactive # NIST AI RMF 8-question assessment
lyrie governance assess --config agent.json # auto-infer from config
lyrie governance permissions ./tools.json # scan tool permissions for risk
Environment diagnostics:
lyrie doctor # full self-diagnostic (env, channels, security)
lyrie doctor --json # machine-readable for CI
ATP is the first open cryptographic standard for AI agent identity, scope, and action verification. Authored by OTT Cybersecurity LLC. IETF-draft quality RFC. packages/atp/
Five primitives:
| Primitive | File | What it proves |
|---|---|---|
| Agent Identity Certificates | aic.ts | Who this agent is (Ed25519 keypair, issuer chain) |
| Action Receipts | receipt.ts | What it did (signed log of every tool call) |
| Scope Declaration Language | scope.ts | What it's authorized to do (SDL enforcement) |
| Trust Chain Rules | trust-chain.ts | How trust flows to spawned children |
| Breach Attestation | breach.ts | Cryptographically signed incident record |
ATP Badge — every agent running on Lyrie carries a verifiable badge:
lyrie atp verify <agent-id> # verify identity + scope
lyrie atp receipt <session-id> # audit trail for any session
lyrie atp badge --show # display this agent's current badge
The IETF spec is in packages/atp/RFC-DRAFT.md. Organizations running agents on ATP get a verifiable attestation — machine-readable proof that an AI acted within its declared scope.
LyrieAAV attacks deployed AI agents and LLMs to find security vulnerabilities before adversaries do.
Attack corpus: 200+ vectors across OWASP LLM Top 10 + OWASP ASI 2026.
GPU attacks (white-box and black-box):
packages/core/src/aav/strategies/gcg.ts) — gradient-based adversarial suffix generation on H200 (140GB HBM3e). White-box access required. Finds universal suffixes that transfer across models.packages/core/src/aav/strategies/autodan.ts) — genetic algorithm jailbreak search. Black-box. No model access needed. Generates human-readable bypass prompts.Multi-turn strategies (no GPU):
OWASP LLM Top 10 + ASI 2026 coverage:
| OWASP ID | Category | Lyrie Coverage |
|---|---|---|
| LLM01 | Prompt Injection | GCG, AutoDAN, Crescendo, TAP |
| LLM02 | Insecure Output Handling | Stage C validation, Shield output gate |
| LLM03 | Training Data Poisoning | LyrieEvolve GRPO audit |
| LLM04 | Model DoS | Rate-limit enforcement |
| LLM05 | Supply Chain | Trivy binary hash verification |
| LLM06 | Sensitive Info Disclosure | TruffleHog, Shield output scan |
| LLM07 | Insecure Plugin Design | MCP Scanner (8 checks) |
| LLM08 | Excessive Agency | Scope Declaration Language |
| LLM09 | Overreliance | Human approval gates |
| LLM10 | Model Theft | ATP breach attestation |
| ASI-06 | Memory Tampering | SHA-256 drift detection |
Full docs: docs/aav.md
The deepest open-source offensive research stack on GitHub. Used internally by Lyrie's threat research team.
Tier 1 (shipped):
rr replay of crashes. Same crash, every time. Root cause in minutes, not hours.lyrie omega analyze <binary> # full Omega-Suite run
lyrie omega rop <binary> # ROP gadget discovery
lyrie omega smt <binary> <crash-input> # exploit feasibility via Z3
lyrie omega replay <crash-log> # rr deterministic replay
Lyrie is the only autonomous agent that gets measurably better at your specific workloads over time.
The loop:
lyrie evolve status # library stats + last dream timestamp
lyrie evolve extract # pull skills from recent sessions
lyrie evolve dream # full nightly cycle: score → extract → prune → summarize
lyrie evolve train --export atropos # export H200-ready GRPO training data
lyrie evolve stats # domain breakdown
Training pipeline: docs/h200-training.md. Atropos-compatible export for direct GRPO runs.
| Capability | Lyrie | General Agent Frameworks | Security Scanners |
|---|---|---|---|
| Autonomous pentest (A–F) | ✅ Full | ❌ | Partial |
| Agent Trust Protocol | ✅ Authored | ❌ | ❌ |
| GPU adversarial attacks | ✅ H200 + H100 | ❌ | ❌ |
| Local model first | ✅ Hermes-3 | Varies | ❌ |
| Built-in cron management | ✅ | ❌ | ❌ |
| 126+ skills library | ✅ | Varies | ❌ |
| Self-improving (GRPO) | ✅ H200 | ❌ | ❌ |
| OWASP ASI 2026 | ✅ 10/10 | ❌ | Partial |
| MCP security scanning | ✅ 8 checks | ❌ | ❌ |
| Background daemon | ✅ launchd/systemd | Varies | ❌ |
| Code-level remediation | ✅ | ❌ | ❌ |
| Binary exploit research | ✅ Omega-Suite | ❌ | Partial |
| 15-model task routing | ✅ | Varies | ❌ |
| Sub-agent orchestration | ✅ ATP-badged | Varies | ❌ |
| Memory integrity checks | ✅ SHA-256 | ❌ | ❌ |
| 9 built-in tools | ✅ Shield-gated | Varies | ❌ |
lyrie-agent/
├── packages/
│ ├── core/ # Engine: tools, AAV, Shield, cron, skills, models
│ │ └── src/
│ │ ├── tools/ # 9 built-in tools (browser, exec, web, memory, media, message, spawn-subagent)
│ │ ├── aav/ # Adversarial validation (GCG, AutoDAN, Crescendo, TAP)
│ │ ├── security/ # MCP scanner, provider validator
│ │ ├── cron/ # Cron manager
│ │ └── skills/ # Skill loader, registry, runner, search
│ ├── atp/ # Agent Trust Protocol (AIC, receipt, scope, trust-chain, breach)
│ ├── shield/ # Rust Shield binary (JSON-RPC, file-write scan, outbound WAF)
│ └── gateway/ # Channel adapters (Telegram, Discord, Slack, Matrix, IRC, Feishu, ...)
├── deploy/
│ └── oss-scan/ # Dockerized public scanner (research.lyrie.ai/scan)
├── docs/
│ ├── shield-doctrine.md
│ ├── aav.md
│ ├── evolve.md
│ ├── h200-training.md
│ └── brand-guide.md
└── scripts/ # redteam.ts, scan.ts, evolve.ts
Design principles:
pip install lyrie-agent and you're running# .github/workflows/lyrie-pentest.yml
- uses: OTT-Cybersecurity-LLC/lyrie-pentest-action@v1
with:
target: ./
fail-on: high
upload-sarif: true
Shield-scans every PR. Posts a single-comment-per-PR Markdown summary. Uploads SARIF to GitHub Code Scanning. Blocks merges on threshold.
1,726 tests / 0 failures / 5,672 expect() calls
94 test files — bun test --reporter=dot
Every shipped feature has tests. No stubs masquerading as implementations. If Stage E generates PoCs, there are tests that verify runnable output. If the GRPO pipeline exports Atropos data, there are tests that verify the schema.
See CONTRIBUTING.md. New CVE labs follow tools/exploit-lab/LAB-PROTOCOL.md.
Code of Conduct: CODE_OF_CONDUCT.md. PRs that weaponize Lyrie tooling against unconsenting targets are rejected.
See SECURITY.md. Responsible disclosure: security@lyrie.ai
Cybersecurity isn't a feature here — it's the product.
MIT — OTT Cybersecurity LLC. Use it, fork it, build on it.
Lyrie.ai — Built by OTT Cybersecurity LLC
Research · @lyrie_ai · LinkedIn · lyrie.ai · overthetop.ae
© 2026 OTT Cybersecurity LLC. All rights reserved.
FAQs
The world's first autonomous AI agent with built-in cybersecurity
We found that lyrie-agent demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Security News
The Fable shutdown shows how quickly model access can become a business continuity risk for AI-dependent engineering teams.

Security News
AI agents are pulling packages into environments no scanner is watching, creating exposure before security teams can see it.

Security News
GitHub Actions checkout now blocks risky pull_request_target checkouts by default to help prevent pwn request supply chain attacks.