Big News: Socket raises $60M Series C at a $1B valuation to secure software supply chains for AI-driven development.Announcement →

ai-memory-cli

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

ai-memory-cli

AI memory layer for code editors — extract knowledge, MCP server, semantic search (Cursor, Claude Code, Windsurf, VS Code Copilot)

Source

npm

Version: 2.1.0

Version published: 4 weeks ago

Weekly downloads: 45

Maintainers: 1

Weekly downloads

Created: last month

Source

ai-memory

Turn AI chat history into a searchable, git-trackable knowledge base.

npx ai-memory-cli extract          # auto-discover conversations, extract knowledge
npx ai-memory-cli search "OAuth"   # find any memory instantly
npx ai-memory-cli rules            # generate Cursor Rules from your conventions
npx ai-memory-cli context --copy   # resume any session with full context

Extract structured knowledge from AI editor conversations (Cursor, Claude Code, Windsurf, VS Code Copilot) and save it as git-trackable Markdown files. Stop losing decisions, architecture notes, and conventions buried in chat history.

中文文档

Why ai-memory?

Every day you make dozens of decisions in Cursor or Claude Code. These decisions live in chat history and are lost when you switch machines, start a new conversation, or a teammate joins. ai-memory turns ephemeral conversations into a persistent, searchable knowledge base.

What you get	How
Structured knowledge	AI extracts decisions, architecture, conventions, TODOs, issues
Git-trackable	Plain Markdown files you commit alongside your code
Token savings	`context` command compresses thousands of conversation turns into a focused prompt — typically 90%+ token reduction vs. re-reading history
Team-aware	Per-author subdirectories, no merge conflicts
Cursor Rules export	Auto-generate `.cursor/rules/` from extracted conventions — no other tool does this
Zero config	Works out of the box with `npx`

Quick Start

# Set up API key (any OpenAI-compatible provider)
export AI_REVIEW_API_KEY=sk-...    # or OPENAI_API_KEY

# Extract knowledge from all conversations
npx ai-memory-cli extract

# Search your knowledge base
npx ai-memory-cli search "authentication"

# Generate Cursor Rules from conventions
npx ai-memory-cli rules

# Generate a context prompt and copy to clipboard
npx ai-memory-cli context --copy

# Commit to git
git add .ai-memory/ && git commit -m "chore: add ai-memory knowledge base"

Commands

`list` — Show available conversations

npx ai-memory-cli list                            # show all conversations
npx ai-memory-cli list --source cursor             # only Cursor conversations
npx ai-memory-cli list --json                      # JSON output

`extract` — Extract memories from conversations

npx ai-memory-cli extract                          # extract all conversations
npx ai-memory-cli extract --incremental             # only new conversations
npx ai-memory-cli extract --pick 3                  # only conversation #3
npx ai-memory-cli extract --pick 1,4,7              # multiple conversations
npx ai-memory-cli extract --id b5677be8             # match by ID prefix
npx ai-memory-cli extract --since "3 days ago"      # only recent conversations
npx ai-memory-cli extract --type decision,todo      # only specific types
npx ai-memory-cli extract --dry-run                 # preview without writing
npx ai-memory-cli extract --force                   # overwrite existing files
npx ai-memory-cli extract --author "alice"          # override author name
npx ai-memory-cli extract --verbose                 # show LLM request details
npx ai-memory-cli extract --json                    # JSON output (CI friendly)

`search` — Search through extracted memories

npx ai-memory-cli search "OAuth"                   # keyword search across all memories
npx ai-memory-cli search "payment" --type decision  # filter by type
npx ai-memory-cli search "auth" --author alice      # filter by author
npx ai-memory-cli search "API" --include-resolved   # include resolved memories
npx ai-memory-cli search "config" --json            # JSON output

Results are ranked by relevance (title matches > content > context) with highlighted keywords.

`rules` — Export conventions as Cursor Rules

Generate a .mdc file that Cursor automatically applies to all AI responses:

npx ai-memory-cli rules                            # generate .cursor/rules/ai-memory-conventions.mdc
npx ai-memory-cli rules --output my-rules.mdc      # custom output path
npx ai-memory-cli rules --all-authors              # include team conventions

This is the conversation-to-rules pipeline — extract conventions from chat history, auto-generate editor rules. No other tool does this.

`resolve` — Mark memories as resolved

Decisions get overturned. TODOs get completed. Keep your knowledge base fresh:

npx ai-memory-cli resolve "OAuth"                  # mark matching memories as resolved
npx ai-memory-cli resolve "OAuth" --undo           # reactivate resolved memories

Resolved memories are automatically excluded from context, summary, and search results. Use --include-resolved to force inclusion.

`summary` — Generate a project-level summary

npx ai-memory-cli summary                          # write/update SUMMARY.md
npx ai-memory-cli summary --output MEMORY.md       # custom output path
npx ai-memory-cli summary --focus "payment module"  # focus on a topic
npx ai-memory-cli summary --all-authors             # include all team members
npx ai-memory-cli summary --include-resolved        # include resolved memories

`context` — Generate a continuation prompt

For seamlessly resuming work in a new conversation or on another machine:

npx ai-memory-cli context                          # generate context block (instant, no LLM)
npx ai-memory-cli context --copy                   # generate and copy to clipboard
npx ai-memory-cli context --topic "coupon system"  # focus on a specific topic
npx ai-memory-cli context --recent 7               # only last 7 days of memories
npx ai-memory-cli context --output CONTEXT.md      # write to file
npx ai-memory-cli context --summarize              # use LLM for condensed prose summary
npx ai-memory-cli context --all-authors            # include all team members
npx ai-memory-cli context --include-resolved       # include resolved memories

`init` — Initialize configuration

npx ai-memory-cli init                             # detect editors, create config

MCP Server (NEW)

ai-memory can run as an MCP server, giving AI editors (Cursor, Claude Code) direct access to your knowledge base — no manual commands needed.

Setup

Add to your Cursor MCP config (.cursor/mcp.json):

{
  "mcpServers": {
    "ai-memory": {
      "command": "npx",
      "args": ["ai-memory-cli", "serve"]
    }
  }
}

Or for Claude Code (.claude/claude_desktop_config.json):

{
  "mcpServers": {
    "ai-memory": {
      "command": "npx",
      "args": ["ai-memory-cli", "serve"]
    }
  }
}

What the AI gets

MCP Capability	What it does
`remember` tool	AI stores decisions/conventions/todos during conversations (auto-indexed)
`recall` tool	AI retrieves relevant memories using semantic + keyword hybrid search
`search_memories` tool	Full search with type/author/resolved filtering, semantic-aware
`project-context` resource	Auto-provides project context when starting a conversation

Once configured, the AI can automatically remember important decisions and recall them in future sessions — without you running any commands.

Semantic Search

ai-memory uses hybrid search combining semantic similarity (via embeddings), keyword matching, and time decay. This means you can search by meaning, not just exact keywords.

# Build search index (uses your existing LLM API for embeddings)
npx ai-memory-cli reindex

# Now search works semantically — "database choice" finds "PostgreSQL decision"
npx ai-memory-cli search "database choice"

The MCP recall and search_memories tools use hybrid search automatically. Embeddings are stored locally in .ai-memory/.embeddings.json and auto-indexed when using the remember tool.

Manual start (for testing)

npx ai-memory-cli serve           # start MCP server
npx ai-memory-cli serve --debug   # with debug logging

Watch Mode (NEW)

Automatically extract knowledge when conversations change — zero manual effort:

npx ai-memory-cli watch

Watch mode monitors all detected sources for new conversation activity and runs extraction automatically. It uses file system events (for Cursor/Claude Code) and periodic polling (for all sources) to detect changes.

ai-memory watch — auto-extract on conversation changes

   Author: conor
   Output: .ai-memory/
   [+] Watching: Cursor
   [+] Watching: Claude Code

Initial scan complete — watching for changes...

10:15:32 [Cursor] "OAuth refactor discussion" (+8 turns) — extracting...
10:15:37 [+] 2 decision, 1 convention

Press Ctrl+C to stop.

Dashboard (NEW)

Browse, search, and visualize your knowledge base in a local web UI:

npx ai-memory-cli dashboard

Opens http://localhost:3141 with:

Overview — stats cards, monthly timeline chart, author breakdown, recent activity
Memory browser — real-time search, filter by type/author/status, detail modal
Knowledge graph — interactive D3.js force-directed graph (nodes colored by type, edges by shared conversation or keywords)
Export — download as JSON, Obsidian vault (with YAML frontmatter), or copy to clipboard

npx ai-memory-cli dashboard --port 8080   # custom port

Local LLM Support (NEW)

Use Ollama or LM Studio instead of cloud APIs — no API key needed:

Ollama

# Install Ollama: https://ollama.ai
ollama pull llama3.2              # download a model
ollama pull nomic-embed-text      # (optional) for semantic search

export OLLAMA_HOST=http://localhost:11434
export OLLAMA_MODEL=llama3.2      # extraction model
npx ai-memory-cli extract

LM Studio

# Start LM Studio and load a model
export LM_STUDIO_BASE_URL=http://localhost:1234/v1
export LM_STUDIO_MODEL=your-model-name
npx ai-memory-cli extract

Cloud API keys always take priority over local LLM. If you have OPENAI_API_KEY or AI_REVIEW_API_KEY set, those will be used.

Variable	Description
`OLLAMA_HOST`	Ollama server URL (default: `http://localhost:11434`)
`OLLAMA_MODEL`	Model for extraction (default: `llama3.2`)
`OLLAMA_EMBEDDING_MODEL`	Model for semantic search (default: `nomic-embed-text`)
`LM_STUDIO_BASE_URL`	LM Studio server URL (default: `http://localhost:1234/v1`)
`LM_STUDIO_MODEL`	Model name

Supported Sources

Source	Data location	Status
Cursor	`~/.cursor/projects/{name}/agent-transcripts/`	Stable
Claude Code	`~/.claude/projects/{path}/*.jsonl`	Stable
Windsurf	`~/AppData/Windsurf/User/workspaceStorage/*/state.vscdb`	Beta
VS Code Copilot	`~/AppData/Code/User/workspaceStorage//chatSessions/.json`	Beta

Typical Workflow

First extraction

npx ai-memory-cli list                    # see what conversations are available
npx ai-memory-cli extract                 # extract everything (few minutes on first run)
npx ai-memory-cli rules                   # generate Cursor Rules
git add .ai-memory/ .cursor/rules/
git commit -m "chore: add ai-memory knowledge base"

Daily use (incremental)

npx ai-memory-cli extract --incremental   # after a productive coding session
npx ai-memory-cli rules                   # refresh Cursor Rules
git add .ai-memory/ && git commit -m "chore: update memories"

Starting a new conversation

npx ai-memory-cli context --copy          # copy context to clipboard
# Paste into new Cursor/Claude Code session

The output looks like:

## Project Context

### Key Decisions (follow without re-discussion)
- **Use OAuth Bridge pattern**: WebView cannot receive redirect directly...

### Conventions (always follow)
- **Never call getServerSideProps in this project**: ...

### Active TODOs
- [ ] Add retry logic to payment webhook handler

Finding specific knowledge

npx ai-memory-cli search "payment"        # find all payment-related memories
npx ai-memory-cli search "auth" --type decision  # only decisions about auth

Team Workflow

When multiple people use ai-memory in the same git repo, each person's memories are automatically stored in their own subdirectory.

How it works

Author identity is auto-detected (priority: --author CLI flag > config.author > git config user.name > OS username). No manual setup needed.

.ai-memory/
├── conor/
│   ├── decisions/
│   │   └── 2026-04-15-oauth-bridge.md
│   └── todos/
│       └── 2026-04-15-add-retry.md
├── alice/
│   ├── decisions/
│   │   └── 2026-04-16-payment-design.md
│   └── architecture/
│       └── 2026-04-16-module-split.md
└── .config.json

Usage

# Everyone extracts normally — writes to their own directory
npx ai-memory-cli extract --incremental

# Generate your own context (default: only your memories)
npx ai-memory-cli context --copy

# Include the whole team's memories
npx ai-memory-cli summary --all-authors
npx ai-memory-cli context --all-authors --copy

# Override author name
npx ai-memory-cli extract --author "alice"

Upgrading existing projects

Memories created before v1.3 are stored in flat directories (.ai-memory/decisions/). After upgrading:

Old files are still read normally (backwards compatible), with author empty
New extractions go to .ai-memory/{author}/decisions/ etc.
No manual migration required

Cross-Device Workflow

Work machine                                   Home machine
────────────                                   ────────────
Cursor / Claude Code dev work
        -> npx ai-memory-cli extract --incremental
        -> git add .ai-memory/
git commit && git push
                                               git pull
                                               -> npx ai-memory-cli context --topic "today's work"
                                               -> Paste context into new conversation
                                               -> Seamlessly resume

Configuration

ai-memory works with zero config. To customize, run npx ai-memory-cli init or create .ai-memory/.config.json manually:

{
  "sources": {
    "cursor": { "enabled": true, "projectName": "my-project" },
    "claudeCode": { "enabled": true },
    "windsurf": { "enabled": true },
    "copilot": { "enabled": true }
  },
  "extract": {
    "types": ["decision", "architecture", "convention", "todo", "issue"],
    "ignoreConversations": [],    // conversation UUIDs to skip
    "minConversationLength": 5   // skip very short conversations
  },
  "output": {
    "dir": ".ai-memory",
    "summaryFile": "SUMMARY.md",
    "language": "zh"             // "zh" or "en" — output language for summaries
  },
  "model": "",                   // leave empty for auto-selection
  "author": ""                   // leave empty to auto-detect from git config user.name
}

Environment variables

Variable	Description
`AI_REVIEW_API_KEY`	API key (preferred, shared with ai-review-pipeline)
`OPENAI_API_KEY`	OpenAI API key
`OPENAI_BASE_URL`	Custom OpenAI-compatible API base URL
`OPENAI_MODEL`	Model override for OpenAI
`ANTHROPIC_API_KEY`	Anthropic API key (requires compatible proxy)
`ANTHROPIC_BASE_URL`	Anthropic proxy base URL
`AI_REVIEW_BASE_URL`	Custom API base URL
`AI_REVIEW_MODEL`	Model to use (default: `gpt-4o-mini`)
`OLLAMA_HOST`	Ollama server URL (default: `http://localhost:11434`)
`OLLAMA_MODEL`	Ollama model for extraction
`OLLAMA_EMBEDDING_MODEL`	Ollama model for semantic search embeddings
`LM_STUDIO_BASE_URL`	LM Studio API URL
`LM_STUDIO_MODEL`	LM Studio model name

Output Structure

Each memory is its own file, organized by author and type:

.ai-memory/
├── SUMMARY.md                              # Project summary (from `summary` command)
├── conor/                                  # Per-author subdirectory
│   ├── decisions/
│   │   ├── 2026-04-12-oauth-bridge-pattern.md
│   │   └── 2026-04-13-async-job-queue-design.md
│   ├── architecture/
│   │   └── 2026-04-10-payment-module-design.md
│   ├── conventions/
│   │   └── 2026-04-08-coding-conventions.md
│   ├── todos/
│   │   └── 2026-04-12-add-retry-logic.md
│   └── issues/
│       └── 2026-04-11-sqlite-locking-fix.md
├── .index/                                 # Extraction index (auto-managed)
├── .config.json                            # Configuration (commit this)
└── .state.json                             # Extraction state (add to .gitignore)

Add .ai-memory/.state.json to .gitignore — it tracks which conversations have been processed and is machine-specific.

CI Integration

# .github/workflows/memory.yml
- name: Extract AI memories
  run: npx ai-memory-cli extract --incremental --json
  env:
    AI_REVIEW_API_KEY: ${{ secrets.AI_REVIEW_API_KEY }}

Requirements

Node.js >= 18
An API key for any OpenAI-compatible provider, or a local LLM (Ollama / LM Studio)

Tip: Node.js 22+ enables richer conversation titles by reading Cursor/Windsurf's database. On Node 18-20, titles are extracted from the first message (still works fine).

License

MIT — Conor Liu

Keywords

FAQs

What is ai-memory-cli?

Is ai-memory-cli popular?

Is ai-memory-cli well maintained?

Package last updated on 22 Apr 2026

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

ai-memory-cli

ai-memory

Why ai-memory?

Quick Start

Commands

list — Show available conversations

extract — Extract memories from conversations

search — Search through extracted memories

rules — Export conventions as Cursor Rules

resolve — Mark memories as resolved

summary — Generate a project-level summary

context — Generate a continuation prompt

init — Initialize configuration

MCP Server (NEW)

Setup

What the AI gets

Semantic Search

Manual start (for testing)

Watch Mode (NEW)

Dashboard (NEW)

Local LLM Support (NEW)

Ollama

LM Studio

Supported Sources

Typical Workflow

First extraction

Daily use (incremental)

Starting a new conversation

Finding specific knowledge

Team Workflow

How it works

Usage

Upgrading existing projects

Cross-Device Workflow

Configuration

Environment variables

Output Structure

CI Integration

Requirements

License

Keywords

Related posts

Socket Raises $60M Series C at a $1B Valuation to Help Enterprises Build Securely With AI

Popular Go Decimal Library Targeted by Long-Running Typosquat with DNS Backdoor

`list` — Show available conversations

`extract` — Extract memories from conversations

`search` — Search through extracted memories

`rules` — Export conventions as Cursor Rules

`resolve` — Mark memories as resolved

`summary` — Generate a project-level summary

`context` — Generate a continuation prompt

`init` — Initialize configuration