wikimem

Self-improving wiki IDE. Ingest anything (13 formats), query with any LLM, three automations: ingest, scrape, improve.

Version 0.8.6 (latest on npm) · 1 maintainer · 42 weekly downloads (-17.65%)

npx wikimem@latest

What is WikiMem?

WikiMem takes Karpathy's LLM wiki concept and turns it into a full IDE. Drop any file — PDF, audio, video, slides, spreadsheet, URL — and watch it compile into structured, interlinked wiki pages via Claude, GPT-4o, or Ollama. Three automations keep your knowledge base growing and self-improving while you sleep.

raw/                        wiki/
  paper.pdf                   index.md ........... content catalog
  podcast.mp3    ──LLM──>    sources/paper.md ... summary + citations
  screenshot.png              entities/openai.md . people, orgs, tools
  meeting.docx                concepts/rag.md .... ideas + frameworks
  blog-url                    syntheses/ ......... cross-cutting analysis

Quick Start

# Create a vault and start the IDE
npx wikimem init my-wiki
cd my-wiki
npx wikimem serve

Open http://localhost:3141. That's it — you have a running wiki IDE.

# Or ingest from the CLI (run via npx wikimem if the package is not installed globally)
wikimem ingest paper.pdf
wikimem ingest https://en.wikipedia.org/wiki/Large_language_model
wikimem query "What are the key themes across my sources?"

Features

13+ Format Ingestion

Drop anything. WikiMem detects the file type, runs the right processor, and produces wiki pages with cross-references and citations.

| Format | Extensions | Processor |
| --- | --- | --- |
| Text | .md, .txt | Direct read |
| Structured | .json, .csv, .yaml | Schema-aware extraction |
| PDF | .pdf | Built-in text extraction |
| Office | .docx, .pptx, .xlsx | Document parsing |
| HTML | .html, .htm | Tag stripping + content extraction |
| Image | .png, .jpg, .gif, .webp | Claude Vision description |
| Audio | .mp3, .wav, .m4a, .ogg, .flac | Whisper / Deepgram transcription |
| Video | .mp4, .mov, .avi, .mkv, .webm | ffmpeg → Whisper transcription |
| URL | https://... | Firecrawl / fetch → markdown |
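The detection step described above can be pictured as a lookup from file extension to processor. The following is a minimal sketch, not WikiMem's actual code: the extension groups are copied from the table, while the processor labels and the `pick_processor` helper are hypothetical names chosen for illustration.

```python
from pathlib import Path
from urllib.parse import urlparse

# Extension groups copied from the format table. The labels on the left are
# illustrative names, not WikiMem's internal identifiers.
PROCESSORS = {
    "text": {".md", ".txt"},
    "structured": {".json", ".csv", ".yaml"},
    "pdf": {".pdf"},
    "office": {".docx", ".pptx", ".xlsx"},
    "html": {".html", ".htm"},
    "image": {".png", ".jpg", ".gif", ".webp"},
    "audio": {".mp3", ".wav", ".m4a", ".ogg", ".flac"},
    "video": {".mp4", ".mov", ".avi", ".mkv", ".webm"},
}


def pick_processor(source: str) -> str:
    """Return a processor label for a file path or URL (hypothetical helper)."""
    # Assumption: anything with an http(s) scheme is treated as a URL source.
    if urlparse(source).scheme in ("http", "https"):
        return "url"
    suffix = Path(source).suffix.lower()
    for name, extensions in PROCESSORS.items():
        if suffix in extensions:
            return name
    raise ValueError(f"unsupported source: {source}")
```

Matching on the lowercased suffix keeps the lookup case-insensitive, so `Podcast.MP3` and `podcast.mp3` resolve to the same processor.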

Knowledge Graph

D3-powered interactive force-directed graph. Click a node to highlight its neighbors, double-click to open. Community detection clusters related pages. Hub nodes sized by connection count.
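To make "hub nodes sized by connection count" concrete, here is a small sketch that derives a per-page connection count from `[[wikilinks]]`. It is an illustration under assumptions: pages are passed in as a name-to-text mapping, links are treated as undirected, and the `node_radius` formula with its `base` and `step` values is invented for the example rather than taken from WikiMem.

```python
import re

# Captures the target of [[target]], [[target|label]] or [[target#heading]].
WIKILINK = re.compile(r"\[\[([^\]|#]+)")


def connection_counts(pages: dict[str, str]) -> dict[str, int]:
    """Count distinct neighbours per page, treating links as undirected."""
    neighbours = {name: set() for name in pages}
    for name, text in pages.items():
        for target in WIKILINK.findall(text):
            target = target.strip()
            # Links to pages that do not exist are ignored for sizing purposes.
            if target in pages and target != name:
                neighbours[name].add(target)
                neighbours[target].add(name)
    return {name: len(linked) for name, linked in neighbours.items()}


def node_radius(degree: int, base: float = 4.0, step: float = 1.5) -> float:
    """Hypothetical sizing rule: radius grows linearly with connection count."""
    return base + step * degree
```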

Time-Lapse

Watch your knowledge base grow commit-by-commit. Every wiki change is checkpointed in git — scrub through the timeline to see pages appear, links form, and the graph densify.

WYSIWYG Editing

Click any wiki page to edit it inline. Markdown shortcuts, live preview, Cmd+S to save. Changes are auto-committed to git.

Three Automations

| Automation | Trigger | What it does |
| --- | --- | --- |
| Ingest | File watcher on raw/ | New file detected → process → wiki pages → git commit |
| Scrape | Cron schedule or manual | RSS feeds, GitHub trending, URLs → fetch → deposit in raw/ → triggers Ingest |
| Observe | Nightly or manual | LLM Council scores wiki quality (coverage, consistency, cross-linking, freshness, organization) → proposes and applies improvements |
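One way to picture the Observe step is as an aggregation of per-reviewer scores across the five quality dimensions, followed by a threshold check. The sketch below is purely illustrative: the 0-100 scale, the use of a plain mean, and the rule "improve when the overall score is below the threshold" are all assumptions, and `council_score` and `needs_improvement` are hypothetical names.

```python
from statistics import mean

# The five dimensions named in the automations table.
DIMENSIONS = ("coverage", "consistency", "cross_linking", "freshness", "organization")


def council_score(reviews: list[dict[str, float]]) -> dict[str, float]:
    """Average each dimension across reviewers, then average the dimensions."""
    scores = {dim: mean(review[dim] for review in reviews) for dim in DIMENSIONS}
    scores["overall"] = mean(scores[dim] for dim in DIMENSIONS)
    return scores


def needs_improvement(scores: dict[str, float], threshold: float = 80.0) -> bool:
    """Assumed rule: trigger an improvement pass when overall falls below threshold."""
    return scores["overall"] < threshold
```

A per-dimension breakdown is kept alongside the overall number so that a proposal step could target the weakest dimension first.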

Git Checkpointing

Every change committed automatically. Browse history, restore snapshots, see diffs. Your wiki is a git repo from day one.
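Because the vault is an ordinary git repository, the checkpoint history can be read with plain `git log`. The sketch below shows one way to turn that history into an oldest-first timeline. It is not WikiMem's implementation: the `Checkpoint` type, the helper names, and the choice to restrict the log to the `wiki` directory are assumptions for illustration.

```python
import subprocess
from datetime import datetime
from typing import NamedTuple


class Checkpoint(NamedTuple):
    commit: str
    when: datetime
    subject: str


# Tab-separated: full hash, committer date (strict ISO 8601), subject line.
LOG_FORMAT = "%H%x09%cI%x09%s"


def parse_log(output: str) -> list[Checkpoint]:
    """Parse `git log` output in LOG_FORMAT and return checkpoints oldest first."""
    checkpoints = []
    for line in output.splitlines():
        if not line.strip():
            continue
        commit, when, subject = line.split("\t", 2)
        checkpoints.append(Checkpoint(commit, datetime.fromisoformat(when), subject))
    return checkpoints[::-1]  # git log prints newest first


def wiki_timeline(vault: str) -> list[Checkpoint]:
    """Read the history of the wiki directory inside the given vault."""
    result = subprocess.run(
        ["git", "-C", vault, "log", f"--pretty=format:{LOG_FORMAT}", "--", "wiki"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_log(result.stdout)
```

Keeping the parser separate from the subprocess call makes it possible to test the timeline logic without a repository on disk.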

Pipeline Visualization

See exactly how your document flows through the system — file detection, text extraction, LLM processing, page generation, cross-linking, indexing — step by step in the web UI.

Connectors

Sync external sources into your vault automatically.

| Connector | Status |
| --- | --- |
| Local folders | ✅ Shipped |
| Git repos | ✅ Shipped |
| GitHub | ✅ Shipped |
| Webhooks | ✅ Shipped |
| Slack | 🔜 Coming soon |
| Gmail | 🔜 Coming soon |

MCP Server

Use WikiMem as a tool inside Claude Code, Cursor, or any MCP-compatible client.

wikimem mcp

Multiple LLMs

| Provider | Flag | Default Model |
| --- | --- | --- |
| Claude | -p claude | claude-sonnet-4-20250514 |
| OpenAI | -p openai | gpt-4o |
| Ollama | -p ollama | llama3.2 |

Ollama runs fully local — no API keys, no network, no data leaves your machine.

CLI Reference

| Command | Description |
| --- | --- |
| wikimem init [dir] | Create a new vault (--template research\|business\|codebase, --from-folder, --from-repo) |
| wikimem serve | Start the web IDE on port 3141 |
| wikimem ingest <source> | Process a file or URL into wiki pages |
| wikimem search <term> | BM25 full-text search across wiki pages |
| wikimem ask <question> | Ask a question, get an answer from your wiki |
| wikimem query <question> | Ask a question and optionally save as synthesis page (--file) |
| wikimem lint | Health-check: orphan pages, broken links, missing summaries (--fix) |
| wikimem status | Vault statistics: pages, words, sources, links, orphans |
| wikimem watch | Auto-ingest files dropped into raw/ |
| wikimem scrape | Fetch from configured RSS/GitHub/URL sources |
| wikimem improve | Run self-improvement cycle (--dry-run, --threshold 90) |
| wikimem export | Export wiki to other formats |
| wikimem open | Open vault in Obsidian |
| wikimem history | Browse audit trail, restore snapshots |
| wikimem mcp | Start MCP server for Claude Code / Cursor |
| wikimem publish | Publish wiki as static site, RSS, JSON feed, or digest (--format html,rss,json-feed,digest) |
| wikimem duplicates | Detect and manage near-duplicate sources |
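The `search` command is described as BM25 full-text search. For readers unfamiliar with the ranking function, here is a compact sketch of standard Okapi BM25 over an in-memory set of pages. It is not WikiMem's implementation: the tokenizer, the `k1` and `b` defaults (common textbook values), and the non-negative IDF variant are choices made for this example.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase and split on non-alphanumeric characters."""
    return re.findall(r"[a-z0-9]+", text.lower())


def bm25_rank(
    query: str, pages: dict[str, str], k1: float = 1.5, b: float = 0.75
) -> list[tuple[str, float]]:
    """Rank pages against the query with Okapi BM25, best match first."""
    docs = {name: tokenize(text) for name, text in pages.items()}
    if not docs:
        return []
    avg_len = sum(len(tokens) for tokens in docs.values()) / len(docs)
    # Document frequency: in how many pages does each term occur?
    doc_freq = Counter(term for tokens in docs.values() for term in set(tokens))
    query_terms = set(tokenize(query))

    scores = {}
    for name, tokens in docs.items():
        counts = Counter(tokens)
        score = 0.0
        for term in query_terms:
            tf = counts[term]
            if tf == 0:
                continue
            # IDF with +1 inside the log so it never goes negative.
            idf = math.log(1 + (len(docs) - doc_freq[term] + 0.5) / (doc_freq[term] + 0.5))
            # Term frequency saturates via k1; b controls length normalisation.
            norm = tf + k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf * (k1 + 1) / norm
        if score > 0:
            scores[name] = score
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)
```

Pages with no matching term are left out of the result rather than returned with a zero score.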

Web UI

wikimem serve opens a full IDE at localhost:3141:

  • File tree: browse wiki pages with collapsible folders
  • Tabbed editor: open multiple pages, WYSIWYG markdown editing
  • Knowledge graph: interactive D3 force-directed visualization
  • Pipeline view: drag-and-drop file ingestion with step-by-step progress
  • Time-lapse: scrub through git history to watch your wiki grow
  • Search: Cmd+K fuzzy search across all pages
  • Command palette: Cmd+P for quick actions
  • Settings: configure API keys, models, and automations from the UI
  • Ask your knowledge: query your wiki from the browser

MCP Server

WikiMem ships with a built-in MCP server so Claude Code and Cursor can read, search, and query your wiki directly.

Add to Claude Code (.mcp.json):

{
  "mcpServers": {
    "wikimem": {
      "command": "npx",
      "args": ["-y", "wikimem", "mcp"],
      "env": {
        "WIKIMEM_VAULT": "/path/to/your/vault"
      }
    }
  }
}
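If a project already has a `.mcp.json` with other servers, the entry above needs to be merged in rather than pasted over the file. The sketch below does that merge. The `add_wikimem_server` helper is hypothetical; the command, arguments, and `WIKIMEM_VAULT` variable are the ones shown in the JSON above.

```python
import json
from pathlib import Path


def add_wikimem_server(config_path: Path, vault: str) -> dict:
    """Merge a wikimem entry into .mcp.json, keeping any other servers intact."""
    config = json.loads(config_path.read_text()) if config_path.exists() else {}
    servers = config.setdefault("mcpServers", {})
    servers["wikimem"] = {
        "command": "npx",
        "args": ["-y", "wikimem", "mcp"],
        "env": {"WIKIMEM_VAULT": vault},
    }
    config_path.write_text(json.dumps(config, indent=2) + "\n")
    return config
```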

Or run standalone:

wikimem-mcp

Configuration

After wikimem init, your vault contains config.yaml:

provider: claude                    # claude | openai | ollama
model: claude-sonnet-4-20250514

sources:
  - name: "HN Front Page"
    type: rss
    url: "https://hnrss.org/frontpage"

  - name: "GitHub Trending TS"
    type: github
    query: "stars:>100 created:>7d language:typescript"

improvement:
  threshold: 80
  schedule: "0 3 * * *"            # 3am nightly

Environment Variables

| Variable | Purpose |
| --- | --- |
| ANTHROPIC_API_KEY | Claude API access (default provider) |
| OPENAI_API_KEY | OpenAI API access |
| OLLAMA_BASE_URL | Ollama server URL (default: http://localhost:11434) |
| FIRECRAWL_API_KEY | Enhanced URL-to-markdown (optional) |
| DEEPGRAM_API_KEY | Audio transcription (optional, falls back to Whisper) |
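A quick pre-flight check can confirm that the variables needed by the chosen provider are present before starting a long ingest. The sketch below encodes the table as data. The function names are hypothetical, and the fallback labels (`whisper`, `fetch`) simply restate the behavior described in this README rather than anything verified against the code.

```python
import os
from typing import Mapping, Optional

# Which key each provider needs, per the table. Ollama needs none.
REQUIRED_KEY = {
    "claude": "ANTHROPIC_API_KEY",
    "openai": "OPENAI_API_KEY",
    "ollama": None,
}


def check_environment(provider: str, env: Optional[Mapping[str, str]] = None) -> list[str]:
    """Return a list of human-readable problems; an empty list means ready."""
    env = os.environ if env is None else env
    if provider not in REQUIRED_KEY:
        raise ValueError(f"unknown provider: {provider}")
    key = REQUIRED_KEY[provider]
    if key and not env.get(key):
        return [f"{key} is not set but provider is {provider}"]
    return []


def optional_backends(env: Optional[Mapping[str, str]] = None) -> dict[str, str]:
    """Report which optional services would be used given the current keys."""
    env = os.environ if env is None else env
    return {
        "ollama_url": env.get("OLLAMA_BASE_URL", "http://localhost:11434"),
        "transcription": "deepgram" if env.get("DEEPGRAM_API_KEY") else "whisper",
        "url_fetch": "firecrawl" if env.get("FIRECRAWL_API_KEY") else "fetch",
    }
```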

Architecture

vault/
├── wiki/           ← LLM-generated pages (sources/, entities/, concepts/, syntheses/)
├── raw/            ← Immutable source documents (date-stamped subdirectories)
├── AGENTS.md       ← Schema — wiki structure + conventions
├── config.yaml     ← Configuration — provider, sources, schedules
└── index.md        ← Content catalog (auto-maintained)

Three layers: raw/ (immutable sources) → LLM processing → wiki/ (structured knowledge). AGENTS.md is the schema file that tells the LLM how to structure output — it co-evolves with your wiki.

Three automations: Ingest (file watcher → process → wiki pages), Scrape (RSS/GitHub/URLs → raw/), Observe (LLM Council → score → improve).
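The "immutable sources in date-stamped subdirectories" convention can be sketched as a deposit step that copies a file into `raw/<date>/` and refuses to overwrite anything already there. This is an illustration only: the `YYYY-MM-DD` directory name and the `deposit` helper are assumptions, since the README does not specify the exact date format.

```python
import shutil
from datetime import date
from pathlib import Path
from typing import Optional


def deposit(source: Path, vault: Path, today: Optional[date] = None) -> Path:
    """Copy a source file into raw/<date>/ without ever overwriting (hypothetical)."""
    # Assumption: subdirectories are named with the ISO date, e.g. 2026-04-12.
    day = (today or date.today()).isoformat()
    target_dir = vault / "raw" / day
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / source.name
    if target.exists():
        raise FileExistsError(f"{target} already exists; raw/ is treated as immutable")
    shutil.copy2(source, target)
    return target
```

Copying instead of moving leaves the original file untouched, and the overwrite check keeps earlier deposits stable so wiki citations continue to point at the same bytes.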

Obsidian Integration

WikiMem vaults are Obsidian vaults. Open any wikimem directory in Obsidian — no plugins, no configuration:

  • [[wikilinks]] rendered as backlinks
  • YAML frontmatter as page metadata
  • Graph view showing all connections
  • Tag view from frontmatter tags: arrays
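The backlink relationship in the first bullet is easy to compute outside Obsidian as well, and the same pass can surface orphan pages and broken links, the kinds of problems `wikimem lint` is described as checking. The following is a minimal sketch under assumptions: pages are supplied as a name-to-text mapping keyed by link target, and `link_report` is a hypothetical helper, not part of WikiMem.

```python
import re

# Captures the target of [[target]], [[target|label]] or [[target#heading]].
WIKILINK = re.compile(r"\[\[([^\]|#]+)")


def link_report(pages: dict[str, str]) -> dict[str, object]:
    """Build a backlink index and list orphan pages and broken links."""
    backlinks = {name: set() for name in pages}
    broken = set()
    for name, text in pages.items():
        for target in WIKILINK.findall(text):
            target = target.strip()
            if target not in pages:
                broken.add((name, target))  # link points at a page that does not exist
            elif target != name:
                backlinks[target].add(name)
    # An orphan is a page that no other page links to.
    orphans = sorted(name for name, sources in backlinks.items() if not sources)
    return {
        "backlinks": {name: sorted(sources) for name, sources in backlinks.items()},
        "orphans": orphans,
        "broken": sorted(broken),
    }
```

Under this definition a top-level catalog page that nothing links back to is reported as an orphan, so a real check would likely exempt it.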

Privacy

  • Everything runs locally. Your wiki is a folder of markdown files.
  • No data is sent anywhere except LLM API calls (avoidable with Ollama) and, if you set their API keys, the optional Firecrawl and Deepgram services.
  • raw/ excluded from git by default — your source documents stay private.
  • config.yaml excluded from git — API keys never committed.

| Path | Safe to commit? | Why |
| --- | --- | --- |
| wiki/ | Yes | LLM-generated summaries, no raw personal data |
| AGENTS.md | Yes | Schema file, no personal data |
| raw/ | No | Original source files |
| config.yaml | No | May contain API keys |

Credits

Inspired by Andrej Karpathy's LLM Wiki pattern — the idea that LLMs should compile knowledge into structured, interlinked wikis rather than just answering questions from raw chunks.

Built with Express, D3, simple-git, and the Claude / OpenAI / Ollama APIs.

License

MIT — see LICENSE.

Keywords

llm

Package last updated on 12 Apr 2026