llm-wiki-agent

Name: llm-wiki-agent
Author: SamurAIGPT

Verified

A personal knowledge base that builds and maintains itself. Drop in sources — Claude (or Codex/Gemini) reads them, extracts knowledge, and maintains a persistent interlinked wiki. Works with Claude Code, Codex, OpenCode, Gemini CLI. No API key needed.

3,131stars

364forks

Python

Added 4/7/2026

View on GitHub Download ZIP Scan for vulnerabilities

30 days in the Featured rail · terms & refunds

AI Agentsai-agentai-toolsautomationclaude-codecodexgeminigenerative-aiknowledge-baseknowledge-graphllmmarkdownmuapinote-takingobsidianopen-sourcepersonal-knowledge-managementragresearchsecond-brainwiki

Installation

# Add to your Claude Code skills
git clone https://github.com/SamurAIGPT/llm-wiki-agent

Getting Started

Guides for using ai agents skills like llm-wiki-agent.

Security ReportVerified

Last scanned: 4/24/2026

{
  "issues": [],
  "status": "PASSED",
  "scannedAt": "2026-04-24T06:08:51.804Z",
  "semgrepRan": false,
  "npmAuditRan": true,
  "pipAuditRan": false
}

README.md

LLM Wiki Agent

A coding agent skill. Drop source documents into raw/ and tell the agent to ingest them — it reads them, extracts knowledge, and builds a persistent interlinked wiki. Every new source makes the wiki richer. You never write it.

Most knowledge tools make you search your own notes. This one reads everything you've collected and writes a structured wiki that compounds over time — cross-references already built, contradictions already flagged, synthesis already done.

ingest raw/papers/attention-is-all-you-need.md

wiki/
├── index.md          catalog of all pages — updated on every ingest
├── log.md            append-only record of every operation
├── overview.md       living synthesis across all sources
├── sources/          one summary page per source document
├── entities/         people, companies, projects — auto-created
├── concepts/         ideas, frameworks, methods — auto-created
└── syntheses/        query answers filed back as wiki pages
graph/
├── graph.json        persistent node/edge data (SHA256-cached)
└── graph.html        interactive vis.js visualization — open in any browser

Related Projects

Open-Generative-AI — Add AI image & video generation to your knowledge base pipeline
Open-AI-Design-Agent — Autonomous AI design agent — pair with wiki agent for research + visual output
AI-Voice-Agent — Self-hosted AI voice agent for real-time voice conversations, sales calls, and customer support

Install

Requires: Claude Code, Codex, Gemini CLI, or any agent that reads a config file.

git clone https://github.com/SamurAIGPT/llm-wiki-agent.git
cd llm-wiki-agent

Open in your agent — no API key or Python setup needed:

claude      # reads CLAUDE.md + .claude/commands/ (slash commands available)
codex       # reads AGENTS.md
opencode    # reads AGENTS.md
gemini      # reads GEMINI.md

Usage

All agents understand natural language and shorthand triggers:

ingest raw/papers/my-paper.md              # ingest a markdown source
ingest report.pdf                          # auto-converts to .md, then ingests
ingest slides.pptx notes.docx              # batch, mixed formats
query: what are the main themes?           # synthesize answer from wiki pages
lint                                       # find orphans, contradictions, gaps
build graph                                # build graph.html from all wikilinks

Plain English works too:

"Ingest this paper: raw/papers/llama2.md"
"What does the wiki say about attention mechanisms?"
"Check for contradictions across sources"
"Build the knowledge graph and tell me the most connected nodes"

Claude Code also provides /wiki-ingest, /wiki-query, /wiki-lint, /wiki-graph as slash commands (via .claude/commands/). These are Claude Code-specific — other agents use the natural language triggers above, which work identically.

Works with markdown, PDF, DOCX, PPTX, XLSX, HTML, TXT, CSV, JSON, XML, RST, EPUB, and more. Non-markdown files are auto-converted via markitdown at ingest time — no separate step needed.

What You Get

Persistent wiki — structured markdown pages that accumulate across sessions. Unlike chat, nothing is lost.

Entity pages — auto-created for every person, company, or project mentioned across sources. Updated each time a new source references them.

Concept pages — auto-created for every key idea or framework. Cross-referenced to every source that discusses them.

Living overview — wiki/overview.md is revised on every ingest to reflect the current synthesis across everything you've read.

Contradiction flags — when a new source contradicts an existing claim, it's flagged at ingest time, not buried until query time.

Knowledge graph — graph.html shows every wiki page as a node, every [[wikilink]] as an edge, and Claude-inferred implicit relationships as dotted edges. Community detection clusters related topics.

Lint reports — orphan pages, broken links, missing entity pages, data gaps with suggested sources to fill them.

Use Cases

Research

Going deep on a topic over weeks — reading papers, articles, reports.

/wiki-ingest raw/papers/attention-is-all-you-need.md
/wiki-ingest raw/papers/llama2.md
/wiki-ingest raw/papers/rag-survey.md

# Wiki builds entity pages (Meta AI, Google Brain) and
# concept pages (Attention, RLHF, Context Window) automatically.

/wiki-query "What are the main approaches to reducing hallucination?"
/wiki-query "How has context window size evolved across models?"

/wiki-lint
# → "No sources on mixture-of-experts — consider the Mixtral paper"

By the end you have a structured, interlinked reference — not a folder of PDFs you'll never reopen.

Reading a Book

File each chapter as you go. Build out pages for characters, themes, arguments.

/wiki-ingest raw/book/chapter-01.md
/wiki-ingest raw/book/chapter-02.md

# Wiki creates entity and theme pages automatically.

/wiki-query "How has the protagonist's motivation evolved?"
/wiki-query "What contradictions exist in the author's argument so far?"

/wiki-graph   # → graph.html shows every character/theme and how they connect

Think fan wikis like Tolkien Gateway — built as you read, with the agent doing all the cross-referencing.

Personal Knowledge Base

Track goals, health, habits, self-improvement — file journal entries, articles, podcast notes.

/wiki-ingest raw/journal/2026-01-week1.md
/wiki-ingest raw/articles/huberman-sleep-protocol.md
/wiki-ingest raw/articles/atomic-habits-summary.md

/wiki-query "What patterns show up in my journal entries about energy?"
/wiki-query "What habits have I tried and what was the outcome?"

The wiki builds a structured picture over time. Concepts like "Sleep", "Exercise", "Deep Work" accumulate evidence from every source filed.

Business / Team Intelligence

Feed in meeting transcripts, project docs, customer calls.

/wiki-ingest raw/meetings/q1-planning-transcript.md
/wiki-ingest raw/docs/product-roadmap-2026.md
/wiki-ingest raw/calls/customer-interview-acme.md

/wiki-query "What feature requests have come up most across customer calls?"
/wiki-query "What decisions were made in Q1 and what was the rationale?"

/wiki-lint
# → "Project X mentioned in 5 pages but no dedicated page"
# → "Roadmap contradicts customer interview on priority of feature Y"

The wiki stays current because the agent does the maintenance no one wants to do.

Competitive Analysis

Track a company, market, or technology over time.

/wiki-ingest raw/competitors/openai-announcements.md
/wiki-ingest raw/market/ai-funding-report-q1.md

/wiki-query "How do OpenAI and Anthropic differ on safety approach?"
/wiki-query "Which companies announced multimodal models in the last 6 months?"
/wiki-query "Competitive landscape summary as of today"
# → agent shows the answer, then asks if you want to save it as a synthesis page

The Graph

Two-pass build:

Deterministic — parses all [[wikilinks]] across wiki pages → edges tagged EXTRACTED
Semantic — agent infers implicit relationships not captured by wikilinks → edges tagged INFERRED (with confidence score) or AMBIGUOUS

Louvain community detection clusters nodes by topic. SHA256 cache means only changed pages are reprocessed. Output is a self-contained graph.html — no server, opens in any browser.

CLAUDE.md / AGENTS.md

The schema file tells the agent how to maintain the wiki — page formats, ingest/query/lint/graph workflows, naming conventions. This is the key config file. Edit it to customize behavior for your domain.

Agent	Schema file
Claude Code	`CLAUDE.md`
Codex / OpenCode	`AGENTS.md`
Gemini CLI	`GEMINI.md`

What Makes This Different from RAG

RAG	LLM Wiki Agent
Re-derives knowledge every query	Compiles once, keeps current
Raw chunks as retrieval unit	Structured wiki pages
No cross-references	Cross-references pre-built
Contradictions surface at query time (maybe)	Flagged at ingest time
No accumulation	Every source makes the wiki richer

Obsidian Integration

The wiki is designed to be browsed seamlessly in Obsidian. Since the agent maintains consistent [[wikilinks]], you get a naturally growing knowledge graph in your vault.

Vault Symlink Pattern

If you want to keep the LLM Wiki Agent repository separate from your main personal vault, use symlinks:

Keep your working agent repository at e.g., ~/llm-wiki-agent

Create a symlink from your main Obsidian vault:

ln -sfn ~/llm-wiki-agent/wiki ~/your-obsidian-vault/wiki

Use the Obsidian Web Clipper or write directly to raw/ in the agent repo to queue items for ingestion.

Note: If you ever move your local repo directory, remember to update the symlink, otherwise the wiki/ directory will appear missing in Obsidian.

Recommended .obsidian Config

Graph View: Filter out index.md and log.md (e.g. -file:index.md -file:log.md) to avoid them becoming gravity wells in your Obsidian graph.
Dataview: Use the community plugin Dataview to query the YAML frontmatter the agent automatically injects (e.g., type: source, tags: [diary]).

Multi-Format Ingest

Drop any supported file directly into ingest — no separate conversion step needed:

# These all work — auto-converted at ingest time
ingest report.pdf
inges

Frequently Asked Questions

What is llm-wiki-agent?

llm-wiki-agent is an open-source ai agents skill for AI coding assistants such as Claude Code, Codex CLI, and ChatGPT, built by SamurAIGPT. A personal knowledge base that builds and maintains itself. Drop in sources — Claude (or Codex/Gemini) reads them, extracts knowledge, and maintains a persistent interlinked wiki. Works with Claude Code, Codex, OpenCode, Gemini CLI. No API key needed. It has 3,131 GitHub stars.

Is llm-wiki-agent safe to use?

Yes. llm-wiki-agent passed SkillsLLM's automated security scan — a dependency vulnerability audit plus prompt-injection heuristics — with no high-severity issues. You can read the full report in the Security Report section on this page.

How do I install llm-wiki-agent?

Clone the repository with "git clone https://github.com/SamurAIGPT/llm-wiki-agent" and add it to your Claude Code skills directory (see the Installation section above).

What programming language is llm-wiki-agent written in?

llm-wiki-agent is primarily written in Python. It is open-source under SamurAIGPT on GitHub, so you can review or fork the full source.

Are there alternatives to llm-wiki-agent?

Yes. SkillsLLM lists many other AI Agents skills you can browse and compare side by side. Open the AI Agents category from the badge at the top of this page, or use the Related Skills and comparison links further down to weigh llm-wiki-agent against similar tools.

Agentic AI for Beginners

Build your first AI agent from scratch - tool use, ReAct pattern, memory, deployment

41 minBeginner

Comments (0)

to leave a comment.

No comments yet. Be the first to share your thoughts!

Related Skills

superpowers

by obra

An agentic skills framework & software development methodology that works.

234,966

20,863

Shell

AI Agentsaibrainstorming

View details

Compare

ECC

by affaan-m

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

226,412

34,630

JavaScript

AI Agentsai-agentsanthropic

The agent that grows with you

209,875

38,386

Python

AI Agentsaiai-agent

View details

Compare

n8n

by n8n-io

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

195,351

59,093

TypeScript

MCP Serversaiapis

View details

Compare

everything-claude-code

by affaan-m

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

185,940

28,768

JavaScript

AI Agentsai-agentsanthropic

View details

Compare

claude-code

by anthropics

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows - all through natural language commands.

120,031

19,897

Shell

AI Agents

View details

Compare

Browse all AI Agents skills

Developers Also Liked

Based on votes and bookmarks from developers who liked this skill

claude-mem

by thedotmack

Persistent Context Across Sessions for Every Agent – Captures everything your agent does during sessions, compresses it with AI, and injects relevant context back into future sessions. Works with Claude Code, OpenClaw, Codex, Gemini, Hermes, Copilot, OpenCode + More

86,041

7,447

JavaScript

AI Agentsaiai-agents

View details

learn-claude-code

by shareAI-lab

Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1

69,977

11,395

Python

AI Agentsagentagent-development

View details

caveman

by JuliusBrussee

🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman

85,172

4,737

JavaScript

AI Agentsaianthropic

View details

awesome-claude-skills

by ComposioHQ

A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows

66,928

7,473

Python

AI Agentsagent-skillsai-agents

View details

graphify

by safishamsi

AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. App code + database schema + infrastructure in one graph.

76,348

7,585

Python

AI Agentsantigravityclaude-code

View details

cherry-studio

by CherryHQ

AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs

48,195

4,573

TypeScript

AI Agentsagent-skillsai-agent

View details

ArcReel apm

LLM Wiki Agent

Most knowledge tools make you search your own notes. This one reads everything you've collected and writes a structured wiki that compounds over time — cross-references already built, contradictions already flagged, synthesis already done.

ingest raw/papers/attention-is-all-you-need.md

wiki/
├── index.md          catalog of all pages — updated on every ingest
├── log.md            append-only record of every operation
├── overview.md       living synthesis across all sources
├── sources/          one summary page per source document
├── entities/         people, companies, projects — auto-created
├── concepts/         ideas, frameworks, methods — auto-created
└── syntheses/        query answers filed back as wiki pages
graph/
├── graph.json        persistent node/edge data (SHA256-cached)
└── graph.html        interactive vis.js visualization — open in any browser

Related Projects

Open-Generative-AI — Add AI image & video generation to your knowledge base pipeline
Open-AI-Design-Agent — Autonomous AI design agent — pair with wiki agent for research + visual output
AI-Voice-Agent — Self-hosted AI voice agent for real-time voice conversations, sales calls, and customer support

Install

Requires: Claude Code, Codex, Gemini CLI, or any agent that reads a config file.

git clone https://github.com/SamurAIGPT/llm-wiki-agent.git
cd llm-wiki-agent

Open in your agent — no API key or Python setup needed:

claude      # reads CLAUDE.md + .claude/commands/ (slash commands available)
codex       # reads AGENTS.md
opencode    # reads AGENTS.md
gemini      # reads GEMINI.md

Usage

All agents understand natural language and shorthand triggers:

ingest raw/papers/my-paper.md              # ingest a markdown source
ingest report.pdf                          # auto-converts to .md, then ingests
ingest slides.pptx notes.docx              # batch, mixed formats
query: what are the main themes?           # synthesize answer from wiki pages
lint                                       # find orphans, contradictions, gaps
build graph                                # build graph.html from all wikilinks

Plain English works too:

"Ingest this paper: raw/papers/llama2.md"
"What does the wiki say about attention mechanisms?"
"Check for contradictions across sources"
"Build the knowledge graph and tell me the most connected nodes"

Works with markdown, PDF, DOCX, PPTX, XLSX, HTML, TXT, CSV, JSON, XML, RST, EPUB, and more. Non-markdown files are auto-converted via markitdown at ingest time — no separate step needed.

What You Get

Persistent wiki — structured markdown pages that accumulate across sessions. Unlike chat, nothing is lost.

Entity pages — auto-created for every person, company, or project mentioned across sources. Updated each time a new source references them.

Concept pages — auto-created for every key idea or framework. Cross-referenced to every source that discusses them.

Living overview — wiki/overview.md is revised on every ingest to reflect the current synthesis across everything you've read.

Contradiction flags — when a new source contradicts an existing claim, it's flagged at ingest time, not buried until query time.

Lint reports — orphan pages, broken links, missing entity pages, data gaps with suggested sources to fill them.

Use Cases

Research

Going deep on a topic over weeks — reading papers, articles, reports.

/wiki-ingest raw/papers/attention-is-all-you-need.md
/wiki-ingest raw/papers/llama2.md
/wiki-ingest raw/papers/rag-survey.md

# Wiki builds entity pages (Meta AI, Google Brain) and
# concept pages (Attention, RLHF, Context Window) automatically.

/wiki-query "What are the main approaches to reducing hallucination?"
/wiki-query "How has context window size evolved across models?"

/wiki-lint
# → "No sources on mixture-of-experts — consider the Mixtral paper"

By the end you have a structured, interlinked reference — not a folder of PDFs you'll never reopen.

Reading a Book

File each chapter as you go. Build out pages for characters, themes, arguments.

/wiki-ingest raw/book/chapter-01.md
/wiki-ingest raw/book/chapter-02.md

# Wiki creates entity and theme pages automatically.

/wiki-query "How has the protagonist's motivation evolved?"
/wiki-query "What contradictions exist in the author's argument so far?"

/wiki-graph   # → graph.html shows every character/theme and how they connect

Think fan wikis like Tolkien Gateway — built as you read, with the agent doing all the cross-referencing.

Personal Knowledge Base

Track goals, health, habits, self-improvement — file journal entries, articles, podcast notes.

/wiki-ingest raw/journal/2026-01-week1.md
/wiki-ingest raw/articles/huberman-sleep-protocol.md
/wiki-ingest raw/articles/atomic-habits-summary.md

/wiki-query "What patterns show up in my journal entries about energy?"
/wiki-query "What habits have I tried and what was the outcome?"

The wiki builds a structured picture over time. Concepts like "Sleep", "Exercise", "Deep Work" accumulate evidence from every source filed.

Business / Team Intelligence

Feed in meeting transcripts, project docs, customer calls.

/wiki-ingest raw/meetings/q1-planning-transcript.md
/wiki-ingest raw/docs/product-roadmap-2026.md
/wiki-ingest raw/calls/customer-interview-acme.md

/wiki-query "What feature requests have come up most across customer calls?"
/wiki-query "What decisions were made in Q1 and what was the rationale?"

/wiki-lint
# → "Project X mentioned in 5 pages but no dedicated page"
# → "Roadmap contradicts customer interview on priority of feature Y"

The wiki stays current because the agent does the maintenance no one wants to do.

Competitive Analysis

Track a company, market, or technology over time.

/wiki-ingest raw/competitors/openai-announcements.md
/wiki-ingest raw/market/ai-funding-report-q1.md

/wiki-query "How do OpenAI and Anthropic differ on safety approach?"
/wiki-query "Which companies announced multimodal models in the last 6 months?"
/wiki-query "Competitive landscape summary as of today"
# → agent shows the answer, then asks if you want to save it as a synthesis page

The Graph

Two-pass build:

Deterministic — parses all [[wikilinks]] across wiki pages → edges tagged EXTRACTED
Semantic — agent infers implicit relationships not captured by wikilinks → edges tagged INFERRED (with confidence score) or AMBIGUOUS

Louvain community detection clusters nodes by topic. SHA256 cache means only changed pages are reprocessed. Output is a self-contained graph.html — no server, opens in any browser.

CLAUDE.md / AGENTS.md

Agent	Schema file
Claude Code	`CLAUDE.md`
Codex / OpenCode	`AGENTS.md`
Gemini CLI	`GEMINI.md`

What Makes This Different from RAG

RAG	LLM Wiki Agent
Re-derives knowledge every query	Compiles once, keeps current
Raw chunks as retrieval unit	Structured wiki pages
No cross-references	Cross-references pre-built
Contradictions surface at query time (maybe)	Flagged at ingest time
No accumulation	Every source makes the wiki richer

Obsidian Integration

The wiki is designed to be browsed seamlessly in Obsidian. Since the agent maintains consistent [[wikilinks]], you get a naturally growing knowledge graph in your vault.

Vault Symlink Pattern

If you want to keep the LLM Wiki Agent repository separate from your main personal vault, use symlinks:

Keep your working agent repository at e.g., ~/llm-wiki-agent

Create a symlink from your main Obsidian vault:

ln -sfn ~/llm-wiki-agent/wiki ~/your-obsidian-vault/wiki

Use the Obsidian Web Clipper or write directly to raw/ in the agent repo to queue items for ingestion.

Note: If you ever move your local repo directory, remember to update the symlink, otherwise the wiki/ directory will appear missing in Obsidian.

Recommended .obsidian Config

Graph View: Filter out index.md and log.md (e.g. -file:index.md -file:log.md) to avoid them becoming gravity wells in your Obsidian graph.
Dataview: Use the community plugin Dataview to query the YAML frontmatter the agent automatically injects (e.g., type: source, tags: [diary]).

Multi-Format Ingest

Drop any supported file directly into ingest — no separate conversion step needed:

# These all work — auto-converted at ingest time
ingest report.pdf
inges