by Ontos-AI
Knowhere extracts, parses, and outputs structured chunks ready for AI Agents and RAG.
# Add to your Claude Code skills
git clone https://github.com/Ontos-AI/knowhereLast scanned: 5/23/2026
{
"issues": [],
"status": "PASSED",
"scannedAt": "2026-05-23T06:33:39.539Z",
"semgrepRan": false,
"npmAuditRan": true,
"pipAuditRan": true
}knowhere is an open-source ai agents skill for AI coding assistants such as Claude Code, Codex CLI, and ChatGPT, built by Ontos-AI. Knowhere extracts, parses, and outputs structured chunks ready for AI Agents and RAG. It has 1,805 GitHub stars.
Yes. knowhere passed SkillsLLM's automated security scan — a dependency vulnerability audit plus prompt-injection heuristics — with no high-severity issues. You can read the full report in the Security Report section on this page.
Clone the repository with "git clone https://github.com/Ontos-AI/knowhere" and add it to your Claude Code skills directory (see the Installation section above).
knowhere is primarily written in Python. It is open-source under Ontos-AI on GitHub, so you can review or fork the full source.
Yes. SkillsLLM lists many other AI Agents skills you can browse and compare side by side. Open the AI Agents category from the badge at the top of this page, or use the Related Skills and comparison links further down to weigh knowhere against similar tools.
No comments yet. Be the first to share your thoughts!
Knowhere is the memory layer between complex, dirty documents and AI agents.
It ingests unstructured documents and produces persistent, navigable memory: parsing, hierarchy extraction, multi-modal structuring, and graph construction in a single pipeline. Every chunk retains full semantic context, making the output a natural fit for Agentic RAG, vector-based RAG, or any LLM workflow.
[!NOTE] Get started in seconds with Knowhere Cloud. Avoid the complexity of self-deployment. Use our managed API at knowhereto.ai and enjoy $5 in free credits upon registration.
Knowhere runs in two steps: build memory from documents, then let agents retrieve from it.
Q: What is Knowhere's relationship with MinerU?
A: Knowhere uses MinerU as its default parser because it performs best in our tests. Any parser only gets you raw Markdown. Knowhere's value is what comes after: hierarchy reconstruction, multi-modal normalization, and cross-document graph construction. Any Markdown-outputting tool works.
Q: What LLM / VLM dependencies does Knowhere have?
A: By default, DeepSeek (deepseek-chat) handles text and table summarization, and Qwen-VL (qwen3.6-flash) handles image OCR and descriptions. Knowhere is model-agnostic. Swap in OpenAI, DashScope, Zhipu, or Volcengine via environment variables.
Q: How is Agentic Retrieval different from traditional RAG?
A: Traditional RAG does a flat vector lookup and returns isolated snippets. Knowhere's agents navigate the document's section tree and cross-document graph, drilling into the most relevant regions the way a human reader would, returning traceable, well-contextualized evidence.
Q: Does it handle images and tables?
A: Yes. Knowhere extracts them, runs them through VLMs for summarization and feature extraction, and links them back to their source chunks so agents can retrieve and cite multi-modal assets at inference time.
Agents using Knowhere outperform those working from raw documents, Markitdown, Unstructured, or MinerU output on real-world tasks: searching, modifying, and answering questions.
We're not developing the next MinerU — we're building document memory infrastructure that agents can effectively consume.
(Internal evaluation across identical agentic RAG tasks. Baselines: raw documents and parser output fed directly to agents.)
[!NOTE] 📊 Benchmarks are actively expanding. More parsers and retrieval baselines coming soon.
| Repository | Description |
|---|---|
| knowhere | This repo. Backend API and worker: document ingestion, parsing, graph construction, and retrieval. |
| 🖥️ knowhere-dashboard | The web UI. Connects to the API for the full product experience. |
| 🐳 knowhere-self-hosted | Docker Compose stack for self-hosted deployments. Packages the API, worker, and dashboard together. |
| 🐍 knowhere-python-sdk | Official Python SDK for the Knowhere Cloud API. |
| 🦕 knowhere-node-sdk | Official Node.js SDK for the Knowhere Cloud API. |
✅ Supported
.pdf .docx .pptx .xlsx .csv.jpg .png.md .txt .json⏳ Coming Soon
.epub .html .xml.mp4 .mp3.skills.mdWant to see a new format supported? Adding a parser is a great first contribution. Check out CONTRIBUTING.md to get started.
uvdocker composeuv sync --all-packages
cp apps/api/.env.example apps/api/.env
cp apps/worker/.env.example apps/worker/.env
.env files with the values you need for local work:DS_KEY, ALI_API_KEYS, GPT_API_KEY, or GLM_API_KEYMINERU_API_KEYS if you need PDF parsingMost parser and retrieval tuning values have code defaults. Start with the required external services first, then override model names, provider URLs, budgets, or concurrency limits only when your deployment needs different behavior. See docs/external-services.md for the full dependency matrix.
./deploy/local-dev/start-dev.sh
cd apps/api && uv run main.py
cd apps/worker && uv run worker.py
The API runs migrations during startup.
For API-only development without the dashboard, create an API-only user/key after the API service starts:
cd apps/api
uv run scripts/init_user.py --email you@example.com
If you plan to use the dashboard, register through the dashboard instead of
using scripts/init_user.py.
The API is now running at http://localhost:5005. If you want the full product experience with a UI, run the knowhere-dashboard alongs