by Ontos-AI
Knowhere extracts, parses, and outputs structured chunks ready for AI Agents and RAG.
```bash
# Add to your Claude Code skills
git clone https://github.com/Ontos-AI/knowhere
```

We're not developing the next MinerU; instead, we're building document memory infrastructure that agents can effectively consume.
Knowhere turns unstructured documents into persistent, navigable memory for AI agents. It handles parsing, hierarchy extraction, multi-modal structuring, and graph construction, giving your agents structured, high-quality context for Agentic RAG, traditional RAG, or any LLM workflow.
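In outline, ingestion composes a handful of stages. The sketch below is our own schematic of that flow; the names and signatures are illustrative, not Knowhere internals:

```python
# Schematic of the ingestion flow -- illustrative, not Knowhere internals.

def parse_to_markdown(path: str) -> str:
    """Convert the raw document to Markdown (e.g. via MinerU or PyMuPDF)."""
    raise NotImplementedError

def extract_hierarchy(markdown: str) -> "Section":
    """Recover the section tree that flat parsing loses."""
    raise NotImplementedError

def link_assets(tree: "Section") -> "Section":
    """Attach extracted images/tables back to their source chunks."""
    raise NotImplementedError

def build_graph(trees: list["Section"]) -> dict:
    """Connect sections across documents into a navigable memory graph."""
    raise NotImplementedError

def ingest(paths: list[str]) -> dict:
    """End-to-end: documents in, navigable memory graph out."""
    trees = [link_assets(extract_hierarchy(parse_to_markdown(p))) for p in paths]
    return build_graph(trees)
```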
> [!TIP]
> Knowhere stands on the shoulders of giants like MinerU and PyMuPDF. We take their output, optimize it, and then build hierarchical structure and multi-modal cross-document graphs on top. The result is a persistent, citable memory layer purpose-built for agent consumption.
> [!NOTE]
> Get started in seconds with Knowhere Cloud. Avoid the complexity of self-deployment: use our managed API at knowhereto.ai and enjoy $5 in free credits upon registration.
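Against the cloud API, usage might look like the sketch below. Note that the client class, method names, and parameters here are hypothetical placeholders, not the actual knowhere-python-sdk surface; consult the SDK docs for the real API.

```python
# Hypothetical usage sketch -- class and method names are placeholders,
# NOT the actual knowhere-python-sdk API.
from knowhere import KnowhereClient  # assumed import path

client = KnowhereClient(api_key="YOUR_API_KEY")  # key from knowhereto.ai

# Ingest a document into the memory store.
doc = client.ingest("whitepaper.pdf")

# Agentic retrieval over the structured memory, with citations back
# to the exact sections and assets the answer came from.
answer = client.query(doc.id, "What were the Q3 revenue drivers?")
print(answer.text, answer.citations)
```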
Knowhere turns raw documents into a structured memory store that AI agents can navigate and cite. The process follows two steps:

1. **Build memory:** parsing, chunking, hierarchy extraction, and graph construction are unified into one outcome: a navigable memory layer for AI agents (sketched below).
2. **Retrieve by navigation:** agents retrieve by navigating memory instead of depending on a single flat vector lookup.
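As a mental model of step 1, think of the memory layer as a section tree rather than a flat chunk list. Here is a minimal, self-contained sketch (our own simplification, not Knowhere's actual schema) that folds Markdown headings into such a tree:

```python
from dataclasses import dataclass, field

@dataclass
class Section:
    title: str
    level: int                                   # 1 for "#", 2 for "##", ...
    text: list[str] = field(default_factory=list)
    children: list["Section"] = field(default_factory=list)

def build_tree(markdown: str) -> Section:
    """Fold Markdown headings into a navigable section tree."""
    root = Section("ROOT", 0)
    stack = [root]                               # path from root to current section
    for line in markdown.splitlines():
        if line.startswith("#"):
            level = len(line) - len(line.lstrip("#"))
            node = Section(line.lstrip("# ").strip(), level)
            while stack[-1].level >= level:      # close deeper/sibling sections
                stack.pop()
            stack[-1].children.append(node)
            stack.append(node)
        elif line.strip():
            stack[-1].text.append(line.strip())
    return root

tree = build_tree("# Intro\nOverview text.\n## Scope\nDetails.")
print([c.title for c in tree.children])          # ['Intro']
print(tree.children[0].children[0].title)        # 'Scope'
```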
Knowhere improves the accuracy of AI agents performing tasks (e.g., searching, modifying, and answering) over real-world data. Compared to handing agents raw documents, or the .md/.json files produced by other parsers, Knowhere achieves higher success rates with fewer resources.
(Data generated from internal evaluation across identical agentic RAG tasks.)
> [!NOTE]
> Benchmarks are actively expanding. The comparison above currently covers MinerU as the baseline parser. We are continuously adding more parsing tools and retrieval baselines; stay tuned for updated results.
| Repository | Description |
|---|---|
| knowhere | This repo. Backend API and worker: document ingestion, parsing, graph construction, and retrieval. |
| knowhere-dashboard | The web UI. Connects to the API for the full product experience. |
| knowhere-self-hosted | Docker Compose stack for self-hosted deployments. Packages the API, worker, and dashboard together. |
| knowhere-python-sdk | Official Python SDK for the Knowhere Cloud API. |
| knowhere-node-sdk | Official Node.js SDK for the Knowhere Cloud API. |
Q: Is MinerU strictly required for Knowhere to work?
A: No. While MinerU is currently our default choice for parsing PDFs and PPTs because it performs best in our experiments, any tool that can convert documents to Markdown works. Knowhere's real value lies in what happens alongside and after the initial conversion: memory-oriented parsing optimizations (fixing real-world parser deficiencies), reconstructing the hierarchical structure, normalizing multi-modal assets, and building the cross-document navigation graph.
Q: What are the LLM / VLM dependencies?
A: Knowhere requires standard language models to structure the document memory. By default, it uses DeepSeek (deepseek-chat) for text/table summarization and hierarchy generation, and Qwen-VL (qwen3.5-flash) for image OCR and visual descriptions. However, it is entirely model-agnostic: you can easily configure it to use OpenAI, DashScope (Ali), Zhipu (GLM), or Volcengine (ARK) via environment variables.
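Configuration is environment-driven. The variable names below are assumptions for illustration (check the .env.example files for the keys Knowhere actually reads), but the pattern is the usual provider/model/key triple:

```python
import os

# Variable names are illustrative assumptions -- see apps/*/.env.example
# for the actual keys Knowhere reads.
LLM_PROVIDER = os.getenv("LLM_PROVIDER", "deepseek")    # openai | dashscope | zhipu | ark | deepseek
LLM_MODEL    = os.getenv("LLM_MODEL", "deepseek-chat")  # text/table summaries, hierarchy generation
VLM_MODEL    = os.getenv("VLM_MODEL", "qwen3.5-flash")  # image OCR, visual descriptions
LLM_API_KEY  = os.getenv("LLM_API_KEY", "")
```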
Q: How does Agentic Retrieval differ from traditional RAG?
A: Traditional RAG relies on flat vector similarity, which often retrieves isolated, out-of-context text snippets. Knowhere's Agentic Retrieval instead uses a multi-agent workflow to actively navigate the hierarchical section tree and cross-document graph. Agents read the document structure like a human would, drilling down into relevant sections to find precise, well-contextualized evidence.
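To make the contrast concrete, here is a toy navigation loop in the spirit of that workflow, reusing the `Section` class from the sketch above. It is our simplification: keyword overlap stands in for the LLM agent that actually decides where to drill down.

```python
def navigate(section: Section, query: str, max_depth: int = 3) -> Section:
    """Greedily descend the section tree toward the query.

    Keyword overlap is a toy stand-in for the LLM agent that
    actually chooses which child section to enter.
    """
    terms = set(query.lower().split())
    node = section
    for _ in range(max_depth):
        if not node.children:
            break
        node = max(node.children,
                   key=lambda c: len(terms & set(c.title.lower().split())))
    return node  # a precise, well-contextualized subtree to read and cite
```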
Q: Can it handle multi-modal data like images and tables?
A: Yes. Knowhere extracts inline images and tables, passes them through Vision-Language Models (VLMs) for summarization and feature extraction, and explicitly links them back to their original text chunks. This ensures that agents can retrieve and cite multi-modal assets accurately during inference.
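A simple way to picture that linkage (again our own illustrative schema, not Knowhere's storage format): each asset keeps a VLM-generated summary plus a back-reference to the chunk it appeared in, so a retrieved chunk can surface its figures and a retrieved figure can be cited in context.

```python
from dataclasses import dataclass, field

@dataclass
class Asset:
    asset_id: str
    kind: str              # "image" | "table"
    uri: str               # where the extracted file lives
    vlm_summary: str       # caption/features produced by the VLM
    source_chunk_id: str   # explicit back-link to the originating text

@dataclass
class Chunk:
    chunk_id: str
    text: str
    asset_ids: list[str] = field(default_factory=list)  # forward links

fig = Asset("img-7", "image", "assets/img-7.png",
            "Bar chart of Q3 revenue by region.", "chunk-42")
chunk = Chunk("chunk-42", "Revenue grew 12% QoQ...", ["img-7"])
```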
**✅ Supported**

`.pdf` `.docx` `.pptx` `.xlsx` `.csv` `.jpg` `.png` `.md` `.txt` `.json`

**⏳ Coming Soon**

`.epub` `.html` `.xml` `.mp4` `.mp3` `.skills.md`

Want to see a new format supported? Adding a parser is a great first contribution. Check out CONTRIBUTING.md to get started.
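The essential contract for a parser is small: anything that turns a file into Markdown can slot in. A hypothetical shape of that contract follows (the real interface lives in this repo; the names here are placeholders):

```python
from pathlib import Path
from typing import Protocol

class Parser(Protocol):
    """Hypothetical parser contract -- see the repo for the real interface."""
    extensions: tuple[str, ...]

    def to_markdown(self, path: Path) -> str: ...

class EpubParser:
    """Skeleton for a new-format contribution."""
    extensions = (".epub",)

    def to_markdown(self, path: Path) -> str:
        # A real implementation would walk the EPUB spine and convert
        # each XHTML chapter to Markdown.
        raise NotImplementedError
```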
Requirements: `uv` and `docker compose`.

```bash
uv sync --all-packages
cp apps/api/.env.example apps/api/.env
cp apps/worker/.env.example apps/worker/.env
```