orbit

Name: orbit
Author: schmitech

Verified

Self-hosted, OpenAI-compatible AI gateway for private RAG, natural-language data access, and tool-calling agents.

290stars

53forks

Python

Installation

# Add to your Claude Code skills
git clone https://github.com/schmitech/orbit

Getting Started

Guides for using ai agents skills like orbit.

Caveman: Cut Claude Token Use by 65%
How agent-side prompt compression works, when to use it, and when not to.
What is an AI Skills Marketplace?
Definitions, how marketplaces work, and how to choose between them in 2026.
Getting Started with AI Skills

Security ReportVerified

Last scanned: 5/30/2026

{
  "issues": [],
  "status": "PASSED",
  "scannedAt": "2026-05-30T15:14:31.382Z",
  "npmAuditRan": true,
  "pipAuditRan": true
}

README.md

Frequently Asked Questions

What is orbit?

orbit is an open-source ai agents skill for AI coding assistants such as Claude Code, Codex CLI, and ChatGPT, built by schmitech. Self-hosted, OpenAI-compatible AI gateway for private RAG, natural-language data access, and tool-calling agents. It has 290 GitHub stars.

Is orbit safe to use?

Yes. orbit passed SkillsLLM's automated security scan — a dependency vulnerability audit plus prompt-injection heuristics — with no high-severity issues. You can read the full report in the Security Report section on this page.

How do I install orbit?

Clone the repository with "git clone https://github.com/schmitech/orbit" and add it to your Claude Code skills directory (see the Installation section above).

What programming language is orbit written in?

orbit is primarily written in Python. It is open-source under schmitech on GitHub, so you can review or fork the full source.

Are there alternatives to orbit?

Yes. SkillsLLM lists many other AI Agents skills you can browse and compare side by side. Open the AI Agents category from the badge at the top of this page, or use the Related Skills and comparison links further down to weigh orbit against similar tools.

Agentic AI for Beginners

Build your first AI agent from scratch - tool use, ReAct pattern, memory, deployment

41 minBeginner

Comments (0)

to leave a comment.

No comments yet. Be the first to share your thoughts!

Related Skills

superpowers

by obra

An agentic skills framework & software development methodology that works.

234,966

cashclaw system-prompts-and-models-of-ai-tools-chinese

ORBIT

Open Retrieval-Based Inference Toolkit

A self-hosted, OpenAI-compatible AI gateway for private RAG, natural-language data access, and tool-calling agents — run it in your own environment across 37+ model providers.

What is ORBIT?

ORBIT is a single API gateway that puts production controls in front of any AI model and connects it to your private data. Point one OpenAI-compatible /v1/chat endpoint at local or cloud models, ask natural-language questions over your databases and documents, and give models controlled access to tools — all behind API keys, quotas, rate limits, and audit logs you host yourself.

Reach for ORBIT when you need to:

💬 Query SQL, NoSQL, vector stores, REST/GraphQL APIs, Elasticsearch, and files in plain English
🔀 Route one API contract across local (Ollama, llama.cpp, vLLM) and cloud models
🛠️ Give models scoped access to MCP tools and multi-step agent workflows
🔒 Ship private RAG with auth, quotas, moderation, metrics, and admin controls
🧪 Prototype locally, then keep the same architecture in production

Comparisons: ORBIT vs. Open WebUI · ORBIT vs. LiteLLM

⭐ If ORBIT is useful to you, a star helps more developers find it.

🏗️ Architecture

🚀 Quick Start

[!WARNING] Do not clone and deploy the main branch in production. Use the latest stable release tarball from GitHub Releases instead. The main branch is intended for development and testing and may include unreleased changes.

Latest stable release tarball (Linux/macOS):

curl -LO https://github.com/schmitech/orbit/releases/download/v2.8.3/orbit-2.8.3.tar.gz
tar -xzf orbit-2.8.3.tar.gz && cd orbit-2.8.3
./install/setup.sh        # add --wizard for interactive setup
./bin/orbit.sh start

Docker Compose from main — for local development/testing with Ollama and a lightweight local model:

git clone https://github.com/schmitech/orbit.git
cd orbit/docker
docker compose up -d
# NVIDIA GPU: docker compose -f docker-compose.yml -f docker-compose.gpu.yml up -d

Windows (native): see the Windows installation guide.

Verify the gateway:

curl -X POST http://localhost:3000/v1/chat \
  -H 'Content-Type: application/json' \
  -H 'X-API-Key: default-key' \
  -H 'X-Session-ID: local-test' \
  -d '{"messages": [{"role": "user", "content": "Summarize ORBIT in one sentence."}], "stream": false}'

Then open the admin dashboard at http://localhost:3000/admin (admin / admin123) for metrics, latency, sessions, adapters, logs, and health.

📖 Full setup: Docker Guide · Tutorial

✨ Features

Capability	What you get
OpenAI-compatible gateway	One `/v1/chat` interface across local, self-hosted, and cloud providers.
Model routing (37+ providers)	Local: Ollama, llama.cpp, vLLM, TensorRT-LLM, Transformers, LM Studio, BitNet. Cloud: OpenAI, Anthropic, Gemini, Bedrock, Vertex, Azure, Groq, Mistral, DeepSeek, xAI, and many more.
Natural-language data access	Plain English → SQL, MongoDB, Elasticsearch DSL, REST, GraphQL across Postgres, MySQL, Oracle, SQL Server, DuckDB, Athena, Supabase, Cassandra, Redis, and composite multi-source answers.
Vector RAG	Chroma, Qdrant, Pinecone, Milvus, Weaviate, Marqo, pgvector, FAISS, DuckDB, Redis, Elasticsearch.
File & multimodal RAG	PDFs, DOCX, spreadsheets, CSVs, markdown, images, and audio — with cached file context across conversations.
Pluggable file storage	Store uploads and generated media on local disk (default) or in the cloud — AWS S3, MinIO / SeaweedFS (S3-compatible), Azure Blob, or Google Cloud Storage — selected with one config key.
File encryption at rest	AES-256-GCM encryption for uploaded files, opt-in per adapter — keep classified or regulated documents encrypted on disk or in the cloud without touching adapters that don't need it.
Cloud secrets management	Resolve API keys, DB passwords, and connection strings from AWS Secrets Manager, Azure Key Vault, or GCP Secret Manager instead of (or alongside) `.env` — selected with one config key. Guide.
Web search	Provider-agnostic real-time context via DuckDuckGo (free), Brave, SearXNG, Serper, Tavily, Google PSE, Perplexity — decoupled from synthesis so any LLM can answer.
MCP tool agents	Connect MCP servers (filesystem, GitHub, Slack, Postgres, Jira, Notion, and more) over stdio/SSE with bounded, server-side agent loops.
A2A peer protocol	Google Agent-to-Agent support — discovery via `/.well-known/agent.json` and task delegation over JSON-RPC. Guide.
Media generation	Image, video, PDF, Word, Excel, PowerPoint, CSV, and markdown generation adapters in the same chat flow.
Voice (STT/TTS)	OpenAI, Whisper, Google, Gemini, ElevenLabs, Coqui, and more.
Production controls	API keys, RBAC, SSO via Entra ID & Auth0 (OIDC), per-key routing, rate limits, token quotas, moderation, circuit breakers, fallback routing, metrics, audit logs, and hot adapter reloads.
Config-first	Define adapters, providers, datasources, prompts, and guardrails in YAML — no application code.

📚 Deep dive: Docs index · Adapter guide · Configuration · Cookbook

💡 Use Cases

Same gateway, different jobs. Each use case below maps to a recurring pain that teams across industries are trying to solve with AI — with a short demo as proof.

Business outcome	Who feels the pain	What it replaces
Answers from scattered documents	Insurance, legal, finance, research	Hours spent manually cross-referencing PDFs, spreadsheets, and scans
Self-service data access	Retail, operations, finance, SaaS	BI ticket backlogs and waiting on the data team to write SQL
Faster incident response	SRE, DevOps, security ops	Bottlenecks where only a few specialists can write query DSL
Automated internal workflows	IT ops, support, back-office	Manual glue work across databases, Slack, files, and tickets
Conversational tool use, zero orchestration code	Product, support, SaaS builders	Bolt-on agent frameworks (LangChain, AutoGen) just to get tool calling
AI on regulated data	Healthcare, government, legal, banking	Compliance blocks on sending sensitive data to cloud LLMs
Governed AI rollout	Any regulated or enterprise org	Shadow AI with no audit trail, cost control, or visibility
In-flow content generation	Marketing, e-learning, communications	Slow, fragmented media-production tooling