by schmitech
A self-hosted AI infrastructure for private RAG and multi-model applications.
# Add to your Claude Code skills
git clone https://github.com/schmitech/orbitTeams want AI connected to real business data without sending everything to a SaaS vendor, rewriting applications for every model provider, or maintaining fragile glue code between LLMs, databases, APIs, and files.
ORBIT gives you one OpenAI-compatible gateway for private RAG, model routing, retrieval adapters, conversations, tools, and production controls. Run it on your infrastructure, connect the systems you already use, and choose local or hosted models per workload.
You can build:
git clone https://github.com/schmitech/orbit.git && cd orbit/docker
docker compose up -d
Then test the OpenAI-compatible chat API:
curl -X POST http://localhost:3000/v1/chat \
-H 'Content-Type: application/json' \
-H 'X-API-Key: default-key' \
-H 'X-Session-ID: local-test' \
-d '{
"messages": [{"role": "user", "content": "Summarize ORBIT in one sentence."}],
"stream": false
}'
ORBIT listens on port 3000. The admin panel is available at with the default login / .
No comments yet. Be the first to share your thoughts!
adminadmin123For GPU acceleration:
docker compose -f docker-compose.yml -f docker-compose.gpu.yml up -d
Adapter wiring and sample domains live in config/adapters/ and examples/intent-templates/. See the full Docker Guide for GPU mode, volumes, and configuration.
| Common problem | What ORBIT provides | | :--- | :--- | | One SDK per provider, with rewrites when you switch | One OpenAI-compatible API across local and hosted providers | | Separate systems for inference, retrieval, tools, and chat history | One gateway for model calls, adapters, tools, conversations, and clients | | RAG limited to vector search over clean documents | Retrieval over SQL, NoSQL, HTTP, GraphQL, files, web content, and vector stores | | Glue scripts between prompts and business systems | Intent adapters, composite adapters, diagnostics, and reusable templates | | Privacy-sensitive data sent through third-party services by default | Self-hosted deployment with local models, local embeddings, API keys, RBAC, audit logs, and rate limits | | Provider failures cascading into application failures | Circuit breakers, failover, parallel fan-out, and quota-aware throttling |
ORBIT is probably more than you need if you only want a thin wrapper around one LLM provider.
ORBIT is not only a model router. It handles the layers that usually become custom infrastructure in production RAG systems: retrieval, tools, adapters, conversations, access control, and operational safeguards.
| Use case | Start here | | :--- | :--- | | Chat with a local model through an OpenAI-compatible API | Step-by-step tutorial | | Ask Postgres, MySQL, MongoDB, DuckDB, or Elasticsearch questions in natural language | Database copilot | | Query SQL + NoSQL + REST APIs in one prompt | Composite adapters | | Upload files and get grounded answers | File-upload RAG | | Deploy a private AI gateway for regulated data | Private gateway cookbook | | Run ORBIT as an MCP tool server for agents | [MCP / OpenClaw walkth