Tingly Box decides which model to call, when to compress context, and how to route requests for maximum efficiency, offering secure, reliable, and customizable functional extensions.
Key Features
Unified API – One mixin endpoint to rule them all, use what you like - OpenAI / Anthropic / Google
Smart Routing, Not Just Load Balancing – Intelligently route requests across models and tokens based on cost, speed, or custom policies, not simple load balancing
Smart Context Compression – (Coming soon) Automatically distill context to its essential parts: sharper relevance, lower cost, and faster responses
Auto API Translation – Seamlessly bridge OpenAI, Anthropic, Google, and other API dialects—no code changes needed
Blazing Fast – Adds typically < 1ms of overhead—so you get flexibility without latency tax
Flexible Auth – Support for both API keys and OAuth (e.g., Claude.ai), so you can use your existing quotas anywhere
Visual Control Panel – Intuitive UI to manage providers, routes, aliases, and models at a glance
Client Side Usage Stats - Track token consumption, latency, cost estimates, and model selection per request—directly from your client
Quick Start
Install
From npm (recommended)
# Install and run (auto service migration without any args)
npx tingly-box@latest
if any trouble, please check tingly-box process and port 12580 and confirm to kill them.
From source code
Requires: Go 1.21+, Node.js 18+, pnpm, task, openapi-generator-cli