Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.
An open-source AI agent that brings the power of Gemini directly into your terminal.