by Leeroo-AI
An ML engineering plugin for your coding agents.
# Add to your Claude Code skills
git clone https://github.com/Leeroo-AI/supermlLast scanned: 5/30/2026
{
"issues": [],
"status": "PASSED",
"scannedAt": "2026-05-30T15:46:55.750Z",
"npmAuditRan": true,
"pipAuditRan": true
}superml is an open-source ai agents skill for AI coding assistants such as Claude Code, Codex CLI, and ChatGPT, built by Leeroo-AI. An ML engineering plugin for your coding agents. It has 186 GitHub stars.
Yes. superml passed SkillsLLM's automated security scan — a dependency vulnerability audit plus prompt-injection heuristics — with no high-severity issues. You can read the full report in the Security Report section on this page.
Clone the repository with "git clone https://github.com/Leeroo-AI/superml" and add it to your Claude Code skills directory (see the Installation section above).
superml is primarily written in Python. It is open-source under Leeroo-AI on GitHub, so you can review or fork the full source.
Yes. SkillsLLM lists many other AI Agents skills you can browse and compare side by side. Open the AI Agents category from the badge at the top of this page, or use the Related Skills and comparison links further down to weigh superml against similar tools.
No comments yet. Be the first to share your thoughts!
It adds two things your coding agent doesn't have:
ML Pipeline: Seven skills that encode the workflow you already follow. Plan against real framework docs. Catch config mistakes before they cost you GPU hours. Debug OOM, NaN, and divergence by root cause, not by guessing. Get ranked next steps when metrics plateau. An agentic experiment memory carries hypotheses, results, and lessons across sessions — your agent stops repeating failed experiments and starts compounding what works.
Memory: Backed by Leeroopedia, 27k+ pages across 1000+ ML/AI frameworks. Config references, debugging heuristics, implementation patterns, and battle-tested defaults from vLLM to DeepSpeed to LangChain. Built by the Leeroo continuous learning system, structured as a browsable wiki, and continuously updated by AI and human engineers. When your agent recommends a config, it points to the page it learned it from.
Works with Claude Code, Cursor, Codex, OpenCode, and Gemini CLI.
ml-expert) handles deeper tasks and remembers your hardware, experiments, and lessons across sessions.37 ML tasks scored head-to-head: Cursor / Claude Code with SuperML vs without. Each response rated by independent LLM judges across correctness, specificity, mistake prevention, actionability, and grounding.
See TESTED_TASKS.md for detailed scores and methodology.
The plugin works without an API key — skills use web search to ground answers. With a key, your agent gets access to the Leeroopedia knowledge base (27k+ pages, faster and more precise lookups). The plugin will tell you if it's running without a key.
To get a key: app.leeroopedia.com — $20 free credit on signup, no credit card.
export LEEROOPEDIA_API_KEY=kpsk_your_key_here
Add to your shell profile (~/.bashrc, ~/.zshrc) so it persists.
Register the marketplace, then install the plugin:
/plugin marketplace add leeroo-ai/leeroo-marketplace
/plugin install superml@leeroo-marketplace
Or install directly from GitHub:
claude plugin add --from-github leeroo-ai/superml
In Cursor Agent chat (waiting for Cursor team approval):
/add-plugin superml
Or clone into your project — Cursor auto-detects .cursor-plugin/plugin.json:
git clone https://github.com/leeroo-ai/superml.git
See .codex/INSTALL.md.
See .opencode/INSTALL.md.
git clone https://github.com/leeroo-ai/superml.git
gemini extension add ./superml/gemini-extension.json
If you just want the knowledge base without the full plugin, see leeroopedia-mcp for setup instructions.
You get the MCP tools (memory) but not the workflow skills (process).
Start a conversation and try something like:
I'm fine-tuning Llama 3.1 8B on 50k instruction pairs with 1xA100 80GB.
Set up the full training config — QLoRA, proper chat template, loss masking on prompts.
If it's working, your agent will ground its answer in documentation (KB citations or web sources), catch common pitfalls before they waste a training run, and give you a runnable config.
| Skill | What it does |
|---|---|
| ml-plan | Plan training runs, architectures, and multi-step pipelines |
| ml-verify | Check configs, code, and math before you burn GPU hours |
| ml-debug | Debug OOM, NaN, divergence, crashes, bad throughput |
| ml-iterate | Ranked next steps when results aren't where you want them |
| ml-experiment | Track experiments — hypotheses, results, and learnings across sessions |
| ml-research | Deep-dive into ML topics, compare approaches, survey frameworks |
| using-superml | Loaded at session start — wires up skills to KB tools and sets quality standards |
ml-expert: a persistent ML engineer agent for the bigger stuff: pipeline reviews, deep analysis, framework deep-dives. It remembers your hardware setup, past experiments, and lessons learned across sessions.
SuperML ships with a self-refine toolkit that lets you optimize the skills for your specific ML niche. Describe your domain, generate a test suite, and run an automated judge-refine loop:
python self-refine/generate_suite.py "biomedical fine-tuning with clinical NLP"
python self-refine/run.py --suite self-refine/suites/biomedical-fine-tuning-with-clinical-nlp.yaml
See self-refine/README.md for the full guide.
SuperML is integrated in our enterprise platform — forecasting & planning, fraud & anomaly detection, customer analytics, recommendation systems, document intelligence, and customer service automation.
See CONTRIBUTING.md for how to report bugs, suggest improvements, and submit PRs.