by lemonade-sdk
Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our Discord: https://discord.gg/5xXzkMu8Zk
# Add to your Claude Code skills
git clone https://github.com/lemonade-sdk/lemonade
Lemonade is the local AI server that gives you the same capabilities as cloud APIs, except 100% free and private. Use the latest models for chat, coding, speech, and image generation on your own NPU and GPU.
Lemonade comes in two flavors:
This project is built by the community for every PC, with optimizations by AMD engineers to get the most from Ryzen AI, Radeon, and Strix Halo PCs.
To run and chat with Gemma:
lemonade run Gemma-4-E2B-it-GGUF
To code with Lemonade models:
lemonade launch claude
Multi-modality:
# image gen
lemonade run SDXL-Turbo
# speech gen
lemonade run kokoro-v1
# transcription
lemonade run Whisper-Large-v3-Turbo
To see available models and download them:
lemonade list
lemonade pull Gemma-4-E2B-it-GGUF
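To download several models in one go, the pull command above can be wrapped in a small loop. This is only a sketch, assuming a POSIX shell: `pull_models` and the `LEMONADE` variable are hypothetical names introduced here for illustration and are not part of Lemonade. The only Lemonade command it relies on is `lemonade pull`, as shown above.

```shell
#!/bin/sh
# Hypothetical helper, not part of Lemonade: pull each model named on the
# command line by running "lemonade pull <model>" once per argument.

# Assumed override hook for illustration: set LEMONADE=echo to print the
# commands instead of running them (a dry run).
LEMONADE="${LEMONADE:-lemonade}"

pull_models() {
    for model in "$@"; do
        # Stop at the first failure so a bad model name is not silently skipped.
        "$LEMONADE" pull "$model" || {
            echo "failed to pull $model" >&2
            return 1
        }
    done
}
```

With these definitions loaded, `pull_models Gemma-4-E2B-it-GGUF Whisper-Large-v3-Turbo` would run `lemonade pull` once for each of the two models.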
To see the backends available on your PC:
lemonade backends
Lemonade supports a wide variety of models across CPU, GPU, and NPU: LLMs (GGUF, FLM, and ONNX), Whisper, Stable Diffusion, and more.
Use lemonade pull or the built-in Model Manager to download models. You c