# Generative-Media-Skills

by SamurAIGPT

Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai.
```bash
# Add to your Claude Code skills
git clone https://github.com/SamurAIGPT/Generative-Media-Skills
```

The Ultimate Multimodal Toolset for Claude Code, Cursor, and Gemini CLI. A high-performance, schema-driven architecture for AI agents to generate, edit, and display professional-grade images, videos, and audio — powered by muapi-cli.
🚀 Get Started | 🎨 Expert Library | ⚙️ Core Primitives | 🤖 MCP Server | 📖 Reference
- `--jq` filtering for seamless agentic pipelines.
- `muapi-cli` — no curl, no JSON parsing, no boilerplate.
- `--view` flag to automatically download and open generated media in your system viewer.
- `muapi mcp serve` to expose all 19 tools directly to Claude Desktop, Cursor, or any MCP-compatible agent.
This repository uses a Core/Library split to ensure efficiency and high-signal discovery for LLMs:
**Core (`/core`)** — Thin wrappers around muapi-cli for raw API access.

- `core/media/` — File upload
- `core/edit/` — Image editing (prompt-based)
- `core/platform/` — Setup, auth & result polling

**Library (`/library`)** — High-value skills that translate creative intent into technical directives.

- `/library/motion/cinema-director/` — Technical film direction & cinematography.
- `/library/visual/nano-banana/` — Reasoning-driven image generation (Gemini 3 style).
- `/library/visual/ui-design/` — High-fidelity mobile/web mockups (Atomic Design).
- `/library/visual/logo-creator/` — Minimalist vector branding (Geometric Primitives).
- `/library/motion/seedance-2/` — Director-level cinematic video generation with text-to-video, image-to-video, and video extension with native audio-video sync.

The core scripts require muapi-cli. Install it once:
```bash
# via npm (recommended — no Python required)
npm install -g muapi-cli

# via pip
pip install muapi-cli

# or run without installing
npx muapi-cli --help
```
```bash
# Interactive setup
muapi auth configure

# Or pass your key directly
muapi auth configure --api-key "YOUR_MUAPI_KEY"

# Get your key at https://muapi.ai/dashboard
```
```bash
# Install all skills to your AI agent
npx skills add SamurAIGPT/Generative-Media-Skills --all

# Or install a specific skill
npx skills add SamurAIGPT/Generative-Media-Skills --skill muapi-media-generation

# Install to specific agents
npx skills add SamurAIGPT/Generative-Media-Skills --all -a claude-code -a cursor
```
```bash
muapi image generate "a cyberpunk city at night" --model flux-dev

# Download the result automatically
muapi image generate "a sunset over mountains" --model hidream-fast --download ./outputs

# Extract just the URL (agent-friendly)
muapi image generate "product on white bg" --model flux-schnell --output-json --jq '.outputs[0]'
```
```bash
# Use Nano-Banana reasoning to generate a 2K masterpiece
bash library/visual/nano-banana/scripts/generate-nano-art.sh \
  --file ./my-source-image.jpg \
  --subject "a glass hummingbird" \
  --style "macro photography" \
  --resolution "2k" \
  --view
```
```bash
cd library/motion/cinema-director

# Create a 10-second epic reveal
bash scripts/generate-film.sh \
  --subject "a cybernetic dragon over Tokyo" \
  --intent "epic" \
  --model "kling-v3.0-pro" \
  --duration 10 \
  --view
```
```bash
# Animate a reference image into video
bash library/motion/seedance-2/scripts/generate-seedance.sh \
  --mode i2v \
  --file ./concept.jpg \
  --subject "camera slowly pulls back to reveal the full landscape" \
  --intent "reveal" \
  --view
```
```bash
# Extend an existing video
bash library/motion/seedance-2/scripts/generate-seedance.sh \
  --mode extend \
  --request-id "YOUR_REQUEST_ID" \
  --subject "camera continues pulling back to reveal the vast city" \
  --duration 10
```
Run muapi as a Model Context Protocol server so Claude Desktop, Cursor, or any MCP-compatible agent can call generation tools directly — no shell scripts needed.
```bash
muapi mcp serve
```
Claude Desktop config (`~/Library/Application Support/Claude/claude_desktop_config.json`):

```json
{
  "mcpServers": {
    "muapi": {
      "command": "muapi",
      "args": ["mcp", "serve"],
      "env": { "MUAPI_API_KEY": "your-key-here" }
    }
  }
}
```
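Cursor uses the same `mcpServers` shape; a minimal sketch, assuming your Cursor version reads a global MCP config from `~/.cursor/mcp.json` (check Cursor's MCP settings for the exact location):

```json
{
  "mcpServers": {
    "muapi": {
      "command": "muapi",
      "args": ["mcp", "serve"],
      "env": { "MUAPI_API_KEY": "your-key-here" }
    }
  }
}
```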
This exposes 19 structured tools with full JSON Schema input/output definitions:
| Tool | Description |
|------|-------------|
| `muapi_image_generate` | Text-to-image (14 models) |
| `muapi_image_edit` | Image-to-image editing (11 models) |
| `muapi_video_generate` | Text-to-video (13 models) |
| `muapi_video_from_image` | Image-to-video (16 models) |
| `muapi_audio_create` | Music generation (Suno) |
| `muapi_audio_from_text` | Sound effects (MMAudio) |
| `muapi_enhance_upscale` | AI upscaling |
| `muapi_enhance_bg_remove` | Background removal |
| `muapi_enhance_face_swap` | Face swap (image/video) |
| `muapi_enhance_ghibli` | Ghibli style transfer |
| `muapi_edit_lipsync` | Lip sync to audio |
| `muapi_edit_clipping` | AI highlight extraction |
| `muapi_predict_result` | Poll prediction status |
| `muapi_upload_file` | Upload local file → URL |
| `muapi_keys_list` | List API keys |
| `muapi_keys_create` | Create API key |
| `muapi_keys_delete` | Delete API key |
| `muapi_account_balance` | Get credit balance |
| `muapi_account_topup` | Add credits (Stripe checkout) |
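Over MCP, an agent invokes each tool with a standard JSON-RPC `tools/call` request. An illustrative sketch follows; the argument names `prompt` and `model` are assumptions for illustration, not confirmed parameter names — consult each tool's published JSON Schema for the real input shape:

```json
{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/call",
  "params": {
    "name": "muapi_image_generate",
    "arguments": {
      "prompt": "a cyberpunk city at night",
      "model": "flux-dev"
    }
  }
}
```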
```bash
# Submit async, capture request_id, poll when ready
REQUEST_ID=$(muapi video generate "a dog running on a beach" \
  --model kling-master --no-wait --output-json --jq '.request_id' | tr -d '"')

# ... do other work ...

muapi predict wait "$REQUEST_ID" --download ./outputs
```
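When `--jq`-style filtering isn't an option, the `request_id` can also be pulled out with plain POSIX tools. A minimal sketch, assuming the `--output-json` payload carries a top-level `request_id` field — the JSON below is a stand-in, not real CLI output:

```bash
# Stand-in for the JSON that `--output-json` would emit
resp='{"request_id":"abc123","status":"queued"}'

# Extract request_id with sed; sed drops the surrounding quotes itself,
# so the tr -d step from the example above is not needed here
REQUEST_ID=$(printf '%s' "$resp" | sed -n 's/.*"request_id" *: *"\([^"]*\)".*/\1/p')
echo "$REQUEST_ID"
```

The extracted ID can then be handed to `muapi predict wait "$REQUEST_ID"` exactly as shown above.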
```bash
# Pipe a prompt from another command
generate_prompt | muapi image generate - --model flux-dev
```
```bash
# Chain: upload → edit → download
URL=$(muapi upload file ./photo.jpg --output-json --jq '.url' | tr -d '"')
muapi image edit "make it look like a painting" --image "$URL" \
  --model flux-kontext-pro --download ./outputs
```
This repository includes a streamlined `schema_data.json` that core scripts use at runtime to validate model parameters such as `aspect_ratio`, `resolution`, and `duration` values.

Discover all available models via the CLI:
```bash
muapi models list
muapi models list --category video --output-json
```
Optimized for the next generation of AI development environments: `--output-json`, `--jq`, and semantic exit codes for scripting.

MIT © 2026