Your-First-LLM-Studio

Name: Your-First-LLM-Studio
Author: ChrisChen667788

First LLM Studio: local-first LLM studio for Apple Silicon with MLX runtimes, Compare Lab, benchmark ops, replay, and runtime telemetry.

101 stars

8 forks

TypeScript

Installation

# Add to your Claude Code skills
git clone https://github.com/ChrisChen667788/Your-First-LLM-Studio

Security Report: Warning

Last scanned: 6/18/2026

{
  "issues": [
    {
      "type": "npm-audit",
      "message": "@next/eslint-plugin-next: Vulnerability found",
      "severity": "high"
    },
    {
      "type": "npm-audit",
      "message": "brace-expansion: brace-expansion: Zero-step sequence causes process hang and memory exhaustion",
      "severity": "medium"
    },
    {
      "type": "npm-audit",
      "message": "eslint-config-next: Vulnerability found",
      "severity": "high"
    },
    {
      "type": "npm-audit",
      "message": "glob: glob CLI: Command injection via -c/--cmd executes matches with shell:true",
      "severity": "high"
    },
    {
      "type": "npm-audit",
      "message": "js-yaml: JS-YAML: Quadratic-complexity DoS in merge key handling via repeated aliases",
      "severity": "medium"
    },
    {
      "type": "npm-audit",
      "message": "next: Next Vulnerable to Denial of Service with Server Components",
      "severity": "high"
    },
    {
      "type": "npm-audit",
      "message": "picomatch: Picomatch: Method Injection in POSIX Character Classes causes incorrect Glob Matching",
      "severity": "high"
    },
    {
      "type": "npm-audit",
      "message": "playwright: Playwright downloads and installs browsers without verifying the authenticity of the SSL certificate",
      "severity": "high"
    },
    {
      "type": "npm-audit",
      "message": "postcss: PostCSS has XSS via Unescaped </style> in its CSS Stringify Output",
      "severity": "medium"
    },
    {
      "type": "npm-audit",
      "message": "yaml: yaml is vulnerable to Stack Overflow via deeply nested YAML collections",
      "severity": "medium"
    }
  ],
  "status": "WARNING",
  "scannedAt": "2026-06-18T08:49:29.444Z",
  "npmAuditRan": true,
  "pipAuditRan": true,
  "promptInjectionRan": true
}
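
To triage a report like the one above, it helps to count issues per severity and list the affected packages. The sketch below is illustrative only: the `ScanIssue` and `ScanReport` types and both helper functions are hypothetical names, with field names copied from the JSON above, and the sample data is a three-issue excerpt of that report.

```typescript
// Field names mirror the report JSON above; the type names are hypothetical.
interface ScanIssue {
  type: string;
  message: string;
  severity: string;
}

interface ScanReport {
  issues: ScanIssue[];
  status: string;
}

// Count how many issues fall under each severity label.
function countBySeverity(report: ScanReport): Record<string, number> {
  const counts: Record<string, number> = {};
  for (const issue of report.issues) {
    counts[issue.severity] = (counts[issue.severity] ?? 0) + 1;
  }
  return counts;
}

// Assumption: the package name is the text before the first ": " in each message.
function affectedPackages(report: ScanReport): string[] {
  return report.issues.map((issue) => issue.message.split(": ", 1)[0] ?? issue.message);
}

// Three-issue excerpt of the report above.
const excerpt: ScanReport = {
  status: "WARNING",
  issues: [
    { type: "npm-audit", message: "next: Next Vulnerable to Denial of Service with Server Components", severity: "high" },
    { type: "npm-audit", message: "glob: glob CLI: Command injection via -c/--cmd executes matches with shell:true", severity: "high" },
    { type: "npm-audit", message: "yaml: yaml is vulnerable to Stack Overflow via deeply nested YAML collections", severity: "medium" },
  ],
};

console.log(countBySeverity(excerpt)); // { high: 2, medium: 1 }
console.log(affectedPackages(excerpt)); // [ 'next', 'glob', 'yaml' ]
```

Applied to the full report above, the same count gives 6 high and 4 medium issues.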

README.md

Frequently Asked Questions

What is Your-First-LLM-Studio?

Your-First-LLM-Studio is an open-source AI agent skill for AI coding assistants such as Claude Code, Codex CLI, and ChatGPT, built by ChrisChen667788. It packages First LLM Studio, a local-first LLM studio for Apple Silicon with MLX runtimes, Compare Lab, benchmark ops, replay, and runtime telemetry. It has 101 GitHub stars.

Is Your-First-LLM-Studio safe to use?

Your-First-LLM-Studio returned warnings in SkillsLLM's automated security scan. It has no critical vulnerabilities, but review the flagged issues in the Security Report section before adding it to your workflow.

How do I install Your-First-LLM-Studio?

Clone the repository with "git clone https://github.com/ChrisChen667788/Your-First-LLM-Studio" and add it to your Claude Code skills directory (see the Installation section above).

What programming language is Your-First-LLM-Studio written in?

Your-First-LLM-Studio is primarily written in TypeScript. It is open-source under ChrisChen667788 on GitHub, so you can review or fork the full source.

Are there alternatives to Your-First-LLM-Studio?

Yes. SkillsLLM lists many other AI Agents skills you can browse and compare side by side. Open the AI Agents category from the badge at the top of this page, or use the Related Skills and comparison links further down to weigh Your-First-LLM-Studio against similar tools.

First LLM Studio

English | 简体中文

English

First LLM Studio is a local-first LLM workbench for Apple Silicon. It brings local MLX runtimes, remote API targets, Agent sessions, Compare, Fine-tune, Benchmark, model discovery, runtime recovery, release evidence, and admin monitoring into one operating surface.

It is not another chat shell. It is built for people who need to compare behavior, debug runtimes, run evals, prepare adapters, and keep local and remote model work inside one product loop.

Product Surfaces

Route	Core workflow
`/agent`	Tool-enabled Agent sessions, target selection, runtime state, replay, trace review, and embedded Compare entry.
`/compare`	Route-owned Compare Studio for prompt composition, lane preview, recipe persistence, review drawer, and benchmark handoff.
`/fine-tune`	Foreground Fine-tune Studio for datasets, recipes, training, evaluation, chat adapter proof loops, export, reports, and artifacts.
`/models`	Model discovery and install verification for local/community models plus hardware-fit and risk signals.
`/benchmarks`	Benchmark run controls, progress, reports, release evidence, baselines, and regression review.
`/retrieval`	Foreground knowledge management, path import, chunk inspection, and grounded retrieval validation.
`/experiments`	Unified run/session timeline with artifact lineage, cross-feature navigation, filters, and retention controls.
`/admin`	Monitoring/configuration mirror for runtime, queues, benchmark history, provider health, guardrails, and audit timelines.

Major Version Story

Version	Core capabilities
`v0.1` Foundation	Established the local-first web studio, Apple Silicon/MLX gateway workflow, local + remote target catalog, runtime telemetry, and the first Agent/Admin operating split.
`v0.2` Agent + Benchmark Ops	Added richer Agent workbench flows, Compare-style target review, replay/trace inspection, runtime recovery controls, formal benchmark operations, baselines, and regression evidence.
`v0.3` Fine-tune + Release Evidence	Added fine-tune operation loops for evaluation, adapter chat, adapter export, and distillation starters; expanded operation history, partitioned typechecks, screenshot smoke, route smoke, and public launch assets.
`v0.4` Product IA release	Moves `/fine-tune`, `/compare`, `/models`, `/benchmarks`, `/retrieval`, and `/experiments` into foreground product routes with feature-owned state/actions, artifact lineage and retention, dark-glass studio/workbench styling, canonical APIs, and admin narrowed toward monitoring/configuration.
`v0.4.1` Stability baseline	Repairs dataless workspace failure modes, keeps route smoke and typecheck green, updates the OpenAI-compatible `/v1` surface, refreshes provider status reporting, and captures current real UI evidence.
`v0.5` Starter track	Enterprise RAG, deployment registry, OpenAI-compatible API, telemetry, release-readiness gates, production attestation, and control-plane rehearsal work continue behind explicit preview gates until promoted.

Current release: VERSION.

Who It Helps

Local AI builders on Apple Silicon

Compare MLX local models against hosted APIs under aligned context budgets.
Inspect runtime cost, prewarm, release, recovery, and hardware pressure without leaving the app.
Decide which local model is actually usable for daily coding and analysis workflows.

Agent and tooling teams

Validate tool-calling, repo-grounded behavior, replay, and patch flows in one workbench.
Turn Compare runs into benchmark handoff without switching products.
Separate model-quality failures from provider quirks and local runtime instability.

Evaluation and platform engineers

Run formal and focused benchmark suites with repeatable profiles.
Review baselines, deltas, run notes, failure classifications, and release evidence.
Keep local and remote targets inside one comparable target catalog.
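
As a rough illustration of what a baseline/delta review involves, the sketch below compares a run against a stored baseline and flags suites that dropped by more than a tolerance. Everything here is an assumption for illustration: the `SuiteScore` shape, the `passRate` metric, the `compareToBaseline` name, and the 0.02 default tolerance are hypothetical and not taken from the project's benchmark code.

```typescript
// Hypothetical score shape: one pass rate (0..1) per benchmark suite.
interface SuiteScore {
  suite: string;
  passRate: number;
}

interface SuiteDelta {
  suite: string;
  baseline: number;
  current: number;
  delta: number;      // current minus baseline; negative means worse
  regressed: boolean; // true when the drop exceeds the tolerance
}

// Compare each current suite score with its baseline, if one exists.
function compareToBaseline(
  baseline: SuiteScore[],
  current: SuiteScore[],
  tolerance: number = 0.02, // assumed threshold, tune per suite
): SuiteDelta[] {
  const baselineBySuite = new Map<string, number>();
  for (const score of baseline) {
    baselineBySuite.set(score.suite, score.passRate);
  }

  const deltas: SuiteDelta[] = [];
  for (const score of current) {
    const base = baselineBySuite.get(score.suite);
    if (base === undefined) continue; // new suite: nothing to compare against
    const delta = score.passRate - base;
    deltas.push({
      suite: score.suite,
      baseline: base,
      current: score.passRate,
      delta,
      regressed: delta < -tolerance,
    });
  }
  return deltas;
}
```

A suite that appears only in the current run is skipped rather than reported, so a missing baseline never shows up as a regression.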

Core Value

Unified local + remote target catalog.
Compare Lab for model-vs-model output review.
Fine-tune workflows for datasets, recipes, training, evaluation, adapter proof loops, and export.
Benchmark operations with history, progress, baselines, reports, and release evidence.
Foreground Retrieval for document import, chunk inspection, and grounded evidence probes.
Experiments timeline for session/run lineage, artifact navigation, and retention policy.
Replay, trace review, patch inspection, and exportable review notes.
Runtime operations for prewarm, release, restart, log inspection, telemetry, and recovery.
Dynamic local/community model discovery plus remote provider health scanning.

Current Targets

Local

Local Qwen3 0.6B
Local Qwen3 4B 4-bit
Local Qwen3.5 4B 4-bit
Local Gemma 3 4B It Qat 4-bit

Remote

OpenAI Codex
OpenAI GPT-5.5
Claude API
DeepSeek API
Kimi API
GLM API
Qwen API
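
One way to picture a unified local + remote catalog is a single list whose entries are tagged by kind. The sketch below is a hypothetical model, not the project's actual schema: the `Target` type, its `kind` and `runtime` fields, and `groupByKind` are assumptions, while the labels are copied from the lists above.

```typescript
// Hypothetical catalog entry: a discriminated union keyed on `kind`.
type Target =
  | { kind: "local"; label: string; runtime: "mlx" }
  | { kind: "remote"; label: string };

// Labels copied from the Current Targets lists above.
const catalog: Target[] = [
  { kind: "local", label: "Local Qwen3 0.6B", runtime: "mlx" },
  { kind: "local", label: "Local Qwen3 4B 4-bit", runtime: "mlx" },
  { kind: "local", label: "Local Qwen3.5 4B 4-bit", runtime: "mlx" },
  { kind: "local", label: "Local Gemma 3 4B It Qat 4-bit", runtime: "mlx" },
  { kind: "remote", label: "OpenAI Codex" },
  { kind: "remote", label: "OpenAI GPT-5.5" },
  { kind: "remote", label: "Claude API" },
  { kind: "remote", label: "DeepSeek API" },
  { kind: "remote", label: "Kimi API" },
  { kind: "remote", label: "GLM API" },
  { kind: "remote", label: "Qwen API" },
];

// Split the catalog into local and remote labels for display or filtering.
function groupByKind(targets: Target[]): { local: string[]; remote: string[] } {
  const groups: { local: string[]; remote: string[] } = { local: [], remote: [] };
  for (const target of targets) {
    groups[target.kind].push(target.label);
  }
  return groups;
}
```

Keeping both kinds in one array means a comparison can pick any two entries without caring where each one runs.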

Screenshots

Captured from the running local app after `npm run typecheck` and `npm run smoke:routes`. README screenshots are generated with `npm run screenshots:readme` at 2x DPR so text stays sharp on GitHub and ModelScope.

Agent workbench with target catalog, runtime rail, and tool-enabled composer:

Agent workbench

Fine-tune Studio with workflow tabs, training controls, and report/evidence panels:

Fine-tune Studio

Fine-tune completed run with live loss curves, train/validation traces, and handoff actions:

Fine-tune training curve

Benchmark Studio with run controls and historical evidence cards:

Benchmark Studio

Benchmark run evidence from a real local smoke run:

Benchmark run evidence

Models Studio with hardware-fit chips and one-click community discovery:

Models Studio

Compare, Retrieval, and Admin surfaces:

Compare Studio, Retrieval Studio, Admin dashboard, Admin benchmark heatmap

Quick Start

Requirements

macOS on Apple Silicon
Node 22.x
Python 3.12
MLX-compatible local environment

Install

nvm install 22
nvm use 22
npm install
cp .env.example .env.local

Start the web app

npm run dev

Default routes:

Start the local model gateway

python3.12 -m venv .venv
source .venv/bin/activate
pip install -U pip
pip install mlx mlx-lm
python scripts/local_model_gateway_supervisor.py

Gateway health:

http://127.0.0.1:4000/health
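
A script can probe the same endpoint before sending work to the gateway. This is a minimal sketch under stated assumptions: `isGatewayHealthy` and `FetchLike` are hypothetical names, the response body format is not documented here so only the HTTP status is checked, and the fetch implementation is passed in so the function can be exercised without a running gateway.

```typescript
// Minimal subset of the fetch API that this check relies on.
type FetchLike = (url: string) => Promise<{ ok: boolean; status: number }>;

// Health URL from the section above.
const GATEWAY_HEALTH_URL = "http://127.0.0.1:4000/health";

// Resolves to true only when the endpoint answers with a 2xx status.
// Network errors (for example, the gateway is not running) resolve to false.
async function isGatewayHealthy(
  fetchImpl: FetchLike,
  url: string = GATEWAY_HEALTH_URL,
): Promise<boolean> {
  try {
    const response = await fetchImpl(url);
    return response.ok;
  } catch {
    return false;
  }
}

// Usage on Node 22, where `fetch` is available globally:
//   isGatewayHealthy(fetch).then((healthy) => console.log(healthy ? "up" : "down"));
```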

Verification

npm run typecheck:changed
npm run smoke:routes
npm run smoke:screenshots

Configuration

Copy `.env.example` to `.env.local` and fill in only the providers you want to use.

Important notes:

.env.local is ignored by git.
Remote providers are optional.
Several targets use OpenAI-compatible or Claude-compatible endpoints.
Public defaults in this repository are sanitized placeholders.
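
Because providers are optional, a quick preflight can list which keys in an env file actually have values. The sketch below is a simplified illustration, not how the app loads configuration (Next.js reads `.env.local` on its own). The `parseEnv` and `configuredKeys` helpers are hypothetical, and the key names in the sample are made up, since the real variable names live in `.env.example`.

```typescript
// Parse simple KEY=value lines; ignores blank lines and # comments.
// Simplification: no multiline values, no variable expansion.
function parseEnv(text: string): Map<string, string> {
  const values = new Map<string, string>();
  for (const rawLine of text.split(/\r?\n/)) {
    const line = rawLine.trim();
    if (line === "" || line.charAt(0) === "#") continue;
    const eq = line.indexOf("=");
    if (eq <= 0) continue; // no key before "=", skip the line
    const key = line.slice(0, eq).trim();
    // Strip one pair of matching surrounding quotes, if present.
    const value = line.slice(eq + 1).trim().replace(/^(['"])(.*)\1$/, "$2");
    values.set(key, value);
  }
  return values;
}

// Return the keys that have a non-empty value.
function configuredKeys(text: string): string[] {
  return Array.from(parseEnv(text).entries())
    .filter(([, value]) => value !== "")
    .map(([key]) => key);
}

// Made-up key names, for illustration only.
const sampleEnv = [
  "# remote providers are optional",
  "EXAMPLE_PROVIDER_API_KEY=abc123",
  "OTHER_PROVIDER_API_KEY=",
  'EXAMPLE_BASE_URL="http://127.0.0.1:4000"',
].join("\n");

console.log(configuredKeys(sampleEnv)); // [ 'EXAMPLE_PROVIDER_API_KEY', 'EXAMPLE_BASE_URL' ]
```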

Repository Structure

app/                      Next.js app routes and thin API transports
components/               Shared UI and compatibility shells
features/                 Feature-owned routes, contracts, state, actions, and application ports
lib/agent/                Agent runtime, providers, benchmark, gateway helpers
lib/finetune/             Fine-tune store facade and split operation services
scripts/                  Local gateway, runtime, verification, and release scripts
docs/                     Architecture, release notes, launch notes, roadmap, and assets
modelscope/               ModelScope profile/readme metadata
public/                   Public assets and social cover art

Distribution

GitHub: https://github.com/ChrisChen667788/local-agent-lab
ModelScope profile: https://www.modelscope.cn/profile/haozi667788
Default ModelScope repo id: haozi667788/first-llm-studio

The ModelScope package script exports the committed Git tree so GitHub and ModelScope can stay file-identical for each synced version.

Security and Privacy

Sensitive local actions require confirmation.
Secrets belong in .env.local.
Public repository defaults are sanitized.
New public commits should use a GitHub noreply address where possible.
See SECURITY.md.

Contributing

Issues and PRs are welcome.