by LukasNiessen
Kubernetes Skill for Claude Code and Codex. LLMs hallucinate a lot with K8s, and KubeShark fixes this: it grounds your Kubernetes, Helm, and related work in official best practices.
# Add to your Claude Code skills
git clone https://github.com/LukasNiessen/kubernetes-skill
Run this workflow top to bottom.
Record before writing manifests:
If unknown, state assumptions explicitly.
Select one or more based on user intent and risk:
Primary failure-mode references:
- references/insecure-workload-defaults.md
- references/resource-starvation.md
- references/network-exposure.md
- references/privilege-sprawl.md
- references/fragile-rollouts.md

The #1 Kubernetes skill for Claude Code and Codex, measured by GitHub stars.
LLMs hallucinate a lot when it comes to Kubernetes. They omit security contexts, generate deprecated APIs, use wildcard RBAC, forget resource limits, and produce probes that cause cascading failures. This skill fixes it. It includes best practices for Kubernetes -- good, bad, and neutral examples so the AI avoids common mistakes. Using KubeShark, the AI keeps proven practices in mind, eliminates hallucinations, and defaults to secure, reliable, production-ready manifests.
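To make that concrete, here is a minimal sketch of the kind of defaults the skill pushes toward: a current `apps/v1` API, a restricted-style security context, resource requests and limits, and a readiness probe. Names, image, and port are illustrative, not from the skill itself.

```yaml
apiVersion: apps/v1                   # current API, not a deprecated group
kind: Deployment
metadata:
  name: web                           # hypothetical name
spec:
  replicas: 2
  selector:
    matchLabels:
      app: web
  template:
    metadata:
      labels:
        app: web                      # matches the selector above
    spec:
      containers:
        - name: web
          image: nginxinc/nginx-unprivileged:1.27   # hypothetical image
          ports:
            - containerPort: 8080
          resources:                  # requests/limits prevent resource starvation
            requests:
              cpu: 100m
              memory: 128Mi
            limits:
              memory: 256Mi
          securityContext:            # Pod Security Standards "restricted"-style defaults
            runAsNonRoot: true
            allowPrivilegeEscalation: false
            readOnlyRootFilesystem: true
            capabilities:
              drop: ["ALL"]
            seccompProfile:
              type: RuntimeDefault
          readinessProbe:             # gate traffic until the app is actually ready
            httpGet:
              path: /
              port: 8080
```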
KubeShark is built as the production-grade Kubernetes skill for Claude Code and Codex: broader than resource-template skills, safer than generic Kubernetes prompts, and tuned for hallucination prevention instead of raw tutorial volume.
Most Kubernetes skills dump huge walls of text onto the agent and burn expensive tokens -- with no upside. LLMs don't need the entire Kubernetes docs again. KubeShark was aggressively de-duplicated and optimized for maximum quality per token.
KubeShark is primarily based on the official Kubernetes documentation, the NSA/CISA Kubernetes Hardening Guide, the OWASP Kubernetes Top 10, the CIS Kubernetes Benchmark, and the Pod Security Standards. When guidance conflicts, it prioritizes official Kubernetes documentation.
- references/api-drift.md

Supplemental references (only when needed):
- references/deployment-patterns.md
- references/stateful-patterns.md
- references/job-patterns.md
- references/daemonset-operator-patterns.md
- references/security-hardening.md
- references/observability.md
- references/multi-tenancy.md
- references/storage-and-state.md
- references/helm-patterns.md
- references/kustomize-patterns.md
- references/validation-and-policy.md
- references/examples-good.md
- references/examples-bad.md
- references/do-dont-patterns.md

Conditional Reference Retrieval (CRR) references (load only when the signal is detected):
- references/conditional/eks-patterns.md for EKS, AWS, IRSA, EKS Pod Identity, AWS Load Balancer Controller, EBS/EFS CSI, Karpenter
- references/conditional/gke-patterns.md for GKE, Autopilot, Workload Identity Federation for GKE, Dataplane V2, GCE Ingress, Config Sync
- references/conditional/aks-patterns.md for AKS, Microsoft Entra Workload ID, Azure CNI, AGIC, Azure Disk/File/Blob CSI
- references/conditional/openshift-patterns.md for OpenShift, OKD, ROSA, ARO, Routes, SCCs, OLM, oc
- references/conditional/gitops-controllers.md for Argo CD, ApplicationSet, Flux, GitOps reconciliation, sync waves
- references/conditional/observability-stacks.md for Prometheus Operator, ServiceMonitor, PodMonitor, OpenTelemetry, Loki, Grafana

Do not load multiple CRR files unless the task spans multiple detected platforms/tools.
For each fix, include:
When applicable, output:
Always provide validation steps tailored to deployment method and risk tier:
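A typical validation pass with these tools might look like the following; the file name and cluster version are placeholders, and the commands assume access to the target cluster.

```shell
# Server-side dry run: the API server validates the manifest and runs admission checks
kubectl apply --dry-run=server -f deployment.yaml

# Show what would change relative to the live cluster state
kubectl diff -f deployment.yaml

# Offline schema validation pinned to the target cluster version
kubeconform -kubernetes-version 1.30.0 -strict deployment.yaml
```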
- `kubectl apply --dry-run=server` or `kubectl diff`
- `kubeconform` for schema validation against the target cluster version

Return:
macOS / Linux:
git clone https://github.com/LukasNiessen/kubernetes-skill.git ~/.claude/skills/kubernetes-skill
Windows (PowerShell):
git clone https://github.com/LukasNiessen/kubernetes-skill.git "$env:USERPROFILE\.claude\skills\kubernetes-skill"
Windows (Command Prompt):
git clone https://github.com/LukasNiessen/kubernetes-skill.git "%USERPROFILE%\.claude\skills\kubernetes-skill"
That's it. Claude Code auto-discovers skills in ~/.claude/skills/ -- no restart needed.
Claude Code has a built-in plugin system with marketplace support. Instead of cloning manually, you can add KubeShark's marketplace and install directly from the CLI:
/plugin marketplace add LukasNiessen/kubernetes-skill
/plugin install kubernetes-skill
Or use the interactive plugin manager -- run /plugin, switch to the Discover tab, and install from there. The marketplace reads the .claude-plugin/marketplace.json in this repo to register KubeShark as an installable plugin.
Codex has no global skill system -- setup is per-project. Clone KubeShark into your repo and reference it from your AGENTS.md:
# Clone into your project root
git clone https://github.com/LukasNiessen/kubernetes-skill.git .kubernetes-skill
Then add to your AGENTS.md (or create one in the repo root):
## Kubernetes
When working with Kubernetes manifests, Helm charts, or Kustomize overlays, follow the workflow in `.kubernetes-skill/SKILL.md`.
Load references from `.kubernetes-skill/references/` as needed.
Done. Now ask Claude Code / Codex any Kubernetes question. KubeShark responses follow the 7-step failure-mode workflow and include an output contract with assumptions, tradeoffs, and rollback notes.
Invoke explicitly:
/kubernetes-skill Create a production-ready Deployment with an Ingress and autoscaling
/kubernetes-skill Review my Deployment for security issues and add proper RBAC, NetworkPolicies, and resource limits
Or just ask naturally -- KubeShark activates automatically for any Kubernetes task:
Review my deployment.yaml for security issues
Create a Helm chart for a PostgreSQL StatefulSet with backup CronJobs
| Dimension | KubeShark | No Skill |
| --- | --- | --- |
| SKILL.md activation cost | Low, procedural workflow only | 0 |
| Reference granularity | 26 focused files | -- |
| Token burn per query | Low (load only matched refs) | 0 |
| Architecture | Failure-mode workflow | -- |
| Diagnoses before generating | Yes (Step 2) | No |
| Output contract | Yes -- assumptions, tradeoffs, rollback | No |
| Conditional references | EKS, GKE, AKS, OpenShift, GitOps, observability stacks | No |
| Security-first defaults | PSS restricted profile | No |
| Good/bad examples | Yes (2 dedicated files) | No |
| Do/Don't checklist | Yes (dedicated file) | No |
| Compliance coverage | NSA/CISA, OWASP K8s Top 10, CIS, Pod Security Standards | No |
| Hallucination prevention | Core design goal | No |
| Cross-resource validation | Label/selector/port consistency checks | No |
| Helm/Kustomize guidance | Dedicated reference files | No |
| Policy engine integration | Kyverno and OPA/Gatekeeper patterns | No |
| License | MIT | -- |
The key insight is architectural. A static reference manual gives Claude information but never tells it how to think about a problem. There's no diagnosis step, no risk assessment, and no structured output -- Claude reads the reference and generates whatever it thinks fits.
KubeShark takes the opposite approach. The core SKILL.md is a compact operational workflow. It forces Claude through a diagnostic sequence: capture context -> identify failure modes -> load only the relevant references -> propose fixes with explicit risk controls -> validate -> deliver a structured output contract.
This matters for Kubernetes specifically because:
Silent failures are common. A Service with the wrong selector deploys successfully but routes to nothing. A NetworkPolicy with a mistyped label silently does nothing. Unlike Terraform, Kubernetes accepts most manifests without error -- failures surface at runtime.
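For instance, the following Service applies cleanly yet selects zero pods, because the selector label does not match the label the pods actually carry (names are illustrative):

```yaml
# Assume the Deployment's pods are labeled app: web
apiVersion: v1
kind: Service
metadata:
  name: web
spec:
  selector:
    app: webapp        # typo: pods carry app: web, so the endpoint list stays empty
  ports:
    - port: 80
      targetPort: 8080
```

`kubectl apply` reports success; the breakage only shows up as connection failures at runtime, or via `kubectl get endpointslices` returning no addresses.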
Multi-dimensional risk. Kubernetes operates across security, networking, scheduling, storage, and application lifecycle simultaneously. An LLM must reason about all these dimensions for every resource.
Training data pollution. Kubernetes has had aggressive API deprecation. The LLM training corpus contains vast amounts of pre-1.22 YAML using removed APIs. Without diagnosis, the LLM confidently generates deprecated configurations.
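A classic example: `Ingress` under `extensions/v1beta1` and `networking.k8s.io/v1beta1` was removed in Kubernetes 1.22, yet pre-1.22 YAML using it is abundant in training corpora. The current form (resource names are illustrative) is:

```yaml
# Removed in 1.22: apiVersion: extensions/v1beta1 or networking.k8s.io/v1beta1
apiVersion: networking.k8s.io/v1   # GA since Kubernetes 1.19
kind: Ingress
metadata:
  name: web
spec:
  rules:
    - host: example.com
      http:
        paths:
          - path: /
            pathType: Prefix       # required field in the v1 API
            backend:
              service:             # nested service/port replaces serviceName/servicePort
                name: web
                port:
                  number: 80
```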
Here's how KubeShark compares to other public Kubernetes-focused agent skills found during the May 2026 review. This compares Kubernetes skill behavior, not full runtime observability products or broad skill collections.
| Dimension | KubeShark | Kubernetes Resource Management | Kube Audit Kit | Kubernetes Operations |
| --- | --- | --- | --- | --- |
| Primary goal | Generate, review, refactor, and harden K8s safely | Explain and template core objects | Read-only cluster security audits | Kubectl operations and debugging |
| Failure-mode workflow | Yes: six explicit Kubernetes failure modes | No | Audit workflow only | Partial operational workflow |
| Manifest generation/review | Yes | Yes | No, audit-focused | Yes |
| Helm/Kustomize guidance | Dedicated references | No dedicated coverage | No | Not primary |
| Platform-specific guidance | EKS, GKE, AKS, OpenShift via CRR | No | No | |