Claude Skill

Audit Agents/Skills/Commands (Advanced Skill)

Audit Claude Code agents, skills, and commands for quality and production readiness. Use when evaluating skill quality, checking production readiness scores, or comparing agents against best-practice templates.

Reviewed community sourceInstallable4 sections3 related pages

Editor's Note

Audit Claude Code agents, skills, and commands for quality and production readiness. Use when evaluating skill quality, checking production readiness scores, or comparing agents against best-practice templates. Covers purpose, modes, methodology.

Editorial Guide

What to do with this skill

Start with the workflow below, then drop into the upstream source only after the page has narrowed the job for you.

What this skill does

Audit Claude Code agents, skills, and commands for quality and production readiness.

When to use it

evaluating skill quality, checking production readiness scores, or comparing agents against best-practice templates.

Install and setup notes

  • Open the upstream source before treating this page as install-ready, because not every official record is meant to be dropped into a workflow unchanged.
  • Keep the context narrow. These skills are usually strongest when you load only the branch, reference set, or workflow step that matches the current task.
  • If you plan to standardize on this skill for team use, pin the upstream repo and check for updates periodically instead of assuming the official defaults are static.

Example workflow

  1. Start with a concrete task that clearly matches this skill's intended trigger: evaluating skill quality, checking production readiness scores, or comparing agents against best-practice templates.
  2. Read the overview and first source section, then choose the smallest branch of guidance or references that solves the task in front of you.
  3. Run the change on a real file, command, or workflow, verify the result, and only then widen the skill into a repeatable team pattern.

Compatible agents

This skill is explicitly marked for Claude Code.

Claude Code

Install source

This page does not expose a single copy-paste install command in the normalized record. Use the upstream install source below to confirm the exact steps, file paths, and current setup expectations before you add it to your stack.

Page Outline

PurposeModesMethodologyWorkflow

Source Content

Normalized top-level metadata comes from the directory layer. The body below is the upstream source content for this item.

Audit Agents/Skills/Commands (Advanced Skill)

Comprehensive quality audit system for Claude Code agents, skills, and commands. Provides quantitative scoring, comparative analysis, and production readiness grading based on industry best practices.

Purpose

**Problem**: Manual validation of agents/skills is error-prone and inconsistent. According to the LangChain Agent Report 2026, 29.5% of organizations deploy agents without systematic evaluation, leading to "agent bugs" as the top challenge (18% of teams).

**Solution**: Automated quality scoring across 16 weighted criteria with production readiness thresholds (80% = Grade B minimum for production deployment).

**Key Features**:

  • Quantitative scoring (32 points for agents/skills, 20 for commands)
  • Weighted criteria (Identity 3x, Prompt 2x, Validation 1x, Design 2x)
  • Production readiness grading (A-F scale with 80% threshold)
  • Comparative analysis vs reference templates
  • JSON/Markdown dual output for programmatic integration
  • Fix suggestions for failing criteria

---

Modes

| Mode | Usage | Output | |------|-------|--------| | **Quick Audit** | Top-5 critical criteria only | Fast pass/fail (3-5 min for 20 files) | | **Full Audit** | All 16 criteria per file | Detailed scores + recommendations (10-15 min) | | **Comparative** | Full + benchmark vs templates | Analysis + gap identification (15-20 min) |

**Default**: Full Audit (recommended for first run)

---

Methodology

Why These Criteria?

The 16-criteria framework is derived from:

  • **Claude Code Best Practices** (Ultimate Guide line 4921: Agent Validation Checklist)
  • **Industry Data** (LangChain Agent Report 2026: evaluation gaps)
  • **Production Failures** (Community feedback on hardcoded paths, missing error handling)
  • **Composition Patterns** (Skills should reference other skills, agents should be modular)

Scoring Philosophy

**Weight Rationale**:

  • **Identity (3x)**: If users can't find/invoke the agent, quality is irrelevant (discoverability > quality)
  • **Prompt (2x)**: Determines reliability and accuracy of outputs
  • **Validation (1x)**: Improves robustness but is secondary to core functionality
  • **Design (2x)**: Impacts long-term maintainability and scalability

**Grade Standards**:

  • **A (90-100%)**: Production-ready, minimal risk
  • **B (80-89%)**: Good, meets production threshold
  • **C (70-79%)**: Needs improvement before production
  • **D (60-69%)**: Significant gaps, not production-ready
  • **F (<60%)**: Critical issues, requires major refactoring

**Industry Alignment**: The 80% threshold aligns with software engineering best practices for production deployment (e.g., code coverage >80%, security scan pass rates).

---

Workflow

Phase 1: Discovery

  • **Scan directories**:
   .claude/agents/
   .claude/skills/
   .claude/commands/
   examples/agents/      (if exists)
   examples/skills/      (if exists)
   examples/commands/    (if exists)
  • **Classify files** by type (agent/skill/command)
  • **Load reference templates** (for Comparative mode):
   guide/examples/agents/     (benchmark files)
   guide/examples/skills/     (benchmark files)
   guide/examples/commands/   (benchmark files)

Phase 2: Scoring Engine

Load scoring criteria from `scoring/criteria.yaml`:

agents:
  max_points: 32
  categories:
    identity:
      weight: 3
      criteria:
        - id: A1.1
          name: "Clear name"
          points: 3
          detection: "frontmatter.name exists and is descriptive"
        # ... (16

<!-- truncated -->

Recommended skills

Next places to browse

Sponsored
MoltAwards: Turn AI agents loose on government contracts & jobs! logo

Turn AI agents loose on government contracts

Learn more