Featured

MoltAwards - Agent internet for government contracts + jobs.

ScaleYour.email: Fill your calendar with sales calls

Get featured free for a week

1 tool / week

Apply free →

Limited-time offer

Deploy your own AI agent

Affiliate

Launch on Hostinger →

Free spotlight

Get your AI tool in front of 30k+ builders — free for a week

Free spotlight

Apply free →

Claude Skill

Audit Agents/Skills/Commands (Advanced Skill)

Audit Claude Code agents, skills, and commands for quality and production readiness. Use when evaluating skill quality, checking production readiness scores, or comparing agents against best-practice templates.

Reviewed community sourceInstallable4 sections3 related pages

Browse full directory More automation View source

Editor's Note

Editorial Guide

What to do with this skill

Start with the workflow below, then drop into the upstream source only after the page has narrowed the job for you.

What this skill does

Audit Claude Code agents, skills, and commands for quality and production readiness.

When to use it

evaluating skill quality, checking production readiness scores, or comparing agents against best-practice templates.

Install and setup notes

Open the upstream source before treating this page as install-ready, because not every official record is meant to be dropped into a workflow unchanged.
Keep the context narrow. These skills are usually strongest when you load only the branch, reference set, or workflow step that matches the current task.
If you plan to standardize on this skill for team use, pin the upstream repo and check for updates periodically instead of assuming the official defaults are static.

Example workflow

Start with a concrete task that clearly matches this skill's intended trigger: evaluating skill quality, checking production readiness scores, or comparing agents against best-practice templates.
Read the overview and first source section, then choose the smallest branch of guidance or references that solves the task in front of you.
Run the change on a real file, command, or workflow, verify the result, and only then widen the skill into a repeatable team pattern.

Compatible agents

This skill is explicitly marked for Claude Code.

Claude Code

Where to go next

AI Agent Skills Directory

Compare ecosystems, use-case hubs, and the broader skills catalog from one search-first page.

Claude Code Skills

Browse the wider Claude Code directory if this skill is close but not quite the right fit.

Best Claude Code Skills

Use the Claude Code hub to compare the strongest installable skills before you standardize on a workflow.

Install source

This page does not expose a single copy-paste install command in the normalized record. Use the upstream install source below to confirm the exact steps, file paths, and current setup expectations before you add it to your stack.

Open install source

Page Outline

PurposeModesMethodologyWorkflow

Source Content

Normalized top-level metadata comes from the directory layer. The body below is the upstream source content for this item.

Audit Agents/Skills/Commands (Advanced Skill)

Comprehensive quality audit system for Claude Code agents, skills, and commands. Provides quantitative scoring, comparative analysis, and production readiness grading based on industry best practices.

Purpose

**Problem**: Manual validation of agents/skills is error-prone and inconsistent. According to the LangChain Agent Report 2026, 29.5% of organizations deploy agents without systematic evaluation, leading to "agent bugs" as the top challenge (18% of teams).

**Solution**: Automated quality scoring across 16 weighted criteria with production readiness thresholds (80% = Grade B minimum for production deployment).

**Key Features**:

Quantitative scoring (32 points for agents/skills, 20 for commands)
Weighted criteria (Identity 3x, Prompt 2x, Validation 1x, Design 2x)
Production readiness grading (A-F scale with 80% threshold)
Comparative analysis vs reference templates
JSON/Markdown dual output for programmatic integration
Fix suggestions for failing criteria

---

Modes

| Mode | Usage | Output | |------|-------|--------| | **Quick Audit** | Top-5 critical criteria only | Fast pass/fail (3-5 min for 20 files) | | **Full Audit** | All 16 criteria per file | Detailed scores + recommendations (10-15 min) | | **Comparative** | Full + benchmark vs templates | Analysis + gap identification (15-20 min) |

**Default**: Full Audit (recommended for first run)

---

Methodology

Why These Criteria?

The 16-criteria framework is derived from:

**Claude Code Best Practices** (Ultimate Guide line 4921: Agent Validation Checklist)
**Industry Data** (LangChain Agent Report 2026: evaluation gaps)
**Production Failures** (Community feedback on hardcoded paths, missing error handling)
**Composition Patterns** (Skills should reference other skills, agents should be modular)

Scoring Philosophy

**Weight Rationale**:

**Identity (3x)**: If users can't find/invoke the agent, quality is irrelevant (discoverability > quality)
**Prompt (2x)**: Determines reliability and accuracy of outputs
**Validation (1x)**: Improves robustness but is secondary to core functionality
**Design (2x)**: Impacts long-term maintainability and scalability

**Grade Standards**:

**A (90-100%)**: Production-ready, minimal risk
**B (80-89%)**: Good, meets production threshold
**C (70-79%)**: Needs improvement before production
**D (60-69%)**: Significant gaps, not production-ready
**F (<60%)**: Critical issues, requires major refactoring

**Industry Alignment**: The 80% threshold aligns with software engineering best practices for production deployment (e.g., code coverage >80%, security scan pass rates).

---

Workflow

Phase 1: Discovery

**Scan directories**:

   .claude/agents/
   .claude/skills/
   .claude/commands/
   examples/agents/      (if exists)
   examples/skills/      (if exists)
   examples/commands/    (if exists)

**Classify files** by type (agent/skill/command)

**Load reference templates** (for Comparative mode):

   guide/examples/agents/     (benchmark files)
   guide/examples/skills/     (benchmark files)
   guide/examples/commands/   (benchmark files)

Phase 2: Scoring Engine

Load scoring criteria from `scoring/criteria.yaml`:

agents:
  max_points: 32
  categories:
    identity:
      weight: 3
      criteria:
        - id: A1.1
          name: "Clear name"
          points: 3
          detection: "frontmatter.name exists and is descriptive"
        # ... (16

<!-- truncated -->

Recommended skills

Claude Skill

Talk Stage 3: Concepts

Builds a numbered, categorized concept catalogue from the talk summary and timeline, scoring each concept HIGH / MEDIUM / LOW for talk potential with optional repo enrichment.

Claude Skill

CLAUDE.md Improver

Audit and improve CLAUDE.md files in repositories. Use when user asks to check, audit, update, improve, or fix CLAUDE.md files. Scans for all CLAUDE.md files, evaluates quality against templates, outputs quality report, then makes targeted updates.

Claude Skill

Cyber Defense Team Skill

Path to the log file to analyze (or paste log content directly)

MoltAwards - Agent internet for government contracts + jobs.

ScaleYour.email: Fill your calendar with sales calls

Get featured free for a week

Deploy your own AI agent

Get your AI tool in front of 30k+ builders — free for a week

Audit Agents/Skills/Commands (Advanced Skill)

What to do with this skill

What this skill does

When to use it

Install and setup notes

Example workflow

Compatible agents

Where to go next

AI Agent Skills Directory

Claude Code Skills

Best Claude Code Skills

Install source

Page Outline

Source Content

Audit Agents/Skills/Commands (Advanced Skill)

Purpose

Modes

Methodology

Why These Criteria?

Scoring Philosophy

Workflow

Phase 1: Discovery

Phase 2: Scoring Engine

Recommended skills

Talk Stage 3: Concepts

CLAUDE.md Improver

Cyber Defense Team Skill

Next places to browse

Browse full Claude Code directory

More automation pages

See workflow marketplace