Agent Hooks
Shell hooks let a user run their own script at fixed points in the agent's lifecycle — to block a dangerous action, rewrite an input or an outbound message, inject context into the model, or warn the user. The script can be written in any language; it talks to the agent over a simple JSON-on-stdin, JSON-on-stdout protocol.
Tools: read_file, write_file, bash
When to use
Reach for hooks when the user wants the agent to automatically enforce a rule or react to an event without being asked each time. Examples:
- "Stop me from running
rm -rf/ destructive bash" →pre_tool_callblock - "Never let a private key get pushed to Telegram" →
on_outbound_messageblock - "Log every tool call for audit" →
post_tool_callobserve - "Remind the agent of X at the start of every model call" →
pre_llm_callcontext - "Don't let the agent claim it published when it didn't" →
on_completion_claim(in/goal) oron_stop(in normal chat) - "If the answer fails my quality check, make the agent redo it" →
on_stopblock
If the user just wants a one-off check, that's not a hook — hooks are for recurring, automatic lifecycle enforcement.
How configuration works (the agent does it end-to-end)
The agent installs and activates a hook with zero user copy-paste. Write the script, write the config entry, then call the loopback self-approve API — it flips the master switch on, approves the script for every event it's wired to, and hot-mounts it live (no restart). The user just tests it afterward.
curl -s -X POST http://localhost:8000/internal/runtime/hooks/approve \
-H 'Content-Type: application/json' \
-d '{"command": "/data/workspace/hooks/security_guard.py"}'
# -> {"ok": true, "events": [...], "mounted": N, "master_enabled": true}
⚠️ ALWAYS use an absolute path under
/data/workspacein both the yamlcommand:and this call. Never a relative path likeskills/agent-hooks/templates/security_guard.py: the bridge spawns the script with the server cwd (/app), so a relative path resolves to/app/skills/…— an empty dir — and every spawn fails. Because the bridge fails OPEN (a script it can't run = "continue"), the guard then silently protects nothing while/hooks liststill shows it "mounted". To avoid this, the standard install copies the template into/data/workspace/hooks/and points the yaml there (see the workflow below)./app/skillsis NOT the skills dir — the real one is/data/workspace/skills/(a.k.a./app/workspace/skills/via symlink).
commandMUST be the exactcommand:string from theshell_hooks.yaml
entry (the absolute script path). The hook MUST already be declared in the yaml — approval flips a declared hook to live, it can't conjure one out of thin air.
- The endpoint is loopback-only (same-uid-in-container trust boundary, same
as .env reads). It auto-enables the master switch (enable_master defaults true), so the hook fires immediately.
- Approval records the script's mtime, so a later edit surfaces as drift in
/hooks list / /hooks doctor — a swap-the-script change stays visible.
This is the whole activation story for the user — there isn't a second step. After the call returns {"ok": true}, tell them it's live and to test it. Do not mention /hooks approve, /hooks on, or "two gates" — those are internal.
Fallback (older builds only): if the curl returns 404, this runtime predates the self-approve API — only then fall back to asking the user to paste /hooks approve <command> then /hooks on.
The two gates (both handled by the self-approve API — you don't surface them)
Internally a hook fires only when BOTH hold; the self-approve call above flips both in one shot, so the user never sees or types either:
- Master switch ON —
shell_hooks.enabled: truein
workspace/config/agent.yaml. The API auto-enables it (enable_master defaults true).
- Per-hook approval — the
(event, command)pair is recorded in the
allowlist with the script's mtime (so a later edit shows as "changed since approval" drift — a swapped script stays visible). The API approves every event the command is wired to.
These exist as a security boundary, not as a user step. Do NOT mention "approve" or "two gates" when explaining hooks to a user — just say you'll set it up and they can test it. (Manual /hooks on + /hooks approve exist only as the 404 fallback for older runtimes.)
The /hooks command
Plain text on web / Telegram / WeChat (no LLM, no cost):
| Command | What it does | |
|---|---|---|
/hooks or /hooks list | master switch state, config path, every hook + approval/health | |
/hooks on \ | /hooks off | flip the master switch (hot mount/unmount, no restart) |
/hooks doctor | run each approved hook against a synthetic payload, check JSON | |
/hooks approve <event> <command> | approve + activate live (no restart) | |
/hooks revoke <command> | revoke + detach live (no restart) | |
/hooks help | usage |
Events (12) and what each can do
| Event | Fires | Capability | stdin gives the script |
|---|---|---|---|
on_user_message | a user message arrives, before the model sees it | block / rewrite text | message, channel |
pre_tool_call | before a tool runs | block / rewrite input | tool_name, tool_input |
post_tool_call | after a tool runs | observe (log/metrics) | tool_name, tool_result |
transform_tool_result | result before agent sees it | append a note | tool_name, tool_result |
pre_llm_call | before a model call | inject context | system, last_user_message, model |
post_llm_call | after a model reply | observe / swap | model |
on_response_end | final reply assembled, once per turn | rewrite reply | response, model, tokens, tool_names |
on_stop | turn boundary, after on_response_end | block → force a redo | response, tool_names, stop_hook_active |
on_outbound_message | before a TG/WeChat push | block / rewrite outbound | notification, type |
on_completion_claim | agent claims a /goal done | block → force a redo | goal, summary, response, tool_names |
on_session_start | session begins | observe | status |
on_session_end | session ends | observe / cleanup | status |
Every payload also includes event, session_id, agent_id, cwd.
The event field is the dispatch key. It names which lifecycle moment is firing (pre_tool_call, on_user_message, …). A multi-event script (like security_guard.py, one file wired to five events) reads event to decide which branch to run — no event in the payload means no branch matches, so the script falls through to "continue" (empty output = allow). The runtime always sets it; you only have to remember it when hand-crafting a test payload (see the dry-run step below). It is NOT something you put in shell_hooks.yaml — there the event: key tells the bus when to call you; the event field in the payload is the bus telling the script which moment it is.
The three "make the agent fix it" levers (don't mix them up)
These three fire near the end of a turn but have very different power — pick by what you need to happen when something's wrong:
| Event | Power | Use when |
|---|---|---|
on_response_end | rewrite only — edit the stored/forwarded reply (footer, redaction, mask). Cannot make the agent redo. Zero loop risk. | You only need to change the text (mask a leaked key, add a cost footer). |
on_stop | block → redo, in normal chat — steers your reason back as the next instruction and the agent keeps working. Kernel-capped (≤3 redos/turn) + stop_hook_active flag, so it can't trap a turn. | You need the agent to actually fix/verify its own output in ordinary conversation (quality gate, citation/publish check). Claude Code "Stop" hook parity. |
on_completion_claim | block → redo, in /goal only — refuses a fabricated "done" and keeps the goal loop running. | Same redo power, but it only fires inside a running /goal supervisor loop. |
Rule of thumb: mask → on_response_end; redo in chat → on_stop; redo in a goal → on_completion_claim. Note on_response_end can only rewrite the stored copy — tokens already streamed to a live web client can't be unsent, so prefer on_stop when you need the user to actually see a corrected answer.
Output protocol (what the script prints on stdout)
JSON object, or empty for "continue". Fields:
{"decision": "block", "reason": "..."} // deny the action / refuse completion
{"tool_input": {...}} // pre_tool_call: rewrite EXISTING input keys
{"notification": "..."} // on_outbound_message: rewrite the message
{"context": "..."} // pre_llm_call: inject into prompt (AGENT-facing)
{"systemMessage": "..."} // allow, but show the USER a note
{"add_warning": "..."} // same user-facing note channel
<empty> // continue, no change
context is agent-facing (goes into the prompt, pre_llm_call only). systemMessage / add_warning is user-facing (shown to the human on the tool-result / completion / outbound surfaces) — never injected into the prompt.
Safety: scripts run with shell=False + argv split (no shell injection) and a per-hook timeout. A script that errors, times out, or prints non-JSON falls through to continue — a broken hook can never break the agent.
Writing a readable reason
The reason is shown to the user (on the blocked-action card) and to the model. Write a plain sentence a person can read — say what you blocked and why, not just a raw command. The UI splits one reason string into two parts for you, so you don't parse anything client-side:
- Explanation + command box — put the human sentence first, then
:, then
the offending command/payload. Everything after the first ": " is rendered in a separate monospace box. The split only triggers when that tail looks like a payload (has a space or is longer than ~12 chars), so an ordinary sentence that happens to contain a colon is left intact.
- Sentence only — a reason with no
": "shows as a single sentence and no
command box. That's the right shape when there's nothing to quote (e.g. a pasted seed phrase).
[tag]is stripped — a leading tag like[security]is removed before
display and the sentence is auto-capitalised, so you can keep a tag for your own grep without it leaking into the UI.
// Good — readable sentence + a clean command box:
{"decision": "block",
"reason": "[security] This command is irreversible and would erase the disk: mkfs.ext4 /dev/sda1"}
// -> "This command is irreversible and would erase the disk" + [ mkfs.ext4 /dev/sda1 ]
// Avoid — a bare command or lone tag as the whole reason:
{"decision": "block", "reason": "mkfs /dev/sda1"} // user sees no WHY
A hook that doesn't follow this still works — a plain string just renders as one sentence. The convention only unlocks the nicer "explanation + command" layout.
Config file format
workspace/config/shell_hooks.yaml:
hooks:
- event: pre_tool_call
matcher: "rm -rf|dd if=|mkfs" # optional regex; script only spawns on a match (perf gate)
command: ./extensions/shell_hooks/examples/block_secrets.py
timeout: 10 # seconds, default 20, max 120
Two hook transports
A hook is either a local command (default) or an HTTP endpoint — same payload in, same decision JSON out, only the transport differs.
hooks:
- event: pre_tool_call
type: http # omit type -> "command" (default)
url: https://my-guard.example.com/hook
timeout: 10
HTTP specifics:
- SSRF guard — the URL must be http(s) and must NOT resolve to a loopback /
private / link-local (incl. cloud metadata 169.254.169.254) / reserved address (blocked at parse AND call time). Set STARCHILD_SHELL_HOOKS_HTTP_ALLOW_LOCAL=1 only to intentionally hit a local service.
- Approval keys on the URL:
/hooks approve <event> <url>;/hooks list
shows it as POST <url> and skips the executable/mtime checks.
Adding an LLM judgement (call the proxy, NOT /chat)
When a hook needs real reasoning ("does this leak a secret?", "is this completion actually done?"), call an LLM directly through the proxy from your script — never the agent's own /chat.
from core.http_client import proxied_post
import json, sys
event = json.load(sys.stdin)
r = proxied_post(
"https://openrouter.ai/api/v1/chat/completions",
json={
"model": "minimax/minimax-m3", # cheap default (~$0.0002/call)
"messages": [
{"role": "system", "content":
'You are a guard. Output ONLY JSON {"decision":"block|allow","reason":"..."}.'},
{"role": "user", "content": json.dumps(event)},
],
"temperature": 0, "max_tokens": 200,
},
headers={"SC-CALLER-ID": "chat:hook"}, # required for billing
timeout=40,
)
try:
print(json.dumps(json.loads(r.json()["choices"][0]["message"]["content"])))
except Exception:
print("{}") # fail-open on any parse error
Why proxy-direct: OpenRouter is an external stateless API, so it does not re-enter the agent loop or fire pre_llm_call -> no recursion, one cheap completion instead of a full agent turn, your own prompt + pure-JSON response. Calling /chat from a hook re-emits the same event (the bridge guards against the loop, but it's needless overhead) — and an LLM hook that calls /chat must never sit on pre_llm_call. See the host docs sc-proxy.md section "Calling an LLM through the proxy".
Standard workflow (the agent's checklist)
- Clarify the rule and pick the event from the table above.
- Write the script — read JSON on stdin, print a decision on stdout.
Exit non-zero / non-JSON = continue. Make it executable (chmod +x).
- Put the script at a stable ABSOLUTE path under
/data/workspace. For a
shipped template, copy it out of the skill dir so a skill update can't move it:
mkdir -p /data/workspace/hooks
cp /data/workspace/skills/agent-hooks/templates/security_guard.py /data/workspace/hooks/
chmod +x /data/workspace/hooks/security_guard.py
ls -l /data/workspace/hooks/security_guard.py # VERIFY it exists before going on
Never reference the script by a relative path (see the ⚠️ box above — it resolves against /app and silently fails open).
- Add a config entry in
workspace/config/shell_hooks.yamlwith the
absolute command: (/data/workspace/hooks/security_guard.py); add a matcher regex when possible so the script only spawns when relevant.
- Dry-run it yourself with
bash— pipe a sample JSON payload into the
script and confirm it prints valid JSON. The payload MUST include the event field — a multi-event script dispatches on it, so leaving it out makes every case fall through to "continue" and you'll wrongly conclude the guard doesn't fire. Test each event the script handles:
# should BLOCK (note the "event" key):
echo '{"event":"pre_tool_call","tool_name":"bash","tool_input":{"command":"rm -rf /"}}' \
| python3 /data/workspace/hooks/security_guard.py
# should ALLOW (empty output):
echo '{"event":"pre_tool_call","tool_name":"bash","tool_input":{"command":"ls -la"}}' \
| python3 /data/workspace/hooks/security_guard.py
- Activate it yourself via the loopback self-approve API (no user paste);
command = the exact absolute path from your yaml entry:
curl -s -X POST http://localhost:8000/internal/runtime/hooks/approve \
-H 'Content-Type: application/json' \
-d '{"command": "/data/workspace/hooks/security_guard.py"}'
On {"ok": true} the hook is live. On 404, fall back to handing the user /hooks approve <command> + /hooks on (see "How configuration works").
- Run
/hooks doctorto confirm it actually works — this is the step that
catches a wrong path / non-executable / non-JSON script. A guard that shows "mounted" in /hooks list but errors in doctor is a silent no-op (fails open). Only after doctor is clean tell the user it's live and ready to test.
Ready-made scripts (each has ONE clear job)
Two production-grade, multi-event guards ship in this skill under templates/ (copy + approve as-is). Four single-purpose examples ship with the host under extensions/shell_hooks/examples/ (copy + adapt). No two overlap — pick by the job, not by trial.
Production templates (in this skill, templates/)
| Template | Events | Its one job |
|---|---|---|
security_guard.py | on_user_message, pre_tool_call, transform_tool_result, on_response_end, on_outbound_message | Secrets + destructive bash. Block pasted/exfiltrated secrets (API keys incl. Bearer, PEM/EVM private keys, BIP-39 seeds, Solana byte-array & base58 WIF), mask leaked keys in replies/pushes, block irreversible-data-loss bash. See below. |
verify_publish_claims.py | on_stop (chat redo) / on_completion_claim (/goal redo) / on_response_end (rewrite fallback) | Anti-hallucination. Catch fabricated "published / posted to AgentX / scheduled" claims by checking the reply against ground truth (previews registry, AgentX ledger, scheduler registry). |
Single-purpose examples (host repo, extensions/shell_hooks/examples/)
| Script | Event | Its one job |
|---|---|---|
pii_redactor.py | transform_tool_result, on_response_end | Mask emails / phones (PII — distinct from secrets). |
tool_audit_log.py | post_tool_call | Observe-only: append every tool call to a JSONL audit trail. |
budget_alert.py | on_response_end | Append a soft warning when a turn's cost crosses a threshold. |
inject_website_reminder.sh | pre_llm_call | Preventive nudge: remind the model to actually publish before claiming done (pairs with verify_publish_claims.py). |
Superseded by the templates (don't ship a second, conflicting guard)
The host repo also ships some minimal single-event examples under extensions/shell_hooks/examples/ that overlap the two templates above. They're fine as learning references, but for real use prefer the template — running both just creates two guards with possibly different policies.
| Minimal example | Use this instead | Why |
|---|---|---|
block_secrets.py | security_guard.py | the guard's secret detection is a strict superset (adds Bearer, Solana byte-array, base58 WIF, destructive-bash, masking) |
check_publish.sh | verify_publish_claims.py | the template also covers AgentX posts + scheduled tasks and checks the same registry |
Removed outright (orphan duplicates, fully folded into security_guard.py): secret_guard.py (vendor-key block/mask, incl. Bearer) and dangerous_bash_guard.py (destructive-bash block). Want to also block installers / force-push? Tune the guard's DESTRUCTIVE table rather than running a second bash guard with a conflicting policy.
For any rule none of the above covers, write a fresh script — the minimal block example below is the template, and the output protocol above covers every capability.
Minimal block example (pre_tool_call, any language)
#!/usr/bin/env bash
payload="$(cat)"
python3 - "$payload" <<'PY'
import json, sys, re
ev = json.loads(sys.argv[1])
cmd = (ev.get("tool_input") or {}).get("command", "")
if re.search(r"rm\s+-rf\s+/|dd\s+if=|mkfs", cmd):
print(json.dumps({"decision": "block", "reason": f"This command is irreversible and would erase data, so I've blocked it: {cmd}"}))
else:
print("{}") # continue
PY
All-in-one security guard (templates/security_guard.py)
A ready-to-use, self-contained script that wires one file to five events and covers the common "don't leak secrets / don't nuke the box" baseline. First copy it to a stable absolute path, then wire all five events to that same absolute command: in config/shell_hooks.yaml (one block per event), then activate:
cp /data/workspace/skills/agent-hooks/templates/security_guard.py /data/workspace/hooks/
chmod +x /data/workspace/hooks/security_guard.py
curl -s -X POST http://localhost:8000/internal/runtime/hooks/approve \
-H 'Content-Type: application/json' \
-d '{"command": "/data/workspace/hooks/security_guard.py"}'
This approves every event that command is wired to and mounts it live — no user paste. (On 404, fall back to /hooks approve <command> + /hooks on.)
| Event | What it does | ||
|---|---|---|---|
on_user_message | block a pasted API key (incl. Bearer token), private key (PEM / EVM hex), seed phrase, Solana byte-array secret, or base58 WIF before the model sees it | ||
pre_tool_call (bash) | block only irreversible data loss (rm -rf /, dd to a block device, mkfs, fork bomb, chmod -R 777, git reset --hard origin/*) and credential exfiltration (cat .env | curl, scp id_rsa, printenv | curl`) |
pre_tool_call (message tools) | guard send_to_telegram / send_to_wechat args — mask a leaked key, block a seed phrase (these tools bypass the push pipeline, so this is the real outbound gate for them) | ||
transform_tool_result | warn when a tool's OUTPUT contains a secret (backend can only flag, not rewrite, result text) | ||
on_response_end | mask any secret that leaked into the final reply | ||
on_outbound_message | mask / block secrets before they're pushed to TG / WeChat |
Design policy: block only what is both very dangerous and not part of normal work. Common dev actions like curl | bash (installers) and git push --force (rebasing your own feature branch) are intentionally allowed — over-blocking trains users to disable the guard.
Tune the SECRET_PATTERNS, DESTRUCTIVE, and MSG_TOOLS tables at the top of the file for your own rules. templates/security_guard_selftest.py is the self-test (run it after any edit; dangerous strings live there as data only, so the host bash guard can't trip on them).
Anti-hallucination guard (templates/verify_publish_claims.py)
Catches a fabricated success: the agent writes "Published! community.iamstarchild.com/…", "Posted to AgentX /post/…", or "Reminder scheduled" when it never ran the tool. The script checks the reply against ground truth — the previews registry (/data/previews.json), the AgentX post ledger, and the scheduler registry — and either rewrites the reply or forces a redo. It is deliberately low-false-positive: a real published URL or an "offer to publish" (future tense) passes untouched; only a past-tense success claim with no backing trips it.
cp /data/workspace/skills/agent-hooks/templates/verify_publish_claims.py /data/workspace/hooks/
chmod +x /data/workspace/hooks/verify_publish_claims.py
curl -s -X POST http://localhost:8000/internal/runtime/hooks/approve \
-H 'Content-Type: application/json' \
-d '{"command": "/data/workspace/hooks/verify_publish_claims.py"}'
| Event | What it does |
|---|---|
on_stop | (preferred) in ordinary chat, block a fabricated success and force the agent to actually publish/redo (loop-capped) |
on_completion_claim | in a /goal loop, block a fabricated "done" and force a real publish (loop-capped) |
on_response_end | rewrite-only fallback when on_stop isn't wired: append an honest "unverified" note (cannot make the agent redo) |
Wire it on
on_stopfor normal chat — that's the only event that makes the agent actually redo a turn instead of just editing the text. The host honors only adecision: blockonon_stop/on_completion_claim(a rewrite is ignored on those events), so the hook blocks on both and only rewrites onon_response_end.templates/verify_publish_claims_selftest.pyis the self-test (covers theon_stopblock path + the loop cap).
Claude Code compatibility
Hook scripts written for Claude Code work unchanged — their output is auto-translated into the fields above:
| Claude Code output | Translated to |
|---|---|
hookSpecificOutput.permissionDecision: "deny" (+ permissionDecisionReason) | decision: block (+ reason) |
hookSpecificOutput.additionalContext | context |
hookSpecificOutput.updatedInput | tool_input (rewrite) |
continue: false (+ stopReason) | decision: block (+ reason) |
systemMessage | add_warning (user-facing note) |
suppressOutput | no-op (our stdout never enters the transcript) |
| exit code 2 with stderr, no stdout | decision: block, stderr is the reason |
Only the output payload is translated — event NAMES stay ours (pre_tool_call, not PreToolUse). Claude Code's Stop hook maps to our on_stop (block → force a redo); its UserPromptSubmit maps to on_user_message. A Stop script that returns {"decision":"block","reason":…} or exits 2 works unchanged once wired to on_stop.
Troubleshooting "my hook never fires"
- Is the event one of the 12 above? (a typo is a silent no-op)
- Is the master switch on?
/hooks listshows it. The self-approve API
enables it automatically; otherwise /hooks on.
- Is the hook approved?
✗ NOT approvedin/hooks list→ re-run the
self-approve API for that command (or /hooks approve as fallback).
- Does the
matcherregex actually match? Too narrow = never spawns. - Run
/hooks doctor— it flags non-executable / tampered / timed-out / non-JSON. - Manual test "allows everything"? Your test payload is probably missing the
event field — a multi-event script dispatches on it and falls through to "continue" without it. This is a test-harness mistake, not a hook bug; the real runtime always sets event.
can't open file '/app/skills/…'/ "mounted" but nothing is blocked? The
command: is a relative or wrong path. The bridge spawns with the server cwd (/app), so skills/… resolves to the empty /app/skills and every spawn fails — and because the bridge fails OPEN, the guard silently protects nothing. Fix: use the ABSOLUTE path /data/workspace/hooks/<script>.py in both the yaml and the approve call, then re-approve and confirm with /hooks doctor. (/app/skills is not the skills dir; the real one is /data/workspace/skills/.)
Deep reference
Full protocol, security model, and per-event payload detail live in the agent's own docs: config/context/references/agent-hooks.md (read it for edge cases this skill summarizes).
