clawrouter

BlockRunAI/ClawRouter

Otheropenclawby BlockRunAI

Summary

OpenClaw plugin exposing 1 skill.

Install to Claude Code

openclaw plugin add BlockRunAI/ClawRouter

Run in Claude Code. Add the marketplace first with /plugin marketplace add BlockRunAI/ClawRouter if you haven't already.

README.md

<div align="center">

<img src="assets/banner.png" alt="ClawRouter Banner" width="600">

<h1>The LLM router built for autonomous agents</h1>

<p>Agents can't sign up for accounts. Agents can't enter credit cards.<br> Agents can only sign transactions.<br><br> <strong>ClawRouter is the only LLM router that lets agents operate independently.</strong><br><br> <em>8 models free, no crypto required. No signup. No API key. No credit card.</em></p>

<br>

<img src="https://img.shields.io/badge/๐Ÿ†“_8_Free_Models-success?style=for-the-badge" alt="8 free models">&nbsp; <img src="https://img.shields.io/badge/๐Ÿค–_Agent--Native-black?style=for-the-badge" alt="Agent native">&nbsp; <img src="https://img.shields.io/badge/๐Ÿ”‘_Zero_API_Keys-blue?style=for-the-badge" alt="No API keys">&nbsp; <img src="https://img.shields.io/badge/โšก_Local_Routing-yellow?style=for-the-badge" alt="Local routing">&nbsp; <img src="https://img.shields.io/badge/๐Ÿ’ฐ_x402_USDC-purple?style=for-the-badge" alt="x402 USDC">&nbsp; <img src="https://img.shields.io/badge/๐Ÿ”“_Open_Source-green?style=for-the-badge" alt="Open source">

![npm version](https://npmjs.com/package/@blockrun/clawrouter) ![npm downloads](https://npmjs.com/package/@blockrun/clawrouter) ![GitHub stars](https://github.com/BlockRunAI/ClawRouter) ![CI](https://github.com/BlockRunAI/ClawRouter/actions) ![TypeScript](https://typescriptlang.org) ![License: MIT](LICENSE)

![USDC Hackathon Winner](https://x.com/USDC/status/2021625822294216977) ![x402 Protocol](https://x402.org) ![Base Network](https://base.org) ![Solana](https://solana.com) ![OpenClaw Plugin](https://openclaw.ai) ![Telegram](https://t.me/blockrunAI)

</div>

> ClawRouter is an open-source smart LLM router that reduces AI API costs by up to 92%. It analyzes each request across 15 dimensions and routes to the cheapest capable model in under 1ms, entirely locally. ClawRouter is the only LLM router built for autonomous AI agents โ€” it uses wallet signatures for authentication (no API keys) and USDC micropayments via the x402 protocol (no credit cards). 55+ models from OpenAI, Anthropic, Google, xAI, DeepSeek, and more. MIT licensed.

---

Why ClawRouter exists

Every other LLM router was built for human developers โ€” create an account, get an API key, pick a model from a dashboard, pay with a credit card.

Agents can't do any of that.

ClawRouter is built for the agent-first world:

  • Starts at $0 โ€” 8 NVIDIA models are free forever (incl. 675B Mistral Large 3, Qwen3.5 122B + a vision-capable Nemotron Omni)
  • No accounts โ€” a wallet is generated locally, no signup
  • No API keys โ€” your wallet signature IS authentication
  • No model selection โ€” 15-dimension scoring picks the right model automatically
  • No credit cards โ€” agents pay per-request with USDC via x402
  • No trust required โ€” runs locally, <1ms routing, zero external dependencies

This is the stack that lets agents operate autonomously: x402 + USDC + local routing.

---

How it compares

| | OpenRouter | LiteLLM | Martian | Portkey | ClawRouter | | ---------------- | ----------------- | ---------------- | ----------------- | ----------------- | ----------------------- | | Models | 200+ | 100+ | Smart routing | Gateway | 55+ | | Free tier | Rate-limited | BYO keys | No | No | 8 models, no signup | | Routing | Manual selection | Manual selection | Smart (closed) | Observability | Smart (open source) | | Auth | Account + API key | Your API keys | Account + API key | Account + API key | Wallet signature | | Payment | Credit card | BYO keys | Credit card | $49-499/mo | USDC per-request | | Runs locally | No | Yes | No | No | Yes | | Open source | No | Yes | No | Partial | Yes | | Agent-ready | No | No | No | No | Yes |

โœ“ Open source ยท โœ“ Smart routing ยท โœ“ Runs locally ยท โœ“ Crypto native ยท โœ“ Agent ready

We're the only one that checks all five boxes.

---

Quick Start

> No wallet? 8 models work free out of the box. Install, run, and pin nvidia/gpt-oss-120b (or any of the 8) โ€” no crypto, no signup, no balance required. Add USDC later when you want paid models.

Option A โ€” OpenClaw Agent

OpenClaw is an AI coding agent. If you're using it, ClawRouter installs as a plugin. Two paths:

A1. Recommended โ€” one-shot install script:

curl -fsSL https://blockrun.ai/ClawRouter-update | bash
openclaw gateway restart

This handles everything: registration, models config, auth profile, wallet setup. Smart routing (blockrun/auto) is now your default model.

A2. If you prefer pure npm:

npm install -g @blockrun/clawrouter
clawrouter setup            # finishes OpenClaw integration โ€” REQUIRED
openclaw gateway restart

> โš ๏ธ Skipping clawrouter setup will leave you broken. Bare npm install -g only puts the package on disk; it does NOT register the plugin with OpenClaw, sync the models allowlist, or write the auth profile. Symptom: /models in your bot shows only ~7 entries (OpenClaw's hardcoded defaults) instead of the full ~38 BlockRun models. Run clawrouter setup to repair, or use path A1 to begin with.

Option B โ€” Standalone (continue.dev, Cursor, VS Code, any OpenAI-compatible client)

> Using Claude Code? Check out BRCC โ€” it's purpose-built for Claude Code with the same smart routing and x402 payments. > > Using NousResearch Hermes? See ClawRouter-Hermes โ€” a Python plugin that wires Hermes into the ClawRouter proxy. Same wallet, same 55+ models, same x402 USDC settlement on Base & Solana.

No OpenClaw required. ClawRouter runs as a local proxy on port 8402.

1. Start the proxy

npx @blockrun/clawrouter

2. Fund your wallet โ€” optional, skip for free tier Your wallet address is printed on first run. For paid models, send a few USDC on Base or Solana โ€” $5 covers thousands of requests. To stay at $0, pin any of the 8 free models (e.g. nvidia/gpt-oss-120b) or use /model free inside OpenClaw.

3. Point your client at http://localhost:8402

<details> <summary><strong>continue.dev</strong> โ€” <code>~/.continue/config.yaml</code></summary>

> Important: apiBase must end with /v1/ (including the trailing slash). Without it, continue.dev constructs the URL as /chat/completions instead of /v1/chat/completions, and the proxy returns 404.

models:
  - name: ClawRouter Auto
    provider: openai
    model: blockrun/auto
    apiBase: http://localhost:8402/v1/
    apiKey: x402
    roles:
      - chat
      - edit
      - apply

To pin a specific model, replace blockrun/auto with any model from blockrun.ai/models, e.g. anthropic/claude-opus-4.8, xai/grok-4-0709.

Both provider: openai and provider: clawrouter work โ€” just make sure apiBase ends with /v1/.

<details> <summary>Legacy JSON format (<code>~/.continue/config.json</code>)</summary>

{
  "models": [
    {
      "title": "ClawRouter Auto",
      "provider": "openai",
      "model": "blockrun/auto",
      "apiBase": "http://localhost:8402/v1/",
      "apiKey": "x402"
    }
  ]
}

</details> </details>

<details> <summary><strong>Cursor</strong> โ€” Settings โ†’ Models โ†’ OpenAI-compatible</summary>

Set base URL to http://localhost:8402, API key to x402, model to blockrun/auto.

</details>

<details> <summary><strong>Any OpenAI SDK</strong></summary>

from openai import OpenAI
client = OpenAI(base_url="http://localhost:8402", api_key="x402")
response = client.chat.completions.create(model="blockrun/auto", messages=[...])

</details>

---

Routing Profiles

Choose your routing strategy with /model <profile>:

| Profile | Strategy | Savings | Best For | | ---------------- | ------------------ | -------- | -------------------- | | /model free | Free NVIDIA models | 100% | $0 balance, learning | | /model auto | Balanced (default) | 74-100% | General use | | /model eco | Cheapest possible | 95-100% | Maximum savings | | /model premium | Best quality | 0% | Mission-critical |

Shortcuts: /model grok, /model br-sonnet, /model gpt5, /model o3

---

How It Works

100% local routing. <1ms latency. Zero external API calls.

Request โ†’ Weighted Scorer (15 dimensions) โ†’ Tier โ†’ Best Model โ†’ Response

| Tier | ECO Model | AUTO Model | PREMIUM Model | | --------- | ------------------------------------- | ------------------------------------- | ---------------------------- | | SIMPLE | nvidia/gpt-oss-120b (FREE) | gemini-2.5-flash ($0.30/$2.50) | kimi-k2.6 ($0.95/$4.00) | | MEDIUM | gemini-3.1-flash-lite ($0.25/$1.50) | kimi-k2.6 ($0.95/$4.00) | gpt-5.3-codex ($1.75/$14.00) | | COMPLEX | gemini-3.1-flash-lite ($0.25/$1.50) | gemini-3.1-pro ($2/$12) | claude-opus-4.8 ($5/$25) | | REASONING | grok-4-1-fast-reasoning ($0.20/$0.50) | grok-4-1-fast-reasoning ($0.20/$0.50) | claude-sonnet-4.6 ($3/$15) |

Blended average: $2.05/M vs $25/M for Claude Opus = 92% savings

---

Image Generation

Generate images directly from chat with /cr-imagegen:

/cr-imagegen a dog dancing on the beach
/cr-imagegen --model dall-e-3 a futuristic city at sunset
/cr-imagegen --model banana-pro --size 2048x2048 mountain landscape

> The slash command is /cr-imagegen to avoid colliding with Telegram channel commands. Typing /imagegen in chat still works for backward compatibility.

| Model | Provider | Price | Max Size | | ---------------------------- | --------------------- | ------------ | --------- | | nano-banana | Google Gemini Flash | $0.05/image | 1024x1024 | | banana-pro | Google Gemini Pro | $0.10/image | 4096x4096 | | dall-e-3 | OpenAI DALL-E 3 | $0.04/image | 1792x1024 | | gpt-image | OpenAI GPT Image 1 | $0.02/image | 1536x1024 | | flux | Black Forest Flux 1.1 | $0.04/image | 1024x1024 | | xai/grok-imagine-image | xAI Grok Imagine | $0.02/image | 1024x1024 | | xai/grok-imagine-image-pro | xAI Grok Imagine Pro | $0.07/image | 1024x1024 | | zai/cogview-4 | Zhipu CogView-4 | $0.015/image | 1440x1440 |

Video Generation

Generate short AI videos directly from chat with /videogen:

/videogen a red apple slowly spinning
/videogen --model seedance-2-fast --duration=5 a cat waving
/videogen --model grok-video a neon city at night

Or drive it over HTTP โ€” ClawRouter proxies the BlockRun gateway, handles x402 payment, and downloads the returned MP4 to local disk, rewriting url to http://localhost:8402/videos/<file>.mp4 so the asset survives past the upstream's temporary bucket.

curl -X POST http://localhost:8402/v1/videos/generations \
  -H "Content-Type: application/json" \
  -d '{"model":"bytedance/seedance-2.0-fast","prompt":"a red apple slowly spinning","duration_seconds":5}'

| Model | Provider | 5s text-to-video | 5s image-to-video | Duration | | ----------------------------- | ------------------- | ---------------- | ----------------- | --------------------- | | bytedance/seedance-1.5-pro | ByteDance Seedance | ~$0.46 | ~$0.46 (flat) | 5s default, up to 10s | | bytedance/seedance-2.0-fast | ByteDance Seedance | ~$1.19 | ~$1.19 (flat) | 5s default, up to 10s | | bytedance/seedance-2.0 | ByteDance Seedance | ~$1.49 | ~$1.49 (flat) | 5s default, up to 10s | | azure/sora-2 | OpenAI Sora (Azure) | ~$0.42 (4s) | ~$0.42 (4s, flat) | 4s default; 4/8/12s | | xai/grok-imagine-video | xAI Grok Imagine | ~$0.42 (8s) | n/a | 8s default |

Seedance is token-priced upstream at ~20,256 tokens/sec โ€” the blockrun videos route now defaults Seedance to resolution=720p with generate_audio=true for text-to-video (2ร— the per-second token count of the older 480p baseline; audio is included in that rate). The quote is duration ร— tokens/sec ร— $/1M tokens ร— 1.05 margin. Image-to-video is priced the same as text-to-video (the earlier ~40% i2v discount was removed upstream on 2026-06-01; only video-to-video remains cheaper). Sora 2 is flat-priced at $0.10/sec for both t2v and i2v โ€” note it rejects human faces in reference images (use Seedance + RealFace for real people). Calls block for 30โ€“120s while the upstream polls the job. Seedance 2.0 Fast typically returns in 60โ€“80s; 2.0 Pro trades latency for quality.

BytePlus RealFace (Seedance 2.0 only)

For real-person character consistency across frames, pass real_face_asset_id (format ta_xxxxxxxx) on 2.0 Fast or 2.0 Pro. Asset IDs come from token360's Asset UI after H5 face verification โ€” see blockrun's /docs/video/real-person-ip for the enrollment walkthrough. Cannot be combined with image_url (both seed the first frame โ€” pick one). Pricing is unchanged.

curl -X POST http://localhost:8402/v1/videos/generations \
  -H "Content-Type: application/json" \
  -d '{"model":"bytedance/seedance-2.0","prompt":"the person walks through a forest","real_face_asset_id":"ta_abc123def","duration_seconds":5}'

Image Editing (img2img)

Edit existing images with /img2img:

/img2img --image ~/photo.png change the background to a starry sky
/img2img --image ./cat.jpg --mask ./mask.png remove the background

| Option | Required | Description | | ----------------- | -------- | ------------------------------------- | | --image <path> | Yes | Local image file path (supports ~/) | | --mask <path> | No | Mask image (white = area to edit) | | --model <model> | No | Model to use (default: gpt-image-1) | | --size <WxH> | No | Output size (default: 1024x1024) |

API endpoint: POST http://localhost:8402/v1/images/image2image โ€” see full docs.

Phone & Voice Calls

Verify phone numbers and place AI-powered outbound voice calls directly from chat. Phone intelligence runs on Twilio; voice calls use Bland.ai. Payment is automatic via x402 from the wallet.

/cr-call +14155552671 "Hi, this is calling to confirm tomorrow's 3pm meeting"
/cr-call +14155552671 "Order a large pepperoni for delivery" --voice josh --max-duration 10

Calls are fire-and-forget: the request returns a call_id and poll_url immediately. The call itself runs in the cloud for up to 30 minutes. Poll GET /v1/voice/call/{call_id} (or clawrouter share/transcripts dashboard) to retrieve the transcript and recording when status is completed.

| Operation | Provider | Price | | --------------------------------- | -------- | ----------------------- | | Phone lookup (carrier, line type) | Twilio | $0.01 | | Fraud check (SIM-swap, fwd) | Twilio | $0.05 | | Buy phone number (30-day lease) | Twilio | $5.00 | | Renew lease (+30 days) | Twilio | $5.00 | | List wallet's owned numbers | Twilio | $0.001 | | Release a number | Twilio | free | | AI voice call (โ‰ค30 min) | Bland.ai | $0.54 flat per call | | Poll call status / transcript | Bland.ai | free |

CLI for wallet-owned numbers:

clawrouter phone numbers list                              # See active numbers + expiry
clawrouter phone numbers buy US --area-code 415            # Provision a SF number
clawrouter phone numbers renew +14155551234                # Extend 30 days
clawrouter phone numbers release +14155551234              # Release
clawrouter phone lookup +14155552671                       # Carrier + line type
clawrouter phone fraud +14155552671                        # SIM-swap + fwd signals

HTTP API:

# Place a call
curl -X POST http://localhost:8402/v1/voice/call \
  -H "Content-Type: application/json" \
  -d '{"to":"+14155552671","task":"Confirm the 3pm Thursday meeting.","max_duration":5}'
# โ†’ { "call_id": "call_abc123", "poll_url": "/v1/voice/call/call_abc123", "status": "queued" }

# Poll for transcript
curl http://localhost:8402/v1/voice/call/call_abc123

LLM agents discover all eight operations as blockrun_phone_ / blockrun_voice_ tools (see /partners).

> โš ๏ธ blockrun_voice_call and /cr-call place a real outbound phone call. Server enforces an emergency-number blocklist; choose --from from wallet-owned numbers via phone numbers list.

---

Crypto Data (Surf)

Surf is BlockRun's unified crypto data API โ€” 84 endpoints across 13 domains: CEX/DEX markets, on-chain SQL over 80+ ClickHouse tables (Ethereum, Base, Arbitrum, BSC, TRON, HyperEVM, Tempo), 100M+ labeled wallets, prediction markets (Polymarket + Kalshi), social/CT mindshare, news, project/DeFi metrics, token analytics, unified search, VC fund intelligence. The killer feature is ad-hoc POST /surf/onchain/sql โ€” agents query the warehouse directly without running an indexer.

ClawRouter ships Surf as a skill, not as typed wrappers. The proxy whitelists /v1/surf/ so any call through the local proxy is paid x402 from the same wallet; the agent reads skills/surf/SKILL.md for the endpoint catalog and crafts the HTTP call. No blockrun_surf_ tool definitions to maintain; a new Surf endpoint requires zero ClawRouter release.

| Tier | Cost | Examples | | ---- | ---------: | ------------------------------------------------------------- | | 1 | $0.001 | prices, rankings, lists, news | | 2 | $0.005 | orderbooks, candles, search, wallet details, social mindshare | | 3 | $0.020 | on-chain SQL / query / schema, chat completions |

Usage (HTTP):

# Aggregated BTC spot price (Tier 1, $0.001)
curl 'http://localhost:8402/v1/surf/market/price?symbol=BTC'

# Bulk wallet labels over 100M+ labeled wallets (Tier 2, $0.005)
curl 'http://localhost:8402/v1/surf/wallet/labels/batch?addresses=0xabc,0xdef,0x123'

# Ad-hoc on-chain SQL (Tier 3, $0.020)
curl -X POST 'http://localhost:8402/v1/surf/onchain/sql' \
  -H 'content-type: application/json' \
  -d '{"sql":"SELECT count() FROM ethereum.transactions WHERE block_timestamp >= now() - INTERVAL 1 HOUR"}'

No Surf account, no API key โ€” settles directly to Surf's Base treasury in USDC via the same wallet as LLM calls. Full endpoint reference: skills/surf/SKILL.md. Upstream marketplace: <https://blockrun.ai/marketplace/surf>.

---

Models & Pricing

55+ models across 9 providers, one wallet. Starting at $0.0002/request.

> ๐Ÿ’ก "Cost per request" = estimated cost for a typical chat message (~500 input + 500 output tokens).

Budget Models (under $0.001/request)

| Model | Input $/M | Output $/M | ~$/request | Context | Features | | --------------------------------------------- | --------: | ---------: | ---------: | ------- | -------------------------------------------- | | nvidia/gpt-oss-120b | FREE | FREE | $0 | 128K | default free model | | nvidia/gpt-oss-20b | FREE | FREE | $0 | 128K | smaller, faster | | nvidia/mistral-large-3-675b | FREE | FREE | $0 | 131K | 675B general flagship | | nvidia/qwen3.5-122b-a10b | FREE | FREE | $0 | 131K | newest-gen Qwen, strong general | | nvidia/qwen3-next-80b-a3b-instruct | FREE | FREE | $0 | 262K | reasoning + coding | | nvidia/llama-4-maverick | FREE | FREE | $0 | 131K | reasoning | | nvidia/seed-oss-36b | FREE | FREE | $0 | 131K | coding | | nvidia/nemotron-3-nano-omni-30b-a3b-reasoning | FREE | FREE | $0 | 256K | reasoning, vision (text+img+video+audio) | | openai/gpt-5-nano | $0.05 | $0.40 | $0.0002 | 128K | tools | | openai/gpt-4.1-nano | $0.10 | $0.40 | $0.0003 | 128K | tools | | google/gemini-2.5-flash-lite | $0.10 | $0.40 | $0.0003 | 1M | tools | | openai/gpt-4o-mini | $0.15 | $0.60 | $0.0004 | 128K | tools | | xai/grok-4-fast | $0.20 | $0.50 | $0.0004 | 131K | tools | | xai/grok-4-fast-reasoning | $0.20 | $0.50 | $0.0004 | 131K | reasoning, tools | | xai/grok-4-1-fast | $0.20 | $0.50 | $0.0004 | 131K | tools | | xai/grok-4-1-fast-reasoning | $0.20 | $0.50 | $0.0004 | 131K | reasoning, tools | | xai/grok-4-0709 | $0.20 | $1.50 | $0.0009 | 131K | reasoning, tools | | openai/gpt-5-mini | $0.25 | $2.00 | $0.0011 | 200K | tools | | deepseek/deepseek-chat | $0.20 | $0.40 | $0.0003 | 1M | tools (V4 Flash chat) | | deepseek/deepseek-reasoner | $0.20 | $0.40 | $0.0003 | 1M | reasoning, tools (V4 Flash thinking) | | deepseek/deepseek-v4-pro | $0.435 | $0.87 | $0.0007 | 1M | reasoning, agentic, tools (V4 flagship) | | zai/glm-5 | $0.60 | $1.92 | $0.0013 | 200K | tools | | zai/glm-5-turbo | $1.20 | $4.00 | $0.0026 | 200K | tools | | xai/grok-3-mini | $0.30 | $0.50 | $0.0004 | 131K | tools | | minimax/minimax-m3 | $0.30 | $1.20 | $0.0008 | 1M | reasoning, agentic, tools | | minimax/minimax-m2.7 | $0.30 | $1.20 | $0.0008 | 205K | reasoning, agentic, tools | | minimax/minimax-m2.5 | $0.30 | $1.20 | $0.0008 | 205K | reasoning, agentic, tools | | google/gemini-2.5-flash | $0.30 | $2.50 | $0.0014 | 1M | vision, tools | | openai/gpt-4.1-mini | $0.40 | $1.60 | $0.0010 | 128K | tools | | google/gemini-3.5-flash | $0.50 | $3.00 | $0.0018 | 1M | reasoning, vision, tools (thinking built-in) | | google/gemini-3-flash-preview | $0.50 | $3.00 | $0.0018 | 1M | vision | | moonshot/kimi-k2.5 | $0.60 | $3.00 | $0.0018 | 262K | reasoning, vision, agentic, tools | | moonshot/kimi-k2.6 | $0.95 | $4.00 | $0.0025 | 262K | reasoning, vision, agentic, tools |

Mid-Range Models ($0.001โ€“$0.01/request)

| Model | Input $/M | Output $/M | ~$/request | Context | Features | | --------------------------- | --------: | ---------: | ---------: | ------- | ----------------------------------------- | | anthropic/claude-haiku-4.5 | $1.00 | $5.00 | $0.0030 | 200K | vision, agentic, tools | | openai/o1-mini | $1.10 | $4.40 | $0.0028 | 128K | reasoning, tools | | openai/o3-mini | $1.10 | $4.40 | $0.0028 | 128K | reasoning, tools | | openai/o4-mini | $1.10 | $4.40 | $0.0028 | 128K | reasoning, tools | | google/gemini-2.5-pro | $1.25 | $10.00 | $0.0056 | 1M | reasoning, vision, tools | | zai/glm-5.2 | $1.40 | $4.40 | $0.0029 | 1M | reasoning, coding, tools (flagship) | | zai/glm-5.1 | $1.40 | $4.40 | $0.0029 | 200K | reasoning, tools (promo ended 2026-06-05) | | xai/grok-4.3 | $1.50 | $4.00 | $0.0028 | 1M | reasoning, vision, agentic, tools | | xai/grok-build-0.1 | $1.50 | $3.00 | $0.0023 | 256K | agentic coding, tools | | openai/gpt-5.2 | $1.75 | $14.00 | $0.0079 | 400K | reasoning, vision, agentic, tools | | openai/gpt-5.3 | $1.75 | $14.00 | $0.0079 | 128K | reasoning, vision, agentic, tools | | openai/gpt-5.3-codex | $1.75 | $14.00 | $0.0079 | 400K | agentic, tools | | openai/gpt-4.1 | $2.00 | $8.00 | $0.0050 | 128K | vision, tools | | openai/o3 | $2.00 | $8.00 | $0.0050 | 200K | reasoning, tools | | google/gemini-3-pro-preview | $2.00 | $12.00 | $0.0070 | 1M | reasoning, vision, tools | | google/gemini-3.1-pro | $2.00 | $12.00 | $0.0070 | 1M | reasoning, vision, tools | | xai/grok-2-vision | $2.00 | $10.00 | $0.0060 | 131K | vision, tools | | openai/gpt-4o | $2.50 | $10.00 | $0.0063 | 128K | vision, agentic, tools | | openai/gpt-5.4 | $2.50 | $15.00 | $0.0088 | 400K | reasoning, vision, agentic, tools |

Premium Models ($0.01+/request)

| Model | Input $/M | Output $/M | ~$/request | Context | Features | | --------------------------- | --------: | ---------: | ---------: | ------- | --------------------------------- | | anthropic/claude-sonnet-4.6 | $3.00 | $15.00 | $0.0090 | 200K | reasoning, vision, agentic, tools | | xai/grok-3 | $3.00 | $15.00 | $0.0090 | 131K | reasoning, tools | | anthropic/claude-opus-4.8 | $5.00 | $25.00 | $0.0150 | 1M | reasoning, vision, agentic, tools | | openai/gpt-5.5 | $5.00 | $30.00 | $0.0175 | 1M | reasoning, vision, agentic, tools | | openai/o1 | $15.00 | $60.00 | $0.0375 | 200K | reasoning, tools | | openai/gpt-5.2-pro | $21.00 | $168.00 | $0.0945 | 400K | reasoning, tools | | openai/gpt-5.4-pro | $30.00 | $180.00 | $0.1050 | 400K | reasoning, tools |

> Free tier: several NVIDIA-hosted models cost nothing โ€” /model free smart-routes across them, or pick one directly (e.g., /model nemotron-omni for vision, /model qwen3-next for 262K reasoning, /model qwen-coder for coding, /model maverick). > Best value: gpt-5-nano and gemini-2.5-flash-lite deliver strong results at ~$0.0003/request.

---

Payment

No account. No API key. Payment IS authentication via x402.

Request โ†’ 402 (price: $0.003) โ†’ wallet signs USDC โ†’ retry โ†’ response

USDC stays in your wallet until spent โ€” non-custodial. Price is visible in the 402 header before signing.

Dual-chain support: Pay with USDC on Base (EVM) or USDC on Solana. Both wallets are derived from a single BIP-39 mnemonic on first run.

/wallet              # Check balance and address (both chains)
/wallet export       # Export mnemonic + keys for backup
/wallet recover      # Restore wallet from mnemonic on a new machine
/wallet solana       # Switch to Solana USDC payments
/wallet base         # Switch back to Base (EVM) USDC payments
/chain solana        # Alias for /wallet solana
/stats               # View usage and savings
/stats clear         # Reset usage statistics
/exclude             # Show excluded models
/exclude add <model> # Block a model from routing (aliases work: "grok-4", "free")
/exclude remove <model> # Unblock a model
/exclude clear       # Remove all exclusions

Fund your wallet:

  • Base (EVM): Send USDC on Base to your EVM address
  • Solana: Send USDC on Solana to your Solana address
  • Coinbase/CEX: Withdraw USDC to either network
  • Credit card: Reach out to @bc1max on Telegram

---

Screenshots

<table> <tr> <td width="50%" align="center"> <strong>Smart Routing in Action</strong><br><br> <img src="docs/clawrouter-savings.png" alt="ClawRouter savings" width="400"> </td> <td width="50%" align="center"> <strong>Telegram Integration</strong><br><br> <img src="assets/telegram-demo.png" alt="Telegram demo" width="400"> </td> </tr> </table>

---

Configuration

For basic usage, no configuration needed. For advanced options:

| Variable | Default | Description | | --------------------------- | ------------------------------------- | ---------------------------------------------------------------- | | BLOCKRUN_WALLET_KEY | auto-generated | Your wallet private key | | BLOCKRUN_PROXY_PORT | 8402 | Local proxy port | | CLAWROUTER_DISABLED | false | Disable smart routing | | CLAWROUTER_DEBUG_HEADERS | on | Set to off to suppress x-clawrouter-* debug response headers | | CLAWROUTER_SOLANA_RPC_URL | https://api.mainnet-beta.solana.com | Solana RPC endpoint |

Full reference: docs/configuration.md

Model Exclusion

Block specific models from being routed to. Useful if a model doesn't follow your agent instructions or you want to control costs.

/exclude add nvidia/gpt-oss-120b   # Block the default free model
/exclude add grok-4                # Aliases work โ€” blocks all grok-4 variants
/exclude add gpt-5.4               # Skip expensive models
/exclude                           # Show current exclusions
/exclude remove grok-4             # Unblock a model
/exclude clear                     # Remove all exclusions

Exclusions persist across restarts (~/.openclaw/blockrun/exclude-models.json). If all models in a tier are excluded, the safety net ignores the filter so routing never breaks.

---

Troubleshooting

When things go wrong, run the doctor:

npx @blockrun/clawrouter doctor

This collects diagnostics and sends them to Claude Sonnet for AI-powered analysis:

๐Ÿฉบ BlockRun Doctor v0.12.24

System
  โœ“ OS: darwin arm64
  โœ“ Node: v22.14.0

Wallet
  โœ“ Address: 0x1234...abcd
  โœ“ Balance: $12.50

Network
  โœ“ BlockRun API: reachable (142ms)
  โœ— Local proxy: not running on :8402

๐Ÿ“ค Sending to Claude Sonnet 4.6 (~$0.003)...

๐Ÿค– AI Analysis:
The local proxy isn't running. Run `openclaw gateway restart` to fix.

Use Opus for complex issues:

npx @blockrun/clawrouter doctor opus

Ask a specific question:

npx @blockrun/clawrouter doctor "why is my request failing?"
npx @blockrun/clawrouter doctor opus "ๆทฑๅบฆๅˆ†ๆžๆˆ‘็š„้…็ฝฎ"

Cost: Sonnet ~$0.003 (default) | Opus ~$0.01

---

Development

git clone https://github.com/BlockRunAI/ClawRouter.git
cd ClawRouter
npm install
npm run build
npm test

---

Support

| Channel | Link | | --------------------- | ------------------------------------------------------------------ | | ๐Ÿ“… Schedule Demo | calendly.com/vickyfu9/30min | | ๐Ÿ’ฌ Community Telegram | t.me/blockrunAI | | ๐Ÿฆ X / Twitter | x.com/blockrunai | | ๐Ÿ“ฑ Founder Telegram | @bc1max | | โœ‰๏ธ Email | vicky@blockrun.ai |

---

From the BlockRun Ecosystem

<table> <tr> <td width="50%">

โšก ClawRouter

The LLM router built for autonomous agents

You're here. 55+ models, local smart routing, x402 USDC payments โ€” the only stack that lets agents operate independently.

curl -fsSL https://blockrun.ai/ClawRouter-update | bash

</td> <td width="50%">

๐Ÿค– BRCC

BlockRun for Claude Code

Run Claude Code with 50+ models, no rate limits, no Anthropic account, no phone verification. Pay per request with USDC โ€” your wallet is your identity.

curl -fsSL https://blockrun.ai/brcc-install | bash

</td> </tr> <tr> <td width="50%">

๐Ÿ ClawRouter-Hermes

ClawRouter for NousResearch Hermes

Python plugin that wraps the ClawRouter proxy for hermes-agent. Same 55+ models, same x402 USDC payments on Base & Solana, native Hermes ergonomics.

pip install hermes-plugin-clawrouter

</td> <td width="50%">

<!-- next ecosystem entry goes here -->

</td> </tr> </table>

---

More Resources

| Resource | Description | | ------------------------------------------------------ | ------------------------ | | Documentation | Full docs | | Model Pricing | All models & prices | | Image Generation & Editing | API examples, 5 models | | Routing Profiles | ECO/AUTO/PREMIUM details | | Architecture | Technical deep dive | | Configuration | Environment variables | | Troubleshooting | Common issues |

Blog

| Article | Topic | | -------------------------------------------------------------------------------------------------- | ------------------------------------------------------- | | 9 Free AI Models, Zero Cost | How BlockRun gives developers top-tier LLMs for nothing | | ClawRouter Cuts LLM API Costs 500ร— | Deep dive into cost savings | | ClawRouter vs OpenRouter | Head-to-head comparison | | Smart LLM Router: 14-Dimension Classifier | How the routing engine works | | LLM Router Benchmark: 46 Models, Sub-1ms | Performance benchmarks | | Anthropic Cost Savings | Reducing Claude API spend |

---

Frequently Asked Questions

What is ClawRouter?

ClawRouter is an open-source (MIT licensed) smart LLM router built for autonomous AI agents. It analyzes each request across 15 dimensions and routes to the cheapest capable model in under 1ms, entirely locally โ€” no external API calls needed for routing decisions.

How much can ClawRouter save on LLM costs?

ClawRouter's blended average cost is $2.05 per million tokens compared to $25/M for Claude Opus, representing 92% savings. Actual savings depend on your workload โ€” simple queries are routed to free models ($0/request), while complex tasks get premium models.

How does ClawRouter compare to OpenRouter?

ClawRouter is open source and runs locally. It uses wallet-based authentication (no API keys) and USDC per-request payments (no credit cards or subscriptions). OpenRouter requires an account, API key, and credit card. ClawRouter also features smart routing โ€” it automatically picks the best model for each request, while OpenRouter requires manual model selection.

How does ClawRouter compare to LiteLLM?

Both are open source and run locally. But ClawRouter adds smart routing (automatic model selection), wallet-based auth, and USDC payments. LiteLLM requires you to bring your own API keys and manually choose models.

What agents does ClawRouter work with?

ClawRouter works with any tool that makes OpenAI-compatible API calls โ€” point it at http://localhost:8402. This includes continue.dev, Cursor, VS Code extensions, ElizaOS, and custom agents. It also integrates as a plugin with OpenClaw (an AI coding agent), which enables additional features like slash commands and usage reports.

Is ClawRouter free?

ClawRouter itself is free and MIT licensed. You pay only for the LLM API calls routed through it โ€” and several NVIDIA-hosted models (gpt-oss-120b, gpt-oss-20b, mistral-large-3-675b, qwen3.5-122b-a10b, qwen3-next-80b-a3b-instruct, llama-4-maverick, seed-oss-36b, nemotron-3-nano-omni-30b-a3b-reasoning) are completely free. Use /model free to smart-route across them, or pick any by name.

---

<div align="center">

MIT License ยท BlockRun โ€” Agent-native AI infrastructure

โญ If ClawRouter powers your agents, consider starring the repo!

</div>

Related plugins

Browse all โ†’