mock-llm-mcp

MCP server for Mock LLM API — mock OpenAI, Anthropic, and Google Gemini responses for testing AI integrations. No real API keys or token spend required.

Installation

pip install mock-llm-mcp
# or
uvx mock-llm-mcp

Claude Desktop Configuration

{
  "mcpServers": {
    "mock-llm": {
      "command": "uvx",
      "args": ["mock-llm-mcp"],
      "env": {
        "MOCK_LLM_API_KEY": "your-key-here"
      }
    }
  }
}

No API key required for the free tier (500 calls/day). Get a key at rebaselabs.online for higher limits.

Tools

| Tool | Description | |------|-------------| | mock_quick | Quickest mock response — provider-agnostic, auto-detects response type | | mock_openai_chat | Drop-in mock for POST /v1/chat/completions (OpenAI format) | | mock_anthropic_message | Drop-in mock for POST /v1/messages (Anthropic format) | | mock_google_generate | Drop-in mock for Google Gemini generateContent | | mock_simulate_error | Simulate specific LLM errors (rate limit, timeout, invalid key, etc.) | | list_mock_models | List available mock models for a provider |

Use Cases

Test without token spend — verify your LLM integration code works without calling real APIs
CI/CD pipelines — deterministic, offline-safe tests using seed-based responses
Error handling — simulate rate limits, 500 errors, auth failures, context length exceeded
Frontend dev — build chat UIs without a real API key
Multi-provider testing — test your abstraction layer against OpenAI, Anthropic, and Google formats

Examples

Quick mock (no format needed)

mock_quick(prompt="Explain quantum computing", length="short")

Test OpenAI integration

mock_openai_chat(
    messages=[{"role": "user", "content": "Hello!"}],
    model="gpt-4o",
    response_type="text"
)

Simulate a rate limit error

mock_simulate_error(provider="anthropic", error_type="rate_limit")

Deterministic response with seed

mock_quick(prompt="Write a haiku", seed=42)

Response Control Headers

All mock tools support:

length: "short", "medium", "long", "xl", "random"
response_type: "auto", "text", "code", "json", "markdown", "list"
error: "none", "rate_limit", "server_error", "timeout", "invalid_key", "context_length", "content_filter"
delay_ms: 0–5000 — artificial latency
seed: integer — reproducible responses

Environment Variables

| Variable | Description | Default | |----------|-------------|---------| | MOCK_LLM_API_KEY | API key for authenticated access | ` (free tier) | | MOCK_LLM_API_URL | Override API base URL | https://mock-llm-api.rebaselabs.online` |

Part of the RebaseKit Agent Infrastructure Stack

Mock LLM MCP is part of the RebaseKit suite of agent-native APIs:

WeTask — web extraction & browser automation
CodeExec — sandboxed code execution
PII API — detect & mask sensitive data
DocParse — document parsing & OCR
DataTransform — data format conversion & querying
Mock LLM — mock any LLM provider for testing

"The internet was built for humans. RebaseKit makes it work for agents."

mock-llm-mcp

mock-llm-mcp

Installation

Claude Desktop Configuration

Tools

Use Cases

Examples

Quick mock (no format needed)

Test OpenAI integration

Simulate a rate limit error

Deterministic response with seed

Response Control Headers

Environment Variables

Part of the RebaseKit Agent Infrastructure Stack

Related MCP servers

MCP servers by category