Enables AI assistants such as Claude to manage LLM inference through the Model Context Protocol (MCP) by controlling the model registry and backends and monitoring VRAM.
Getting started
Add MLX MCP Server to your MCP-capable client (Claude Code, Cursor, Codex, and others) by following the setup instructions at the project source, which document the exact command, configuration, and any required API keys.
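As a rough illustration of what that configuration tends to look like, the sketch below builds an entry in the `mcpServers` JSON layout that several MCP clients read. It is an assumption that your client uses this layout (some clients use a different file format), and the server label `mlx`, the command, the arguments, and the environment variable name are all placeholders, not values taken from the MLX MCP Server documentation. Replace them with the values given in the project's setup instructions.

```python
import json


def build_mcp_entry(name, command, args=None, env=None):
    """Return a config fragment in the common `mcpServers` JSON layout.

    This is a generic helper for illustration, not part of MLX MCP Server.
    """
    entry = {"command": command, "args": list(args or [])}
    if env:
        # Only include an env block when keys are actually needed.
        entry["env"] = dict(env)
    return {"mcpServers": {name: entry}}


# Placeholders only: substitute the real command, arguments, and any
# API key names from the project's setup instructions.
config = build_mcp_entry(
    name="mlx",  # hypothetical label for this server
    command="<command-from-setup-docs>",
    args=["<args-from-setup-docs>"],
    env={"<API_KEY_NAME>": "<value>"},
)

print(json.dumps(config, indent=2))
```

The printed JSON can be merged into the client's MCP configuration file once the placeholders are filled in; where that file lives depends on the client.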






