Turn PyTorch into fast CUDA/Triton kernels on real datacenter GPUs with up to 14x speedup.
Getting started
Add forge-mcp-server to your MCP-capable client — Claude Code, Cursor, Codex, and others — by following the setup at the source, which documents the exact command, configuration, and any required API keys.






