Gemini Image Generator MCP Server
Generate and transform images using Google's Gemini AI through the Model Context Protocol (MCP).
 
Features
- Text-to-Image Generation - Create images from natural language prompts
- Image Transformation - Modify existing images with text descriptions
- Automatic Filename Generation - Smart naming based on prompts
- Multi-Language Support - Automatic prompt translation to English
Installation
Get a free API key from Google AI Studio.
git clone https://github.com/jonchun/gemini-image-mcp.git
cd gemini-image-mcp
# Using uv (recommended)
uv venv
source .venv/bin/activate # Windows: .venv\Scripts\activate
uv pip install -e .
Configuration
<details> <summary><b>Claude Desktop</b></summary>
Add to claude_desktop_config.json:
{
  "mcpServers": {
    "gemini-image-mcp": {
      "command": "uv",
      "args": [
        "--directory",
        "/ABSOLUTE/PATH/TO/gemini-image-mcp",
        "run",
        "gemini-image-mcp"
      ],
      "env": {
        "GEMINI_API_KEY": "your-api-key-here",
        "DEFAULT_OUTPUT_IMAGE_PATH": "/path/to/images"
      }
    }
  }
}
</details>
<details> <summary><b>OpenCode</b></summary>
{
  "gemini-image-mcp": {
    "command": "uv",
    "args": [
      "--directory",
      "/ABSOLUTE/PATH/TO/gemini-image-mcp",
      "run",
      "gemini-image-mcp"
    ],
    "env": {
      "GEMINI_API_KEY": "your-api-key-here",
      "DEFAULT_OUTPUT_IMAGE_PATH": "/path/to/images"
    }
  }
}
</details>
<details> <summary><b>Smithery</b></summary>
Install from smithery.ai - search for "gemini-image-mcp".
</details>
Usage
Generate Images
Generate a photorealistic sunset over mountains with purple sky
Create a British Shorthair silver tabby kitten playing with a ball of yarn
Transform Images
Prompt: Add beautiful vibrant aurora borealis (northern lights) dancing across the sky with green, purple, and blue colors
---
Prompt: Add soft natural sunlight streaming through a window, creating beautiful warm light rays and gentle shadows
Available Tools
generate_image_from_text
Creates an image from a text description.
Parameters:
- prompt (required): Text description of the image
- output_dir (optional): Directory to save the image
- model (optional): Gemini model to use (defaults to GEMINI_MODEL environment variable)
Returns: Path to the saved image file
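
MCP clients normally issue tool calls for you, but it can help to see what a call to this tool looks like on the wire. The sketch below builds a JSON-RPC `tools/call` request in the shape defined by the MCP specification. The helper name `build_tool_call` is hypothetical, and leaving out unset optional parameters is an assumption about how a client would behave, not something this server requires.

```python
import json
from typing import Optional


def build_tool_call(
    request_id: int,
    prompt: str,
    output_dir: Optional[str] = None,
    model: Optional[str] = None,
) -> dict:
    """Build a JSON-RPC 2.0 request that calls generate_image_from_text."""
    arguments = {"prompt": prompt}
    # Optional parameters are only sent when the caller sets them.
    if output_dir is not None:
        arguments["output_dir"] = output_dir
    if model is not None:
        arguments["model"] = model

    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "generate_image_from_text", "arguments": arguments},
    }


if __name__ == "__main__":
    request = build_tool_call(1, "A photorealistic sunset over mountains")
    print(json.dumps(request, indent=2))
```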
transform_image_from_file
Transforms an existing image based on a text prompt.
Parameters:
- image_file_path (required): Path to the source image
- prompt (required): Description of the transformation
- output_dir (optional): Directory to save the image
- model (optional): Gemini model to use (defaults to GEMINI_MODEL environment variable)
Returns: Path to the transformed image file
transform_image_from_encoded
Transforms a base64-encoded image.
Parameters:
- encoded_image (required): Base64 data URL (data:image/[format];base64,[data])
- prompt (required): Description of the transformation
- output_dir (optional): Directory to save the image
- model (optional): Gemini model to use (defaults to GEMINI_MODEL environment variable)
Returns: Path to the transformed image file
Configuration
| Variable | Required | Default | Description |
| --- | --- | --- | --- |
| GEMINI_API_KEY | Yes | - | Your Gemini API key |
| DEFAULT_OUTPUT_IMAGE_PATH | No | Current directory | Default save location |
| GEMINI_MODEL | No | gemini-2.5-flash-image | Model to use |
| GEMINI_BASE_URL | No | https://generativelanguage.googleapis.com | API base URL |
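
To make the defaults in the table concrete, here is a minimal sketch of how these variables could be resolved in Python. The `Settings` class and `load_settings` function are hypothetical names used for illustration. This is not the server's actual code, and treating an empty value the same as an unset one is an assumption.

```python
import os
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class Settings:
    api_key: str
    output_dir: str
    model: str
    base_url: str


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Resolve the documented variables, applying the documented defaults."""
    api_key = env.get("GEMINI_API_KEY", "").strip()
    if not api_key:
        # GEMINI_API_KEY is the only required variable.
        raise RuntimeError("GEMINI_API_KEY is required")

    return Settings(
        api_key=api_key,
        # Empty or missing values fall back to the defaults from the table.
        output_dir=env.get("DEFAULT_OUTPUT_IMAGE_PATH") or os.getcwd(),
        model=env.get("GEMINI_MODEL") or "gemini-2.5-flash-image",
        base_url=env.get("GEMINI_BASE_URL") or "https://generativelanguage.googleapis.com",
    )
```

Passing a plain dictionary instead of `os.environ` keeps the function easy to test without touching the real environment.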
Development
Test the server locally:
fastmcp dev src/gemini_image_mcp/server.py
This opens the MCP Inspector at http://localhost:5173/
License
MIT






