
Google Gemini Media MCP Server: Image & Video Generation for AI Agents

CxO AGI Team

Introducing Gemini Media MCP Server

We’ve released an open-source MCP server that brings Google’s latest visual AI models directly into Claude or any other MCP-compatible AI agent. With Gemini 3 Pro Image (Nano Banana Pro), Imagen 4, and Veo 3, agents can generate studio-quality images, as well as cinematic videos with synchronized audio, all from natural-language conversation.

Available Now on Adomo.ai

Try it live on Adomo.ai, our enterprise AI platform, where you can use the full Gemini Media MCP feature set without any setup.

For developers and teams who want to run it themselves, we’ve made it available across multiple platforms:

  • Docker Hub — Pull and run in seconds
  • GitHub — Full source code and documentation
  • PyPI — Install via pip

Real Business Impact

This isn’t just a tech demo — it’s a production-ready tool designed for real enterprise workflows:

E-commerce

Generate product shots and promotional videos at scale while maintaining brand consistency. No more waiting on design teams for every product variation.

Marketing

Create complete campaign assets — from static images to video ads — without leaving your AI workflow. Iterate faster and launch campaigns sooner.

Enterprise

Produce training materials, technical diagrams, and localized content for each market on demand. Scale your content operations without scaling your team.

Why We Open-Sourced This

We originally built Gemini Media MCP for Adomo AI’s enterprise platform. After seeing how well Gemini’s image understanding combines with Veo’s cinematic video generation, we decided it was too useful to keep internal.

Every AI agent should be able to create professional visual content. We’re committed to continuing to release and support open-source AI tools for the community.

Created with Claude and Gemini Media MCP

The image and video below were generated using Claude with the Gemini Media MCP server:

[Image: Claude and Gemini]

Get Started

The server works seamlessly with:

  • Claude Desktop
  • Claude Code
  • Any MCP-compatible platform
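
For readers who have not registered an MCP server before, here is a minimal sketch of the kind of entry Claude Desktop reads from the `mcpServers` section of its `claude_desktop_config.json`. This post does not specify the launch command or the API-key variable, so the server label, the `gemini-media-mcp` command, and the `GEMINI_API_KEY` name below are placeholders assumed for illustration only. Take the real values from the GitHub README.

```python
import json

# ASSUMPTIONS: none of these three values come from the post. They are
# hypothetical placeholders; use the ones documented in the project's README.
SERVER_NAME = "gemini-media"          # label shown in the client, any string
LAUNCH_COMMAND = "gemini-media-mcp"   # assumed executable name
API_KEY_VAR = "GEMINI_API_KEY"        # assumed environment variable name


def build_mcp_config(api_key: str) -> dict:
    """Build a dict in the `mcpServers` layout that Claude Desktop expects."""
    return {
        "mcpServers": {
            SERVER_NAME: {
                "command": LAUNCH_COMMAND,      # program the client starts
                "args": [],                     # extra CLI arguments, if any
                "env": {API_KEY_VAR: api_key},  # passed to the server process
            }
        }
    }


if __name__ == "__main__":
    # Print the JSON so it can be merged into an existing config file.
    print(json.dumps(build_mcp_config("YOUR_API_KEY"), indent=2))
```

Merging the printed JSON into the client's config file and restarting the client is typically all the registration that is needed. Other MCP-compatible platforms use their own configuration formats.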

Head over to GitHub to get started, or experience it directly on Adomo.ai.

We’d love to hear your feedback — open an issue on GitHub or reach out to us directly.


A huge thanks to the amazing teams at Google and DeepMind for building these incredible models.