This skill processes and generates multimedia content using Google Gemini to analyze audio, images, videos, documents, and create images.
npx playbooks add skill mamba-mental/agent-skill-manager --skill ai-multimodal
This skill processes and generates multimedia content using Google Gemini to analyze audio, images, videos, documents, and create images.. This skill provides a specialized system prompt that configures your AI coding agent as an ai multimodal expert, with detailed methodology and structured output formats.
Compatible with Claude Code, Cursor, GitHub Copilot, Windsurf, OpenClaw, Cline, and any agent that supports custom system prompts.
This skill processes and generates multimedia content using Google Gemini to analyze audio, images, videos, documents, and create images.
Weekly roundup of top Claude Code skills, MCP servers, and AI coding tips.