This skill processes and generates multimedia content using Google Gemini to analyze audio, images, videos, documents, and create images.
npx playbooks add skill mamba-mental/agent-skill-manager --skill ai-multimodal
At 19 words, this compact prompt gives your agent specialized documentation expertise with structured patterns and output formats. Install via CLI or copy the prompt below.
This skill processes and generates multimedia content using Google Gemini to analyze audio, images, videos, documents, and create images.
Ai Multimodal is a free documentation skill for AI coding agents. It uses Google Gemini to analyze audio, images, videos, and documents, and to generate images. It provides a specialized system prompt that configures your agent with documentation expertise.
Run npx playbooks add skill mamba-mental/agent-skill-manager --skill ai-multimodal in your terminal to install Ai Multimodal into your Claude Code session. It works immediately after installation.
Ai Multimodal is compatible with Claude Code, Cursor, GitHub Copilot, Windsurf, OpenClaw, Cline, and any AI agent that supports custom system prompts or .cursorrules files.
Yes, Ai Multimodal is completely free and open source. The full source is available on GitHub at https://github.com/mamba-mental/agent-skill-manager/tree/main/skills/ai-multimodal. You only need a subscription to the AI agent you use it with.