Multimodal AI processing via Google Gemini API (2M tokens context). Capabilities: audio (transcription, 9.5hr max, summarization, music analysis), images (captioning, OCR, object detection, segmentation, visual Q&A), video (scene detection, 6hr max, YouTube URLs, temporal analysis), documents (PDF e
npx playbooks add skill samhvw8/dot-claude --skill ai-multimodal
Multimodal AI processing via Google Gemini API (2M tokens context). Capabilities: audio (transcription, 9.5hr max, summarization, music analysis), images (captioning, OCR, object detection, segmentation, visual Q&A), video (scene detection, 6hr max, YouTube URLs, temporal analysis), documents (PDF e
The 38-word prompt provides structured documentation guidance — covering detailed methodology and consistent output formats. Install it in one command.
Multimodal AI processing via Google Gemini API (2M tokens context). Capabilities: audio (transcription, 9.5hr max, summarization, music analysis), images (captioning, OCR, object detection, segmentation, visual Q&A), video (scene detection, 6hr max, YouTube URLs, temporal analysis), documents (PDF e
Ai Multimodal is a free documentation skill for AI coding agents. Multimodal AI processing via Google Gemini API (2M tokens context). Capabilities: audio (transcription, 9.5hr max, summarization, music analysis), images (captioning, OCR, object detection, segmentation, visual Q&A), video (scene detection, 6hr max, YouTube URLs, temporal analysis), documents (PDF e. It provides a specialized system prompt that configures your agent with documentation expertise.
Run npx playbooks add skill samhvw8/dot-claude --skill ai-multimodal in your terminal to install Ai Multimodal into your Claude Code session. It works immediately after installation.
Ai Multimodal is compatible with Claude Code, Cursor, GitHub Copilot, Windsurf, OpenClaw, Cline, and any AI agent that supports custom system prompts or .cursorrules files.
Yes, Ai Multimodal is completely free and open source. The full source is available on GitHub at https://github.com/samhvw8/dot-claude/tree/main/skills/ai-multimodal. You only need a subscription to the AI agent you use it with.
Weekly roundup of top Claude Code skills, MCP servers, and AI coding tips.