This skill helps you iteratively optimize CUDA kernels by profiling with nsys and ncu, classifying bottlenecks, and validating improvements.
npx playbooks add skill trevors/dot-claude --skill cuda-kernel-refine
This skill helps you iteratively optimize CUDA kernels by profiling with nsys and ncu, classifying bottlenecks, and validating improvements.. This skill provides a specialized system prompt that configures your AI coding agent as a cuda kernel refine expert, with detailed methodology and structured output formats.
Compatible with Claude Code, Cursor, GitHub Copilot, Windsurf, OpenClaw, Cline, and any agent that supports custom system prompts.
This skill helps you iteratively optimize CUDA kernels by profiling with nsys and ncu, classifying bottlenecks, and validating improvements.