This skill enables automated LLM evaluation, regression and security testing with Promptfoo, integrating into CI/CD to improve prompt quality and safety.
npx playbooks add skill phrazzld/claude-config --skill llm-evaluation
At 21 words, this compact prompt gives your agent specialized security expertise with structured patterns and output formats. Install via CLI or copy the prompt below.
Llm Evaluation is a free security skill for AI coding agents. It enables automated LLM evaluation, regression testing, and security testing with Promptfoo, and integrates into CI/CD to improve prompt quality and safety. It provides a specialized system prompt that configures your agent with security expertise.
Run npx playbooks add skill phrazzld/claude-config --skill llm-evaluation in your terminal to install Llm Evaluation into your Claude Code session. It works immediately after installation.
Llm Evaluation is compatible with Claude Code, Cursor, GitHub Copilot, Windsurf, OpenClaw, Cline, and any AI agent that supports custom system prompts or .cursorrules files.
Yes, Llm Evaluation is completely free and open source. The full source is available on GitHub at https://github.com/phrazzld/claude-config/tree/main/skills/llm-evaluation. You only need a subscription to the AI agent you use it with.