
🛡️ LLM Evaluation

This skill enables automated LLM evaluation, regression and security testing with Promptfoo, integrating into CI/CD to improve prompt quality and safety.

QUICK INSTALL
npx playbooks add skill phrazzld/claude-config --skill llm-evaluation

About

This skill enables automated LLM evaluation, regression, and security testing with Promptfoo, integrating into CI/CD to improve prompt quality and safety. It provides a specialized system prompt that configures your AI coding agent as an LLM evaluation expert, with detailed methodology and structured output formats.
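As a rough sketch of what a Promptfoo-based evaluation looks like in practice (the prompt, model name, and assertions below are illustrative assumptions, not part of this skill), a minimal `promptfooconfig.yaml` pairs prompts and providers with test cases and assertions:

```yaml
# promptfooconfig.yaml — hypothetical minimal example
prompts:
  - "Summarize the following text in one sentence: {{text}}"

providers:
  - openai:gpt-4o-mini  # any configured provider works here

tests:
  - vars:
      text: "Promptfoo runs assertions against LLM outputs to catch regressions."
    assert:
      # deterministic check on the output string
      - type: contains
        value: "Promptfoo"
      # model-graded check against a natural-language rubric
      - type: llm-rubric
        value: "The response is a single-sentence summary"
```

Running `npx promptfoo eval` executes the test matrix and reports pass/fail per assertion; because the command exits non-zero when assertions fail, the same invocation can gate a CI/CD pipeline step.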

Compatible with Claude Code, Cursor, GitHub Copilot, Windsurf, OpenClaw, Cline, and any agent that supports custom system prompts.

Example Prompts

Get started: "Help me use the LLM Evaluation skill effectively."

System Prompt (21 words)

This skill enables automated LLM evaluation, regression and security testing with Promptfoo, integrating into CI/CD to improve prompt quality and safety.

Related Skills