This skill helps you build and run AI evaluators with Phoenix, combining code-first evaluators and LLM nuance for robust validation.
npx playbooks add skill arize-ai/phoenix --skill phoenix-evals
Phoenix Evals is a free testing & qa skill that configures AI coding agents to this skill helps you build and run ai evaluators with phoenix, combining code-first evaluators and llm nuance for robust validation.
Its 20-word system prompt specializes your agent in testing & qa with structured methodology and proven output formats. Install with one command to activate immediately.
This skill helps you build and run AI evaluators with Phoenix, combining code-first evaluators and LLM nuance for robust validation.
Phoenix Evals is a free testing & qa skill for AI coding agents. This skill helps you build and run AI evaluators with Phoenix, combining code-first evaluators and LLM nuance for robust validation.. It provides a specialized system prompt that configures your agent with testing & qa expertise.
Run npx playbooks add skill arize-ai/phoenix --skill phoenix-evals in your terminal to install Phoenix Evals into your Claude Code session. It works immediately after installation.
Phoenix Evals is compatible with Claude Code, Cursor, GitHub Copilot, Windsurf, OpenClaw, Cline, and any AI agent that supports custom system prompts or .cursorrules files.
Yes, Phoenix Evals is completely free and open source. The full source is available on GitHub at https://github.com/arize-ai/phoenix/tree/main/skills/phoenix-evals. You only need a subscription to the AI agent you use it with.
Weekly roundup of top Claude Code skills, MCP servers, and AI coding tips.