Get a week free of Claude Code →

🧪 Phoenix Evals

This skill helps you build and run AI evaluators with Phoenix, combining code-first evaluators and LLM nuance for robust validation.

QUICK INSTALL
npx playbooks add skill arize-ai/phoenix --skill phoenix-evals

About

This skill helps you build and run AI evaluators with Phoenix, combining code-first evaluators and LLM nuance for robust validation.. This skill provides a specialized system prompt that configures your AI coding agent as a phoenix evals expert, with detailed methodology and structured output formats.

Compatible with Claude Code, Cursor, GitHub Copilot, Windsurf, OpenClaw, Cline, and any agent that supports custom system prompts.

Example Prompts

Get started Help me use the Phoenix Evals skill effectively.

System Prompt (20 words)

This skill helps you build and run AI evaluators with Phoenix, combining code-first evaluators and LLM nuance for robust validation.

Related Skills