QA harness for agentic systems: scenario suites, determinism/flake controls, tool sandboxing, scoring rubrics (including LLM-as-judge), and regression protocols covering success, safety, reliability, latency, and cost.
npx playbooks add skill vasilyu1983/ai-agents-public --skill qa-agent-testing
QA harness for agentic systems: scenario suites, determinism/flake controls, tool sandboxing, scoring rubrics (including LLM-as-judge), and regression protocols covering success, safety, reliability, latency, and cost.
At 25 words, this compact prompt gives your agent specialized developer workflow expertise with structured patterns and output formats. Install via CLI or copy the prompt below.
QA harness for agentic systems: scenario suites, determinism/flake controls, tool sandboxing, scoring rubrics (including LLM-as-judge), and regression protocols covering success, safety, reliability, latency, and cost.
Qa Agent Testing is a free developer workflow skill for AI coding agents. QA harness for agentic systems: scenario suites, determinism/flake controls, tool sandboxing, scoring rubrics (including LLM-as-judge), and regression protocols covering success, safety, reliability, latency, and cost.. It provides a specialized system prompt that configures your agent with developer workflow expertise.
Run npx playbooks add skill vasilyu1983/ai-agents-public --skill qa-agent-testing in your terminal to install Qa Agent Testing into your Claude Code session. It works immediately after installation.
Qa Agent Testing is compatible with Claude Code, Cursor, GitHub Copilot, Windsurf, OpenClaw, Cline, and any AI agent that supports custom system prompts or .cursorrules files.
Yes, Qa Agent Testing is completely free and open source. The full source is available on GitHub at https://github.com/vasilyu1983/ai-agents-public/tree/main/frameworks/shared-skills/skills/qa-agent-testing. You only need a subscription to the AI agent you use it with.
Weekly roundup of top Claude Code skills, MCP servers, and AI coding tips.