This skill helps you run, customize, and analyze Terminal-Bench benchmarks for mux agents in CI or Daytona cloud with tailored experiments.
npx playbooks add skill coder/mux --skill tbench
Use Tbench to configure Claude Code, Cursor, or Copilot for testing & QA: it helps you run, customize, and analyze Terminal-Bench benchmarks for mux agents in CI or Daytona cloud with tailored experiments.
This compact 21-word instruction set is purpose-built for testing & QA work in AI coding agents. Install with a single command.
Tbench is a free testing & QA skill for AI coding agents. The skill helps you run, customize, and analyze Terminal-Bench benchmarks for mux agents in CI or Daytona cloud with tailored experiments. It provides a specialized system prompt that configures your agent with testing & QA expertise.
Run npx playbooks add skill coder/mux --skill tbench in your terminal to install Tbench into your Claude Code session. It works immediately after installation.
Tbench is compatible with Claude Code, Cursor, GitHub Copilot, Windsurf, OpenClaw, Cline, and any AI agent that supports custom system prompts or .cursorrules files.
Yes, Tbench is completely free and open source. The full source is available on GitHub at https://github.com/coder/mux/tree/main/.mux/skills/tbench. You only need a subscription to the AI agent you use it with.