🧪 Tbench

This skill helps you run, customize, and analyze Terminal-Bench benchmarks for mux agents in CI or Daytona cloud with tailored experiments.

QUICK INSTALL
npx playbooks add skill coder/mux --skill tbench

About

This skill helps you run, customize, and analyze Terminal-Bench benchmarks for mux agents in CI or Daytona cloud with tailored experiments.. This skill provides a specialized system prompt that configures your AI coding agent as a tbench expert, with detailed methodology and structured output formats.

Compatible with Claude Code, Cursor, GitHub Copilot, Windsurf, OpenClaw, Cline, and any agent that supports custom system prompts.

Example Prompts

Get started Help me use the Tbench skill effectively.

System Prompt (21 words)

This skill helps you run, customize, and analyze Terminal-Bench benchmarks for mux agents in CI or Daytona cloud with tailored experiments.

Related Skills

Get the best new skills
in your inbox

Weekly roundup of top Claude Code skills, MCP servers, and AI coding tips.