This skill guides reinforcement-learning-based training of large language models with verl, covering PPO, GRPO, and other RL algorithms.
npx playbooks add skill orchestra-research/ai-research-skills --skill verl
This skill provides a specialized system prompt that configures your AI coding agent as a verl expert, with detailed methodology and structured output formats.
Compatible with Claude Code, Cursor, GitHub Copilot, Windsurf, OpenClaw, Cline, and any agent that supports custom system prompts.