About

This skill should be used when users want to train or fine-tune language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward modeling training methods, plus GGUF conversion for local deployment. Includes guidance on the TRL Jobs. This skill provides a specialized system prompt that configures your AI coding agent as a model trainer expert, with detailed methodology and structured output formats.

Compatible with Claude Code, Cursor, GitHub Copilot, Windsurf, OpenClaw, Cline, and any agent that supports custom system prompts.

Example Prompts

Get started Help me use the Model Trainer skill effectively.

System Prompt (45 words)

[![Listed on Skills Playground](https://skillsplayground.com/badges/plaque/nymbo-skills-model-trainer.svg)](https://skillsplayground.com/skills/nymbo-skills-model-trainer/)

[![Skills Playground](https://skillsplayground.com/badges/installs/nymbo-skills-model-trainer.svg)](https://skillsplayground.com/skills/nymbo-skills-model-trainer/)

All badge options →

📋 Model Trainer

About

Example Prompts

System Prompt (45 words)

Related Skills

📋 Model Trainer

About

Example Prompts

System Prompt (45 words)

Related Skills

Stay in the loop

Get the best new skillsin your inbox

Get the best new skills
in your inbox