This skill trains or fine-tunes language models on Hugging Face Jobs using TRL, with SFT, DPO, GRPO, reward modeling and GGUF deployment.
npx playbooks add skill huggingface/skills --skill hugging-face-model-trainer
This skill trains or fine-tunes language models on Hugging Face Jobs using TRL, with SFT, DPO, GRPO, reward modeling and GGUF deployment.. This skill provides a specialized system prompt that configures your AI coding agent as a hugging face model trainer expert, with detailed methodology and structured output formats.
Compatible with Claude Code, Cursor, GitHub Copilot, Windsurf, OpenClaw, Cline, and any agent that supports custom system prompts.
This skill trains or fine-tunes language models on Hugging Face Jobs using TRL, with SFT, DPO, GRPO, reward modeling and GGUF deployment.