TRL Training on Hugging Face Jobs
Description
Train and fine-tune language models with TRL (Transformer Reinforcement Learning) on Hugging Face cloud infrastructure. Supports supervised fine-tuning (SFT), Direct Preference Optimization (DPO), Group Relative Policy Optimization (GRPO), and reward modeling. Includes automatic GPU provisioning, real-time monitoring via Trackio, and GGUF conversion for local deployment.
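As a rough illustration of how the training methods named above relate to TRL, the sketch below maps each one to the TRL trainer class that implements it (`SFTTrainer`, `DPOTrainer`, `GRPOTrainer`, `RewardTrainer`). The helper names `trainer_class_name` and `load_trainer_class` and the short method keys are hypothetical, chosen for this example only. They are not part of TRL or of this skill, and the skill itself may select trainers differently.

```python
import importlib

# Method keys are illustrative; the class names are TRL's trainer classes.
TRAINER_CLASS_BY_METHOD = {
    "sft": "SFTTrainer",        # supervised fine-tuning
    "dpo": "DPOTrainer",        # Direct Preference Optimization
    "grpo": "GRPOTrainer",      # Group Relative Policy Optimization
    "reward": "RewardTrainer",  # reward modeling
}


def trainer_class_name(method: str) -> str:
    """Return the TRL trainer class name for a method key (hypothetical helper)."""
    key = method.strip().lower()
    if key not in TRAINER_CLASS_BY_METHOD:
        known = ", ".join(sorted(TRAINER_CLASS_BY_METHOD))
        raise ValueError(f"unknown method {method!r}; expected one of: {known}")
    return TRAINER_CLASS_BY_METHOD[key]


def load_trainer_class(method: str):
    """Import trl lazily and return the trainer class (requires trl installed)."""
    trl = importlib.import_module("trl")
    return getattr(trl, trainer_class_name(method))
```

The lookup works without TRL installed, so a job script could validate the requested method before any GPU is provisioned. Only `load_trainer_class` needs the `trl` package.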
Repository
https://github.com/majiayu000/claude-skill-registry/tree/main/skills/data/trl