Claude
Frontend & Web
Trust: 55/100 (Fair)verl-rl-training Guide
Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexible infrastructure backends.
8,991 starsby zechenzhangagi
When to use verl-rl-training
Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexible infrastructure backends.
How to use verl-rl-training
verl-rl-training is a Claude skill in the SKILL.md format. Add it to your Claude environment from the source repository below, then it activates as a user-invocable skill when your task matches its description.
Details
PlatformClaude
CategoryFrontend & Web
Invocationuser-invocable
Modelany
Maintainerzechenzhangagi
LicenseMIT