Claude
Code & Development
Trust: 55/100 (Fair)llm-evaluation Guide
Master comprehensive evaluation strategies for LLM applications, from automated metrics to human evaluation and A/B testing.
27,615 starsby davila7
When to use llm-evaluation
Master comprehensive evaluation strategies for LLM applications, from automated metrics to human evaluation and A/B testing.
How to use llm-evaluation
llm-evaluation is a Claude skill in the SKILL.md format. Add it to your Claude environment from the source repository below, then it activates as a user-invocable skill when your task matches its description.
Details
PlatformClaude
CategoryCode & Development
Invocationuser-invocable
Modelany
Maintainerdavila7
LicenseMIT