agentic-eval Guide

Name: agentic-eval
Author: github

Patterns and techniques for evaluating and improving AI agent outputs. Use this skill when: - Implementing self-critique and reflection loops - Building evaluator-optimizer pipelines for quality-critical generation - Creating test-driven code refinement workflows - Designing rubric-based or LLM-as-judge evaluation systems - Adding iterative improvement to agent outputs (code, reports, analysis) - Measuring and improving agent response quality

33,939 starsby github

When to use agentic-eval

How to use agentic-eval

agentic-eval is a Claude skill in the SKILL.md format. Add it to your Claude environment from the source repository below, then it activates as a user-invocable skill when your task matches its description.

Skill source

https://raw.githubusercontent.com/github/awesome-copilot/main/skills/agentic-eval/SKILL.md

Details

PlatformClaude

CategoryAI & ML

Invocationuser-invocable

Modelany

Maintainergithub

LicenseMIT

agentic-eval Guide

When to use agentic-eval

How to use agentic-eval

Details

Resources