google-agents-cli-eval Guide

Name: google-agents-cli-eval
Author: google

This skill should be used when the user wants to "run an evaluation", "evaluate my ADK agent", "write an evalset", "debug eval scores", "compare eval results", or needs guidance on ADK (Agent Development Kit) evaluation methodology and the eval-fix loop. Covers eval metrics, evalset schema, LLM-as-judge, tool trajectory scoring, and common failure causes. Part of the Google ADK (Agent Development Kit) skills suite. Do NOT use for API code patterns (use google-agents-cli-adk-code), deployment (use google-agents-cli-deploy), or project scaffolding (use google-agents-cli-scaffold).

2,612 starsby google

When to use google-agents-cli-eval

How to use google-agents-cli-eval

google-agents-cli-eval is a Claude skill in the SKILL.md format. Add it to your Claude environment from the source repository below, then it activates as a user-invocable skill when your task matches its description.

Skill source

https://raw.githubusercontent.com/google/agents-cli/main/skills/google-agents-cli-eval/SKILL.md

Details

PlatformClaude

CategoryAI & ML

Invocationuser-invocable

Modelany

Maintainergoogle

LicenseApache-2.0

google-agents-cli-eval Guide

When to use google-agents-cli-eval

How to use google-agents-cli-eval

Details

Resources