agent-eval Guide

Name: agent-eval
Author: affaan-m

カスタムタスクでコーディングエージェント（Claude Code、Aider、Codex など）をヘッドツーヘッドで比較し、合格率、コスト、時間、一貫性のメトリクスを測定します

195,887 starsby affaan-m

When to use agent-eval

How to use agent-eval

agent-eval is a Claude skill in the SKILL.md format. Add it to your Claude environment from the source repository below, then it activates as a user-invocable skill when your task matches its description.

Skill source

https://raw.githubusercontent.com/affaan-m/everything-claude-code/main/docs/ja-JP/skills/agent-eval/SKILL.md

Details

PlatformClaude

CategoryAI & ML

Invocationuser-invocable

Modelany

Maintaineraffaan-m

LicenseMIT

agent-eval Guide

When to use agent-eval

How to use agent-eval

Details

Resources