eval-creator-ci Guide

Name: eval-creator-ci
Author: pskoett

[Beta] CI-only eval regression runner using gh-aw (GitHub Agentic Workflows). Runs all eval cases in .evals/ on a schedule or per-PR, reports pass/fail results, and can block merges on regressions. Also creates new eval cases from promoted patterns flagged by learning-aggregator-ci. Use when: you want automated regression testing of promoted rules in CI/headless pipelines. For interactive eval creation and runs, use eval-creator.

188 starsby pskoett

When to use eval-creator-ci

How to use eval-creator-ci

eval-creator-ci is a Claude skill in the SKILL.md format. Add it to your Claude environment from the source repository below, then it activates as a user-invocable skill when your task matches its description.

Skill source

https://raw.githubusercontent.com/pskoett/pskoett-ai-skills/main/plugin/skills/eval-creator-ci/SKILL.md

Details

PlatformClaude

CategoryCode & Development

Invocationuser-invocable

Modelany

Maintainerpskoett

eval-creator-ci Guide

When to use eval-creator-ci

How to use eval-creator-ci

Details

Resources