llm-obs-eval-pipeline

Community

End-to-end pipeline from unlabeled ml_app traces to a bootstrapped evaluator suite. Runs trace classification → root cause analysis → eval bootstrap in sequence with user checkpoints. Use when user says "run the eval pipeline", "go from traces to evals", "bootstrap evals end to end", "classify then RCA then bootstrap", "build an eval set from scratch", or wants a guided walkthrough from production data to evaluator code.

Claude

121 stars Updated 4 days ago

Allowed Tools

This skill does not declare a tool allowlist. The agent host applies whatever default tools are available at runtime.

Source

SKILL.md / Manifest

https://raw.githubusercontent.com/datadog-labs/agent-skills/main/dd-llmo/llm-obs-eval-pipeline/SKILL.md

Registry

github (via claudemarketplaces.com)

Trust Score

55Fair

Verification10/30

Scope Tightness

llm-obs-eval-pipeline

Allowed Tools

Source

Trust Score

Details