Claude
Code & Development
Trust: 55/100 (Fair)devtu-benchmark-harness Guide
Continuous improvement system for ToolUniverse tools, skills, and plugin. Run benchmarks, diagnose failures, route fixes to devtu skills, retest. Use after skill optimization, tool additions, or as regression check.
1,384 starsby mims-harvard
When to use devtu-benchmark-harness
Continuous improvement system for ToolUniverse tools, skills, and plugin. Run benchmarks, diagnose failures, route fixes to devtu skills, retest. Use after skill optimization, tool additions, or as regression check.
How to use devtu-benchmark-harness
devtu-benchmark-harness is a Claude skill in the SKILL.md format. Add it to your Claude environment from the source repository below, then it activates as a user-invocable skill when your task matches its description.
Details
PlatformClaude
CategoryCode & Development
Invocationuser-invocable
Modelany
Maintainermims-harvard
LicenseApache-2.0