End-to-end benchmark suite for vercel-plugin. Runs realistic projects through skill injection, launches dev servers, verifies everything works, analyzes conversation logs, and produces an improvement report for overnight self-improvement loops.
This skill does not declare a tool allowlist. The agent host applies whatever default tools are available at runtime.
SKILL.md / Manifest
https://raw.githubusercontent.com/vercel-labs/vercel-plugin/main/.claude/skills/benchmark-e2e/SKILL.mdRegistry
github (via claudemarketplaces.com)