benchmark-sandbox Guide

Name: benchmark-sandbox
Author: vercel-labs

Run vercel-plugin eval scenarios in Vercel Sandboxes instead of local WezTerm panels. The skill provisions ephemeral microVMs with Claude Code and the plugin pre-installed, runs benchmark prompts, extracts hook artifacts, and produces coverage reports.

176 stars, by vercel-labs

When to use benchmark-sandbox

How to use benchmark-sandbox

benchmark-sandbox is a Claude skill in the SKILL.md format. Add it to your Claude environment from the source repository below. Once installed, it is available as a user-invocable skill and activates when your task matches its description.

Skill source

https://raw.githubusercontent.com/vercel-labs/vercel-plugin/main/.claude/skills/benchmark-sandbox/SKILL.md
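
One way to add the skill is to download SKILL.md into a per-skill folder. The sketch below is a minimal illustration, not an official installer. It assumes your Claude environment reads personal skills from ~/.claude/skills/<skill-name>/SKILL.md; adjust the root if yours differs. The helper names skill_destination and install_skill are hypothetical, introduced only for this example.

```python
from pathlib import Path
from urllib.request import urlopen

# Source URL as listed under "Skill source".
SKILL_URL = (
    "https://raw.githubusercontent.com/vercel-labs/vercel-plugin/"
    "main/.claude/skills/benchmark-sandbox/SKILL.md"
)


def skill_destination(skills_root: Path, url: str = SKILL_URL) -> Path:
    """Return where SKILL.md should be written (hypothetical helper).

    The skill name is taken from the directory that contains SKILL.md
    in the URL path, here "benchmark-sandbox".
    """
    skill_name = url.rstrip("/").split("/")[-2]
    return skills_root / skill_name / "SKILL.md"


def install_skill(skills_root: Path, url: str = SKILL_URL) -> Path:
    """Download SKILL.md into skills_root/<skill-name>/ (hypothetical helper)."""
    dest = skill_destination(skills_root, url)
    dest.parent.mkdir(parents=True, exist_ok=True)
    with urlopen(url, timeout=30) as response:
        dest.write_bytes(response.read())
    return dest


if __name__ == "__main__":
    # Assumption: ~/.claude/skills is the personal skills directory.
    print(install_skill(Path.home() / ".claude" / "skills"))
```

Running the script writes the file and prints its path. Pass a project-local root instead (for example Path(".claude") / "skills") if you prefer to scope the skill to a single repository.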

Details

Platform: Claude
Category: AI & ML
Invocation: user-invocable
Model: any
Maintainer: vercel-labs
