claudeindex
Plugin

web-bench

WebBench benchmark runner — executes real-world browser tasks from the Halluminate/WebBench dataset, scores via LLM-as-judge, and produces evaluation reports

Installation

1

Add the marketplace

/plugin marketplace add lespaceman/athena-workflow-marketplace
2

Install plugins

/plugin

Run these commands in Claude Code to add this plugin to your environment. The marketplace must be added before you can install its plugins.