Skills for building LLM evaluations: pipeline audit, error analysis, synthetic data generation, LLM-as-Judge design, evaluator validation, RAG evaluation, and annotation interfaces.
Add the marketplace
/plugin marketplace add hamelsmu/evals-skills
Install plugins
/plugin
Run these commands in Claude Code to add this plugin to your environment. The marketplace must be added before you can install its plugins.
Plugin Source
GitHub Repository
hamelsmu/evals-skills
From Marketplace
hamelsmu-evals-skills
Author
@hamelsmu