Skills for building LLM evaluations
Automated code review loop plugin for Claude Code
Skills for building LLM evaluations: pipeline audit, error analysis, synthetic data generation, LLM-as-Judge design, evaluator validation, RAG evaluation, and annotation interfaces.
Automated code review loop: Claude implements, Codex reviews independently, Claude addresses feedback
CLI tools for processing YouTube videos, Zoom recordings, and newsletters