Autonomous experiment loop that edits code, runs benchmarks, measures metrics, and keeps improvements or reverts — repeating forever. Works for any optimization target: LLM training loss, test speed, bundle size, build time, Lighthouse scores, and more.