Plugin
efficient-orchestration
Always-on token-efficiency operating model: for every task, minimize output tokens and decide whether to handle the work inline or delegate it to cheaper subagents, then monitor spend and checkpoint when the work is reclassified.
Output thrift (every turn):
- No narration or preamble — lead with the result
- Terse by default; read only what the task requires
- Cap the tool-call loop; never re-derive what's already done
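As a rough illustration of the last rule, the sketch below caps a tool-call loop and reuses earlier results instead of recomputing them. It is not the plugin's implementation: `run_capped`, `MAX_TOOL_CALLS`, and the `tools` mapping are hypothetical names, and the cap of 8 is an arbitrary assumption.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical budget; the plugin does not state a specific number.
MAX_TOOL_CALLS = 8


def run_capped(
    requests: List[Tuple[str, str]],
    tools: Dict[str, Callable[[str], str]],
    max_calls: int = MAX_TOOL_CALLS,
) -> Tuple[List[str], int]:
    """Run (tool_name, argument) requests in order.

    Repeated requests are served from a cache, so nothing is re-derived,
    and the loop stops as soon as the call budget is exhausted.
    Returns the collected results and the number of real tool calls made.
    """
    cache: Dict[Tuple[str, str], str] = {}
    calls = 0
    results: List[str] = []
    for name, arg in requests:
        key = (name, arg)
        if key not in cache:
            if calls >= max_calls:
                break  # budget spent: stop instead of looping further
            cache[key] = tools[name](arg)
            calls += 1
        results.append(cache[key])
    return results, calls
```

For example, with `tools = {"upper": str.upper}`, three requests of which two are identical cost only two real calls; the duplicate is answered from the cache.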
Delegation tiers:
- Light (Haiku-class) — reading, grepping, listing, mechanical work
- Mid (Sonnet-class) — bounded judgment and implementation
- Top (Opus-class) — synthesis, cross-cutting review, high-stakes decisions
The large model holds strategy and verification; mechanical work fans out to the smallest capable model. Never use a larger model than the job needs.
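One way to picture the tiering is a lookup from task kind to the smallest tier listed for it. This is a minimal sketch under assumptions: the `Tier` enum, the task-kind strings in `TASK_TIERS`, and the helper names are invented for illustration, and how the plugin actually classifies work is not specified here.

```python
from collections import defaultdict
from enum import IntEnum
from typing import Dict, Iterable, List


class Tier(IntEnum):
    """Delegation tiers, ordered from cheapest to most capable."""
    LIGHT = 1  # Haiku-class
    MID = 2    # Sonnet-class
    TOP = 3    # Opus-class


# Hypothetical task-kind labels, taken from the tier descriptions above.
TASK_TIERS: Dict[str, Tier] = {
    "read": Tier.LIGHT,
    "grep": Tier.LIGHT,
    "list": Tier.LIGHT,
    "mechanical": Tier.LIGHT,
    "bounded_judgment": Tier.MID,
    "implementation": Tier.MID,
    "synthesis": Tier.TOP,
    "cross_cutting_review": Tier.TOP,
    "high_stakes_decision": Tier.TOP,
}


def pick_tier(task_kind: str) -> Tier:
    """Return the tier listed for a task kind.

    Unknown kinds raise instead of silently defaulting, because the
    source does not say how unclassified work should be handled.
    """
    try:
        return TASK_TIERS[task_kind]
    except KeyError:
        raise ValueError(f"unclassified task kind: {task_kind!r}") from None


def fan_out(task_kinds: Iterable[str]) -> Dict[Tier, List[str]]:
    """Group task kinds by tier so each batch goes to the smallest capable model."""
    batches: Dict[Tier, List[str]] = defaultdict(list)
    for kind in task_kinds:
        batches[pick_tier(kind)].append(kind)
    return dict(batches)
```

Calling `fan_out(["grep", "implementation", "read", "synthesis"])` groups the two reading tasks under `Tier.LIGHT` and leaves only the synthesis step for `Tier.TOP`.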