AI/ML · Advanced pack
Model Latency Budgeter
Tune timeout, retry, and concurrency budgets across multi-model routes so orchestration stays fast without silent quality collapse.
CodexClaude CodeKimi Codeorchestrator-mcp
Best use case
Use Model Latency Budgeter when you need to tune timeout, retry, and concurrency budgets across multi-model routes so orchestration stays fast without silent quality collapse, especially when the work is driven by latency budget and timeout policy.
Trigger signals
latency budgettimeout policymodel concurrencyrouting policy
Validation hooks
verify_latency_sloverify_text_unchanged
Install surface
Copy the exact command path you need.
Inspect
pip install "orchestrator-mcp[dashboard]"
orchestrator-mcp skills show model-latency-budgeter
Use
orchestrator-mcp skills export model-latency-budgeter --to ./skillforge-packs
# copy the exported pack into your preferred agent environment
Export
cp -R skills/model-latency-budgeter ./your-agent-skills/model-latency-budgeter
# or open skills/model-latency-budgeter/SKILL.md in a markdown-first client
File patterns
**/*.yaml**/*.yml**/routing/****/config/**
Model preferences
Related skills
Adjacent packs to compose next.
Allocate work across local, cloud, and premium models so teams maximize capability coverage per dollar and per latency budget.
CodexClaude Code
Reshape sprawling repositories and briefs into stable context lanes, memory checkpoints, and retrieval boundaries for long-horizon agent work.
CodexClaude Code
Tune provider routing policy for quality, cost ceilings, and fallback behavior across multiple model subscriptions.
CodexClaude Code