Skip to content

AI/ML · Global library

LLM Load Balancer Designer

Design intelligent load balancing for LLM inference with request routing, session affinity, and dynamic capacity management

CodexClaude CodeKimi Codeorchestrator-mcp

Best use case

Use LLM Load Balancer Designer when you need to design intelligent load balancing for LLM inference with request routing, session affinity, and dynamic capacity management, especially when the work is driven by load balancing and request routing.

Trigger signals

load balancingrequest routingsession affinityweighted routingleast connections

Validation hooks

load-distributionsession-affinity

Install surface

Copy the exact command path you need.

Inspect

pip install "orchestrator-mcp[dashboard]"
orchestrator-mcp skills show llm-load-balancer-designer

Use

orchestrator-mcp skills export llm-load-balancer-designer --to ./skillforge-packs
# copy the exported pack into your preferred agent environment

Export

cp -R skills/llm-load-balancer-designer ./your-agent-skills/llm-load-balancer-designer
# or open skills/llm-load-balancer-designer/SKILL.md in a markdown-first client

File patterns

*.py*.yamlnginx.confloadbalancer/*.py

Model preferences

claude-sonnet-4gpt-4oclaude-haiku-3

Related skills

Adjacent packs to compose next.

AI/MLGlobal library

Agent Lifecycle Manager

Open pack

Manage complete agent lifecycles from initialization through graceful shutdown with health monitoring, scaling, and resource optimization

CodexClaude Code
AI/MLGlobal library

Agent Memory Designer

Open pack

Design short-term, long-term, and episodic memory layers for agents without turning retrieval into an unbounded context leak.

CodexClaude Code