llm-capacity
Capacity planning for reserved LLM throughput: latency, headroom, and synthetic workload simulation.
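
As a rough illustration of what headroom against a synthetic workload can mean, here is a minimal sketch. It is not taken from this project: the function names `simulate_token_demand` and `headroom`, the Poisson arrival model, the exponentially distributed request sizes, and the simplification that a request's tokens are all charged to the second in which it arrives are assumptions made for the example.

```python
import random


def simulate_token_demand(rate_per_s, mean_tokens, duration_s, seed=0):
    """Per-second token demand for a synthetic request stream.

    Assumptions (illustrative only): Poisson arrivals at `rate_per_s`,
    exponentially distributed request sizes with mean `mean_tokens`,
    and every token of a request charged to its arrival second.
    """
    rng = random.Random(seed)
    demand = [0.0] * duration_s
    t = rng.expovariate(rate_per_s)  # time of the first arrival
    while t < duration_s:
        demand[int(t)] += rng.expovariate(1.0 / mean_tokens)
        t += rng.expovariate(rate_per_s)  # gap until the next arrival
    return demand


def headroom(reserved_tps, demand, percentile=0.99):
    """Fraction of reserved tokens/s left unused at the given demand percentile.

    A negative value means demand at that percentile exceeds the reservation.
    """
    ordered = sorted(demand)
    idx = min(len(ordered) - 1, int(percentile * len(ordered)))
    return 1.0 - ordered[idx] / reserved_tps


if __name__ == "__main__":
    demand = simulate_token_demand(rate_per_s=5.0, mean_tokens=200.0, duration_s=600, seed=1)
    print(f"p99 headroom at 4000 tokens/s reserved: {headroom(4000.0, demand):.2%}")
```

Sizing against a high percentile rather than the mean is one common choice, since bursts, not average load, are what exhaust a reservation. A fuller model would also spread each request's tokens over its generation time and track queueing latency.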