largelanguagemodel
AI-powered testing agent
Fast LLM inference with 2.8x speedup using speculative decoding