python-llm
NobodyWho is an inference engine that lets you run LLMs locally and efficiently on any device.