throttle-requests
A modular way of implementing rate limiting in Python, with a few handy default implementations.
A simple, generic rate limiter that prevents too many function calls or HTTP requests.
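As a rough illustration of what such a generic limiter can look like, here is a minimal sliding-window sketch. The class name `SlidingWindowLimiter`, its parameters, and the injectable `clock` are assumptions made for this example and are not taken from any of the packages described here.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Hypothetical helper: allow at most `max_calls` per `period` seconds."""

    def __init__(self, max_calls, period, clock=time.monotonic):
        self.max_calls = max_calls
        self.period = period
        self.clock = clock        # injectable so the limiter can be tested
        self._calls = deque()     # timestamps of the calls that were allowed

    def try_acquire(self):
        """Return True and record the call if it fits in the current window."""
        now = self.clock()
        # Drop timestamps that have left the window.
        while self._calls and now - self._calls[0] >= self.period:
            self._calls.popleft()
        if len(self._calls) < self.max_calls:
            self._calls.append(now)
            return True
        return False
```

A caller would check `try_acquire()` before each function call or HTTP request and sleep, queue, or reject the work when it returns `False`.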
Multi-resource rate limiting for LLM APIs. Reserve tokens before you call, refund what you don't use, stay under the limit across workers.
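The reserve-then-refund idea can be sketched as follows. This is an illustrative fixed-window budget, not the API of any package listed here: `ResourceBudget`, `reserve`, `refund`, and the resource names `requests` and `tokens` are all hypothetical. The lock only coordinates threads within one process; sharing a limit across separate worker processes would need a shared store, which this sketch leaves out.

```python
import threading
import time


class ResourceBudget:
    """Hypothetical fixed-window budget over several named resources."""

    def __init__(self, limits, window=60.0, clock=time.monotonic):
        self.limits = dict(limits)    # e.g. {"requests": 2, "tokens": 1000}
        self.window = window          # window length in seconds
        self.clock = clock
        self._used = {name: 0 for name in self.limits}
        self._window_start = clock()
        self._lock = threading.Lock()

    def _roll_window(self):
        # Reset all usage once the current window has elapsed.
        now = self.clock()
        if now - self._window_start >= self.window:
            self._used = {name: 0 for name in self.limits}
            self._window_start = now

    def reserve(self, **amounts):
        """Reserve every requested amount, or nothing if any limit would be exceeded."""
        with self._lock:
            self._roll_window()
            if any(self._used[k] + v > self.limits[k] for k, v in amounts.items()):
                return False
            for k, v in amounts.items():
                self._used[k] += v
            return True

    def refund(self, **amounts):
        """Give back the part of a reservation that was not used."""
        # Simplification: a refund that arrives after the window has rolled
        # over is applied to the new window's usage.
        with self._lock:
            for k, v in amounts.items():
                self._used[k] = max(0, self._used[k] - v)


# Usage: reserve a worst-case estimate, then refund the difference.
budget = ResourceBudget({"requests": 2, "tokens": 1000})
if budget.reserve(requests=1, tokens=600):
    tokens_actually_used = 250    # placeholder for the real usage count
    budget.refund(tokens=600 - tokens_actually_used)
```

Reserving the worst case up front keeps concurrent callers from collectively overshooting the limit, and the refund returns capacity that a conservative estimate would otherwise waste.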