Question d’entretien chez OpenAI

Design a distributed rate limiter for an LLM inference API