Question d’entretien chez kipi.ai

How to evaluate large language models?