NVIDIA interview question

Questions about quantization, inference optimization, and LLM system design.