Microsoft interview question

Explain the Transformer architecture and implement self-attention from scratch.
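
One way to approach the coding part is single-head scaled dot-product self-attention, softmax(QK^T / sqrt(d_k)) V, written in plain Python with no libraries. This is only a minimal sketch: the function names (`matmul`, `softmax`, `self_attention`) and the weight arguments `w_q`, `w_k`, `w_v` are illustrative choices, not part of the original question, and it assumes a single unbatched sequence with no masking and no multi-head split.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)  # subtract the max so exp() cannot overflow
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention (illustrative sketch).

    x:             (seq_len, d_model) input token vectors
    w_q, w_k, w_v: (d_model, d_k) projection matrices (hypothetical names)
    Returns (output, weights): output is (seq_len, d_k),
    weights is (seq_len, seq_len).
    """
    # Project the same input into queries, keys, and values.
    q = matmul(x, w_q)
    k = matmul(x, w_k)
    v = matmul(x, w_v)

    # Scores are Q K^T, scaled by sqrt(d_k) to keep the softmax well-behaved.
    d_k = len(k[0])
    k_t = [list(col) for col in zip(*k)]
    scores = [[s / math.sqrt(d_k) for s in row] for row in matmul(q, k_t)]

    # Each row of weights says how much one token attends to every token.
    weights = [softmax(row) for row in scores]

    # The output is the attention-weighted sum of the value vectors.
    return matmul(weights, v), weights


# Example: three tokens of dimension 2, identity projections.
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]
output, weights = self_attention(x, identity, identity, identity)
```

A fuller answer would likely extend this to multiple heads, add a causal mask for decoder-style models, and place the attention block alongside the residual connections, layer normalization, and feed-forward sublayer that make up a Transformer layer.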