Interview question at Avanade

How do you identify duplicates in a dataset using Spark SQL?
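
One common way to approach this is to group by the columns that define a duplicate and keep only the groups with more than one row. The sketch below is a minimal illustration, not a definitive answer: the table name `people` and the columns `name` and `email` are hypothetical, and Python's built-in `sqlite3` stands in for a Spark session so the example runs without a cluster. The query itself uses only standard `GROUP BY` / `HAVING` syntax, which Spark SQL also accepts.

```python
import sqlite3

# Hypothetical table and column names, chosen only for illustration.
# The statement uses plain GROUP BY / HAVING, which Spark SQL also supports.
DUPLICATES_QUERY = """
    SELECT name, email, COUNT(*) AS occurrences
    FROM people
    GROUP BY name, email
    HAVING COUNT(*) > 1
    ORDER BY name, email
"""


def find_duplicates(rows):
    """Return (name, email, occurrences) for every key seen more than once."""
    # sqlite3 is a stand-in engine here (an assumption), not Spark itself.
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE people (name TEXT, email TEXT)")
        conn.executemany("INSERT INTO people VALUES (?, ?)", rows)
        return conn.execute(DUPLICATES_QUERY).fetchall()
    finally:
        conn.close()


sample_rows = [
    ("Ana", "ana@example.com"),
    ("Bob", "bob@example.com"),
    ("Ana", "ana@example.com"),
    ("Cy", "cy@example.com"),
    ("Bob", "bob@example.com"),
    ("Ana", "ana@example.com"),
]

duplicates = find_duplicates(sample_rows)
print(duplicates)
# [('Ana', 'ana@example.com', 3), ('Bob', 'bob@example.com', 2)]
```

In Spark, assuming an existing `SparkSession` named `spark` and a DataFrame named `df`, the same statement could be run by registering the data with `df.createOrReplaceTempView("people")` and then calling `spark.sql(DUPLICATES_QUERY)`. The grouping columns should be whichever columns define a duplicate for the dataset at hand.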