Question d’entretien chez IBM

Clean a data set that has repeated words