Question d’entretien chez Kyndryl

Can we use CNN in multi-modal architecture for image processing?