no code implementations • 19 Sep 2023 • Clement Laroudie, Andrei Bursuc, Mai Lan Ha, Gianni Franchi
This paper examines the robustness of a multi-modal computer vision model, CLIP (Contrastive Language-Image Pretraining), in the context of unsupervised learning.