1 code implementation • 30 Nov 2021 • Arnau Martí Sarri, Victor Rodriguez-Fernandez
CLIP (Contrastive Language-Image Pretraining) is an efficient method for learning computer vision tasks from natural language supervision that has powered a recent breakthrough in deep learning due to its zero-shot transfer capabilities.