1 code implementation • 19 Jun 2023 • Shivaen Ramshetty, Gaurav Verma, Srijan Kumar
The robustness of multimodal deep learning models to realistic changes in the input text is critical for their applicability to important tasks such as text-to-image retrieval and cross-modal entailment.