1 code implementation • 18 Jul 2023 • Kaavya Rekanar, Ciarán Eising, Ganesh Sistu, Martin Hayes
This short paper presents a preliminary analysis of three popular Visual Question Answering (VQA) models, namely ViLBERT, ViLT, and LXMERT, in the context of answering questions relating to driving scenarios.