1 code implementation • 16 Jun 2023 • Rabiul Awal, Le Zhang, Aishwarya Agrawal
In summary, our research sheds light on the intricacies of prompting strategies in VLMs for VQA, emphasizing the synergistic use of captions, templates, and pre-processing to enhance model efficacy.
2 code implementations • 15 Jun 2023 • Le Zhang, Rabiul Awal, Aishwarya Agrawal
However, the compositional reasoning abilities of existing VLMs remain subpar.