Search Results for author: Gurneet Arora

Found 1 papers, 0 papers with code

iPerceive: Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering

no code implementations • 16 Nov 2020 • Aman Chadha, Gurneet Arora, Navpreet Kaloty

Most prior art in visual understanding relies solely on analyzing the "what" (e. g., event recognition) and "where" (e. g., event localization), which in some cases, fails to describe correct contextual relationships between events or leads to incorrect underlying visual attention.

Ranked #4 on Video Question Answering on TVQA

Common Sense Reasoning Dense Video Captioning +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.