Search Results for author: Asmar Nadeem

Found 2 papers, 0 papers with code

CAD -- Contextual Multi-modal Alignment for Dynamic AVQA

no code implementations • 25 Oct 2023 • Asmar Nadeem, Adrian Hilton, Robert Dawes, Graham Thomas, Armin Mustafa

In the context of Audio Visual Question Answering (AVQA) tasks, the audio visual modalities could be learnt on three levels: 1) Spatial, 2) Temporal, and 3) Semantic.

Ranked #3 on Audio-visual Question Answering on MUSIC-AVQA

Audio-visual Question Answering Audio-Visual Question Answering (AVQA) +2

Paper
Add Code

SEM-POS: Grammatically and Semantically Correct Video Captioning

no code implementations • 26 Mar 2023 • Asmar Nadeem, Adrian Hilton, Robert Dawes, Graham Thomas, Armin Mustafa

Generating grammatically and semantically correct captions in video captioning is a challenging task.

Ranked #13 on Video Captioning on MSVD

POS Video Captioning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.