Search Results for author: Asmar Nadeem

Found 2 papers, 0 papers with code

CAD -- Contextual Multi-modal Alignment for Dynamic AVQA

no code implementations25 Oct 2023 Asmar Nadeem, Adrian Hilton, Robert Dawes, Graham Thomas, Armin Mustafa

In the context of Audio Visual Question Answering (AVQA) tasks, the audio visual modalities could be learnt on three levels: 1) Spatial, 2) Temporal, and 3) Semantic.

Audio-visual Question Answering Audio-Visual Question Answering (AVQA) +2

Cannot find the paper you are looking for? You can Submit a new open access paper.