Search Results for author: Sharadh Ramaswamy

Found 1 papers, 1 papers with code

AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection

1 code implementation • 5 Jan 2019 • Joseph Roth, Sourish Chaudhuri, Ondrej Klejch, Radhika Marvin, Andrew Gallagher, Liat Kaver, Sharadh Ramaswamy, Arkadiusz Stopczynski, Cordelia Schmid, Zhonghua Xi, Caroline Pantofaru

The dataset contains temporally labeled face tracks in video, where each face instance is labeled as speaking or not, and whether the speech is audible.

Audio-Visual Active Speaker Detection speaker-diarization +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.