Search Results for author: Davide Berghi

Found 5 papers, 2 papers with code

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

1 code implementation14 Dec 2023 Davide Berghi, Peipei Wu, Jinzheng Zhao, Wenwu Wang, Philip J. B. Jackson

Sound event localization and detection (SELD) combines two subtasks: sound event detection (SED) and direction of arrival (DOA) estimation.

Data Augmentation Event Detection +2

Audio Inputs for Active Speaker Detection and Localization via Microphone Array

no code implementations27 Jul 2023 Davide Berghi, Philip J. B. Jackson

This study considers the problem of detecting and locating an active talker's horizontal position from multichannel audio captured by a microphone array.

Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research

no code implementations4 Dec 2022 Davide Berghi, Marco Volino, Philip J. B. Jackson

This is partly due to the lack of available datasets enabling audio-visual research in this direction.

Visually Supervised Speaker Detection and Localization via Microphone Array

no code implementations7 Mar 2022 Davide Berghi, Adrian Hilton, Philip J. B. Jackson

We propose to generate weak labels using a pre-trained active speaker detector on pre-extracted face tracks.

Cannot find the paper you are looking for? You can Submit a new open access paper.