no code implementations • 16 Jan 2024 • Jenthe Thienpondt, Kris Demuynck
In this paper, we present ECAPA2, a novel hybrid neural network architecture and training strategy to produce robust speaker embeddings.
no code implementations • 10 Jul 2023 • Jenthe Thienpondt, Caroline M. Speksnijder, Kris Demuynck
In this paper, we analyze the behavior of speaker embeddings of patients during oral cancer treatment.
no code implementations • 7 Apr 2023 • Jenthe Thienpondt, Nilesh Madhu, Kris Demuynck
Most speaker verification systems are designed with the assumption of a single speaker being present in a given audio segment.
no code implementations • 19 Jun 2022 • Jenthe Thienpondt, Kris Demuynck
This can mainly be attributed to the absence of large children's speech corpora to train robust ASR models and the resulting domain mismatch when decoding children's speech with systems trained on adult data.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 18 Oct 2021 • Jenthe Thienpondt, Brecht Desplanques, Kris Demuynck
This paper contains a post-challenge performance analysis on cross-lingual speaker verification of the IDLab submission to the VoxCeleb Speaker Recognition Challenge 2021 (VoxSRC-21).
no code implementations • 9 Sep 2021 • Jenthe Thienpondt, Brecht Desplanques, Kris Demuynck
The final system fusion with two ECAPA CNN-TDNNs and three SE-ResNets enhanced with frequency positional information achieved a third place on the VoxSRC-21 leaderboard for both track 1 and 2 with a minDCF of 0. 1291 and 0. 1313 respectively.
no code implementations • 6 Apr 2021 • Jenthe Thienpondt, Brecht Desplanques, Kris Demuynck
These learnable feature map biases along the frequency axis offer this architecture a straightforward way to exploit frequency positional information.
no code implementations • 3 Apr 2021 • Nauman Dawalatabad, Mirco Ravanelli, François Grondin, Jenthe Thienpondt, Brecht Desplanques, Hwidong Na
Learning robust speaker embeddings is a crucial step in speaker diarization.
no code implementations • 15 Jul 2020 • Jenthe Thienpondt, Brecht Desplanques, Kris Demuynck
In this paper we describe the top-scoring IDLab submission for the text-independent task of the Short-duration Speaker Verification (SdSV) Challenge 2020.