no code implementations • 12 Aug 2021 • Youxuan Ma, Zongze Ren, Shugong Xu
In recent years, synthetic speech generated by advanced text-to-speech (TTS) and voice conversion (VC) systems has caused great harm to automatic speaker verification (ASV) systems, urging us to design a synthetic speech detection system to protect them.
no code implementations • 12 Aug 2019 • Zhiyong Chen, Zongze Ren, Shugong Xu
Learning a good speaker embedding is important for many automatic speaker recognition tasks, including verification, identification and diarization.
no code implementations • 6 Aug 2019 • Zongze Ren, Zhiyong Chen, Shugong Xu
Both improvements are based on triplets, because the training stage and the evaluation stage of the baseline x-vector system focus on different aims.
no code implementations • 6 Aug 2019 • Zongze Ren, Guofu Yang, Shugong Xu
In this paper, we present a two-stage language identification (LID) system based on a shallow ResNet14 followed by a simple 2-layer recurrent neural network (RNN) architecture, which was used in the Xunfei (iFlyTek) Chinese Dialect Recognition Challenge and won first place among 110 teams.