no code implementations • 5 Jul 2022 • Rosanna Milner, Md Asif Jalal, Raymond W. M. Ng, Thomas Hain
This shows positive information transfer from acted datasets to those with more natural emotions and the benefits from training on different corpora.
no code implementations • CONLL 2018 • Rory Beard, Ritwik Das, Raymond W. M. Ng, P. G. Keerthana Gopalakrishnan, Luka Eerens, Pawel Swietojanski, Ondrej Miksik
Natural human communication is nuanced and inherently multi-modal.
no code implementations • 10 Jun 2016 • Mortaza Doulaty, Oscar Saz, Raymond W. M. Ng, Thomas Hain
Huge amounts of digital videos are being produced and broadcast every day, leading to giant media archives.
no code implementations • 21 Dec 2015 • Oscar Saz, Mortaza Doulaty, Salil Deena, Rosanna Milner, Raymond W. M. Ng, Madina Hasan, Yu-Lan Liu, Thomas Hain
We describe the University of Sheffield system for participation in the 2015 Multi-Genre Broadcast (MGB) challenge task of transcribing multi-genre broadcast shows.
no code implementations • 16 Nov 2015 • Mortaza Doulaty, Oscar Saz, Raymond W. M. Ng, Thomas Hain
This paper presents a new method for discovering latent domains in diverse speech data, for use in adapting Deep Neural Networks (DNNs) for Automatic Speech Recognition.
no code implementations • 13 Sep 2015 • Raymond W. M. Ng, Mortaza Doulaty, Rama Doddipatla, Wilker Aziz, Kashif Shah, Oscar Saz, Madina Hasan, Ghada Alharbi, Lucia Specia, Thomas Hain
The USFD primary system incorporates state-of-the-art ASR and MT techniques and gives a BLEU score of 23.45 and 14.75 on the English-to-French and English-to-German speech-to-text translation task with the IWSLT 2014 data.