Search Results for author: Iuliia Nigmatulina

Found 13 papers, 5 papers with code

ASR for Non-standardised Languages with Dialectal Variation: the case of Swiss German

no code implementations • VarDial (COLING) 2020 • Iuliia Nigmatulina, Tannon Kew, Tanja Samardzic

A formal comparison shows that the system trained on the normalised transcriptions achieves better results in word error rate (WER) (29. 39%) but underperforms at the character level, suggesting dialectal transcriptions offer a viable solution for downstream applications where dialectal differences are important.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Implementing contextual biasing in GPU decoder for online ASR

1 code implementation • 23 Jun 2023 • Iuliia Nigmatulina, Srikanth Madikeri, Esaú Villatoro-Tello, Petr Motliček, Juan Zuluaga-Gomez, Karthik Pandia, Aravind Ganapathiraju

GPU decoding significantly accelerates the output of ASR predictions.

Decoder Language Modelling

Paper
Code

Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding

no code implementations • 2 May 2023 • Juan Zuluaga-Gomez, Iuliia Nigmatulina, Amrutha Prasad, Petr Motlicek, Driss Khalil, Srikanth Madikeri, Allan Tart, Igor Szoke, Vincent Lenders, Mickael Rigault, Khalid Choukri

This paper explores the lessons learned from the ATCO2 project, a project that aimed to develop a unique platform to collect and preprocess large amounts of ATC data from airspace in real time.

Automatic Speech Recognition Language Identification +2

Paper
Add Code

A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers

no code implementations • 16 Apr 2023 • Juan Zuluaga-Gomez, Amrutha Prasad, Iuliia Nigmatulina, Petr Motlicek, Matthias Kleinert

The overall pipeline is composed of the following submodules: (i) automatic speech recognition (ASR) system that transforms audio into a sequence of words; (ii) high-level air traffic control (ATC) related entity parser that understands the transcribed voice communication; and (iii) a text-to-speech submodule that generates a spoken utterance that resembles a pilot based on the situation of the dialogue.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks

1 code implementation • 16 Dec 2022 • Esaú Villatoro-Tello, Srikanth Madikeri, Juan Zuluaga-Gomez, Bidisha Sharma, Seyyed Saeed Sarfjoo, Iuliia Nigmatulina, Petr Motlicek, Alexei V. Ivanov, Aravind Ganapathiraju

In this paper, we perform an exhaustive evaluation of different representations to address the intent classification problem in a Spoken Language Understanding (SLU) setup.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Code

Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator

no code implementations • 14 Dec 2022 • Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Saeed Sarfjoo, Iuliia Nigmatulina, Karel Vesely

The system understands the voice communications issued by the ATCo, and, in turn, it generates a spoken prompt that follows the pilot's phraseology to the initial communication.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications

3 code implementations • 8 Nov 2022 • Juan Zuluaga-Gomez, Karel Veselý, Igor Szöke, Alexander Blatt, Petr Motlicek, Martin Kocour, Mickael Rigault, Khalid Choukri, Amrutha Prasad, Seyyed Saeed Sarfjoo, Iuliia Nigmatulina, Claudia Cevenini, Pavel Kolčárek, Allan Tart, Jan Černocký, Dietrich Klakow

In this paper, we introduce the ATCO2 corpus, a dataset that aims at fostering research on the challenging ATC field, which has lagged behind due to lack of annotated data.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +6

Paper
Code

How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications

2 code implementations • 31 Mar 2022 • Juan Zuluaga-Gomez, Amrutha Prasad, Iuliia Nigmatulina, Saeed Sarfjoo, Petr Motlicek, Matthias Kleinert, Hartmut Helmke, Oliver Ohneiser, Qingran Zhan

Recent work on self-supervised pre-training focus on leveraging large-scale unlabeled speech data to build robust end-to-end (E2E) acoustic models (AM) that can be later fine-tuned on downstream tasks e. g., automatic speech recognition (ASR).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Code

A two-step approach to leverage contextual data: speech recognition in air-traffic communications

no code implementations • 8 Feb 2022 • Iuliia Nigmatulina, Juan Zuluaga-Gomez, Amrutha Prasad, Seyyed Saeed Sarfjoo, Petr Motlicek

Automatic Speech Recognition (ASR), as the assistance of speech communication between pilots and air-traffic controllers, can significantly reduce the complexity of the task and increase the reliability of transmitted information.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications

2 code implementations • 12 Oct 2021 • Juan Zuluaga-Gomez, Seyyed Saeed Sarfjoo, Amrutha Prasad, Iuliia Nigmatulina, Petr Motlicek, Karel Ondrej, Oliver Ohneiser, Hartmut Helmke

We propose a system that combines SAD and a BERT model to perform speaker change detection and speaker role detection (SRD) by chunking ASR transcripts, i. e., SD with a defined number of speakers together with SRD.

Action Detection Activity Detection +7

Paper
Code

Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition

no code implementations • 27 Aug 2021 • Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Saeed Sarfjoo, Iuliia Nigmatulina, Oliver Ohneiser, Hartmut Helmke

In this work, we propose to (1) automatically segment the ATCO and pilot data based on an intuitive approach exploiting ASR transcripts and (2) subsequently consider an automatic recognition of ATCOs' and pilots' voice as two separate tasks.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Improving callsign recognition with air-surveillance data in air-traffic communication

no code implementations • 27 Aug 2021 • Iuliia Nigmatulina, Rudolf Braun, Juan Zuluaga-Gomez, Petr Motlicek

Automatic Speech Recognition (ASR) can be used as the assistance of speech communication between pilots and air-traffic controllers.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems

no code implementations • 8 Apr 2021 • Juan Zuluaga-Gomez, Iuliia Nigmatulina, Amrutha Prasad, Petr Motlicek, Karel Veselý, Martin Kocour, Igor Szöke

Results show that `unseen domains' (e. g. data from airports not present in the supervised training data) are further aided by contextual SSL when compared to standalone SSL.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.