Search Results for author: Thi Ngoc Tho Nguyen

Found 8 papers, 4 papers with code

Polyphonic audio event detection: multi-label or multi-class multi-task classification problem?

no code implementations • 29 Jan 2022 • Huy Phan, Thi Ngoc Tho Nguyen, Philipp Koch, Alfred Mertins

The network is composed of a backbone subnet and multiple task-specific subnets.

Paper
Add Code

SALSA-Lite: A Fast and Effective Feature for Polyphonic Sound Event Localization and Detection with Microphone Arrays

4 code implementations • 16 Nov 2021 • Thi Ngoc Tho Nguyen, Douglas L. Jones, Karn N. Watcharasupat, Huy Phan, Woon-Seng Gan

In this work, we introduce SALSA-Lite, a fast and effective feature for polyphonic SELD using microphone array inputs.

Sound Event Localization and Detection

Paper
Code

End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression

no code implementations • 2 Oct 2021 • Karn N. Watcharasupat, Thi Ngoc Tho Nguyen, Woon-Seng Gan, Shengkui Zhao, Bin Ma

We also propose a dual-mask technique for joint echo and noise suppression with simultaneous speech enhancement.

Acoustic echo cancellation Speech Enhancement

Paper
Add Code

SALSA: Spatial Cue-Augmented Log-Spectrogram Features for Polyphonic Sound Event Localization and Detection

1 code implementation • 1 Oct 2021 • Thi Ngoc Tho Nguyen, Karn N. Watcharasupat, Ngoc Khanh Nguyen, Douglas L. Jones, Woon-Seng Gan

Sound event localization and detection (SELD) consists of two subtasks, which are sound event detection and direction-of-arrival estimation.

Ranked #1 on Sound Event Localization and Detection on TAU-NIGENS Spatial Sound Events 2021

Direction of Arrival Estimation Event Detection +2

Paper
Code

Improving Polyphonic Sound Event Detection on Multichannel Recordings with the Sørensen-Dice Coefficient Loss and Transfer Learning

no code implementations • 22 Jul 2021 • Karn N. Watcharasupat, Thi Ngoc Tho Nguyen, Ngoc Khanh Nguyen, Zhen Jian Lee, Douglas L. Jones, Woon Seng Gan

The S{\o}rensen--Dice Coefficient has recently seen rising popularity as a loss function (also known as Dice loss) due to its robustness in tasks where the number of negative samples significantly exceeds that of positive samples, such as semantic segmentation, natural language processing, and sound event detection.

Data Augmentation Event Detection +3

Paper
Add Code

What Makes Sound Event Localization and Detection Difficult? Insights from Error Analysis

1 code implementation • 22 Jul 2021 • Thi Ngoc Tho Nguyen, Karn N. Watcharasupat, Zhen Jian Lee, Ngoc Khanh Nguyen, Douglas L. Jones, Woon Seng Gan

Sound event localization and detection (SELD) is an emerging research topic that aims to unify the tasks of sound event detection and direction-of-arrival estimation.

Direction of Arrival Estimation Event Detection +2

Paper
Code

DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic Sound Event Localization and Detection

1 code implementation • 29 Jun 2021 • Thi Ngoc Tho Nguyen, Karn Watcharasupat, Ngoc Khanh Nguyen, Douglas L. Jones, Woon Seng Gan

Sound event localization and detection consists of two subtasks which are sound event detection and direction-of-arrival estimation.

Audio Classification Direction of Arrival Estimation +3

Paper
Code

AMVNet: Assertion-based Multi-View Fusion Network for LiDAR Semantic Segmentation

no code implementations • 9 Dec 2020 • Venice Erin Liong, Thi Ngoc Tho Nguyen, Sergi Widjaja, Dhananjai Sharma, Zhuang Jie Chong

In this paper, we present an Assertion-based Multi-View Fusion network (AMVNet) for LiDAR semantic segmentation which aggregates the semantic features of individual projection-based networks using late fusion.

Ranked #14 on LIDAR Semantic Segmentation on nuScenes

Autonomous Vehicles LIDAR Semantic Segmentation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.