no code implementations • 29 Jan 2022 • Huy Phan, Thi Ngoc Tho Nguyen, Philipp Koch, Alfred Mertins
The network is composed of a backbone subnet and multiple task-specific subnets.
4 code implementations • 16 Nov 2021 • Thi Ngoc Tho Nguyen, Douglas L. Jones, Karn N. Watcharasupat, Huy Phan, Woon-Seng Gan
In this work, we introduce SALSA-Lite, a fast and effective feature for polyphonic SELD using microphone array inputs.
no code implementations • 2 Oct 2021 • Karn N. Watcharasupat, Thi Ngoc Tho Nguyen, Woon-Seng Gan, Shengkui Zhao, Bin Ma
We also propose a dual-mask technique for joint echo and noise suppression with simultaneous speech enhancement.
1 code implementation • 1 Oct 2021 • Thi Ngoc Tho Nguyen, Karn N. Watcharasupat, Ngoc Khanh Nguyen, Douglas L. Jones, Woon-Seng Gan
Sound event localization and detection (SELD) consists of two subtasks, which are sound event detection and direction-of-arrival estimation.
no code implementations • 22 Jul 2021 • Karn N. Watcharasupat, Thi Ngoc Tho Nguyen, Ngoc Khanh Nguyen, Zhen Jian Lee, Douglas L. Jones, Woon Seng Gan
The Sørensen-Dice coefficient has recently seen rising popularity as a loss function (also known as Dice loss) due to its robustness in tasks where the number of negative samples significantly exceeds that of positive samples, such as semantic segmentation, natural language processing, and sound event detection.
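For readers unfamiliar with Dice loss, a minimal sketch of one common "soft" formulation is given here. It is an illustration only and not necessarily the variant studied in the paper; the function name `dice_loss` and the smoothing term `eps` are assumptions made for this example.

```python
def dice_loss(probs, targets, eps=1e-7):
    """Soft Dice loss for one flat list of predictions.

    probs   -- predicted probabilities in [0, 1]
    targets -- binary ground-truth labels (0 or 1)
    eps     -- small smoothing constant (an assumption of this sketch)
               that avoids division by zero when both inputs are all zero
    """
    # Overlap between predictions and positive labels.
    intersection = sum(p * t for p, t in zip(probs, targets))
    # Total predicted mass plus total positive mass.
    total = sum(probs) + sum(targets)
    # Dice coefficient is 2|A∩B| / (|A| + |B|); the loss is its complement.
    return 1.0 - (2.0 * intersection + eps) / (total + eps)


# One positive among 99 negatives: predicting all zeros is 99% accurate,
# yet the Dice loss stays close to 1 because true negatives never enter
# the formula.
targets = [1.0] + [0.0] * 99
all_negative = [0.0] * 100
print(dice_loss(all_negative, targets))
```

Because true negatives appear in neither the numerator nor the denominator, the large negative class cannot dominate the loss, which is the robustness property the abstract refers to.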
1 code implementation • 22 Jul 2021 • Thi Ngoc Tho Nguyen, Karn N. Watcharasupat, Zhen Jian Lee, Ngoc Khanh Nguyen, Douglas L. Jones, Woon Seng Gan
Sound event localization and detection (SELD) is an emerging research topic that aims to unify the tasks of sound event detection and direction-of-arrival estimation.
1 code implementation • 29 Jun 2021 • Thi Ngoc Tho Nguyen, Karn Watcharasupat, Ngoc Khanh Nguyen, Douglas L. Jones, Woon Seng Gan
Sound event localization and detection consists of two subtasks: sound event detection and direction-of-arrival estimation.
no code implementations • 9 Dec 2020 • Venice Erin Liong, Thi Ngoc Tho Nguyen, Sergi Widjaja, Dhananjai Sharma, Zhuang Jie Chong
In this paper, we present an Assertion-based Multi-View Fusion network (AMVNet) for LiDAR semantic segmentation which aggregates the semantic features of individual projection-based networks using late fusion.
Ranked #14 on LIDAR Semantic Segmentation on nuScenes