KUIELab-MDX-Net: A Two-Stream Neural Network for Music Demixing

24 Nov 2021  ·  Minseok Kim, Woosung Choi, Jaehwa Chung, Daewon Lee, Soonyoung Jung ·

Recently, many methods based on deep learning have been proposed for music source separation. Some state-of-the-art methods have shown that stacking many layers with many skip connections improve the SDR performance. Although such a deep and complex architecture shows outstanding performance, it usually requires numerous computing resources and time for training and evaluation. This paper proposes a two-stream neural network for music demixing, called KUIELab-MDX-Net, which shows a good balance of performance and required resources. The proposed model has a time-frequency branch and a time-domain branch, where each branch separates stems, respectively. It blends results from two streams to generate the final estimation. KUIELab-MDX-Net took second place on leaderboard A and third place on leaderboard B in the Music Demixing Challenge at ISMIR 2021. This paper also summarizes experimental results on another benchmark, MUSDB18. Our source code is available online.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Music Source Separation MUSDB18 KUIELab-MDX-Net SDR (vocals) 9.00 # 6
SDR (drums) 7.33 # 10
SDR (other) 5.95 # 6
SDR (bass) 7.86 # 7
SDR (avg) 7.54 # 7
Music Source Separation MUSDB18-HQ KUIELab-MDX-Net SDR (drums) 7.20 # 10
SDR (bass) 7.83 # 8
SDR (others) 5.90 # 9
SDR (vocals) 8.97 # 9
SDR (avg) 7.47 # 10

Methods


No methods listed for this paper. Add relevant methods here