TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Speech Enhancement	VoiceBank + DEMAND	DCT	PESQ	2.7	# 23
Speech Enhancement	VoiceBank + DEMAND	DCT	CSIG	3.9	# 20
Speech Enhancement	VoiceBank + DEMAND	DCT	CBAK	3.29	# 13
Speech Enhancement	VoiceBank + DEMAND	DCT	COVL	3.29	# 20

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/end-to-end-speech-enhancement-based-on/speech-enhancement-on-demand)](https://paperswithcode.com/sota/speech-enhancement-on-demand?p=end-to-end-speech-enhancement-based-on)`

End-to-end speech enhancement based on discrete cosine transform

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019 · Chuang Geng, Lei Wang ·

Previous speech enhancement methods focus on estimating the short-time spectrum of speech signals due to its short-term stability. However, these methods often only estimate the clean magnitude spectrum and reuse the noisy phase when resynthesize speech signals, which is unlikely a valid short-time Fourier transform (STFT). Recently, DNN based speech enhancement methods mainly joint estimation of the magnitude and phase spectrum. These methods usually give better performance than magnitude spectrum estimation but need much larger computation and memory overhead. In this paper, we propose using the Discrete Cosine Transform (DCT) to reconstruct a valid short-time spectrum. Under the U-net structure, we enhance the real spectrogram and finally achieve perfect performance.

PDF Abstract

Code

Add Remove Mark official

BYRTIMO/END-TO-END-SPEECH-ENHANCEME… official

IMLHF/DRUnet-SE

BYRTIMO/Speech-enhancement-based-on…

Datasets

VoiceBank + DEMAND

Edit Social Preview

End-to-end speech enhancement based on discrete cosine transform

Code Edit Add Remove Mark official

Categories

Datasets Edit

Code

Add Remove Mark official

Datasets