A Progressive Enhancement Method for Noisy and Reverberant Speech

Xiaofeng Shu, Yi Zhou, Yin Cao

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

2 Citations (Scopus)

Abstract

In this paper, a speech enhancement method based on the framework of progressive deep neural networks (PDNNs) is proposed for low signal-to-noise ratio (SNR) and highly reverberant environments. It aims at assisting the complicated regression task of mapping noisy and reverberant speech to clean speech by utilizing two independent tasks, which suppress reverberation and noises respectively. Furthermore, a progressive learning approach is used for each task, which brings intermediate learning targets to enhance system performances. Experimental results reveal that the proposed method can achieve improvements in both objective and subjective evaluations in low SNR and high reverberation time 60 (RT60) environments when compared with the conventional deep neural network-based method.

Original languageEnglish
Title of host publication2018 IEEE 23rd International Conference on Digital Signal Processing, DSP 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781538668115
DOIs
Publication statusPublished - 2 Jul 2018
Externally publishedYes
Event23rd IEEE International Conference on Digital Signal Processing, DSP 2018 - Shanghai, China
Duration: 19 Nov 201821 Nov 2018

Publication series

NameInternational Conference on Digital Signal Processing, DSP
Volume2018-November

Conference

Conference23rd IEEE International Conference on Digital Signal Processing, DSP 2018
Country/TerritoryChina
CityShanghai
Period19/11/1821/11/18

Keywords

  • PDNNs
  • Progressive learning
  • RT60
  • Regression task
  • SNR
  • Speech enhancement

Fingerprint

Dive into the research topics of 'A Progressive Enhancement Method for Noisy and Reverberant Speech'. Together they form a unique fingerprint.

Cite this