A Progressive Enhancement Method for Noisy and Reverberant Speech

Xiaofeng Shu; Yi Zhou; Yin Cao

doi:10.1109/ICDSP.2018.8631860

A Progressive Enhancement Method for Noisy and Reverberant Speech

Xiaofeng Shu, Yi Zhou, Yin Cao

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

2 Citations (Scopus)

Abstract

In this paper, a speech enhancement method based on the framework of progressive deep neural networks (PDNNs) is proposed for low signal-to-noise ratio (SNR) and highly reverberant environments. It aims at assisting the complicated regression task of mapping noisy and reverberant speech to clean speech by utilizing two independent tasks, which suppress reverberation and noises respectively. Furthermore, a progressive learning approach is used for each task, which brings intermediate learning targets to enhance system performances. Experimental results reveal that the proposed method can achieve improvements in both objective and subjective evaluations in low SNR and high reverberation time 60 (RT₆₀) environments when compared with the conventional deep neural network-based method.

Original language	English
Title of host publication	2018 IEEE 23rd International Conference on Digital Signal Processing, DSP 2018
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781538668115
DOIs	https://doi.org/10.1109/ICDSP.2018.8631860
Publication status	Published - 2 Jul 2018
Externally published	Yes
Event	23rd IEEE International Conference on Digital Signal Processing, DSP 2018 - Shanghai, China Duration: 19 Nov 2018 → 21 Nov 2018

Publication series

Name	International Conference on Digital Signal Processing, DSP
Volume	2018-November

Conference

Conference	23rd IEEE International Conference on Digital Signal Processing, DSP 2018
Country/Territory	China
City	Shanghai
Period	19/11/18 → 21/11/18

Keywords

PDNNs
Progressive learning
RT60
Regression task
SNR
Speech enhancement

Access to Document

10.1109/ICDSP.2018.8631860

Cite this

Shu, X., Zhou, Y., & Cao, Y. (2018). A Progressive Enhancement Method for Noisy and Reverberant Speech. In 2018 IEEE 23rd International Conference on Digital Signal Processing, DSP 2018 Article 8631860 (International Conference on Digital Signal Processing, DSP; Vol. 2018-November). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICDSP.2018.8631860

@inproceedings{1e123f6191c942f59b7983f475ead444,

title = "A Progressive Enhancement Method for Noisy and Reverberant Speech",

abstract = "In this paper, a speech enhancement method based on the framework of progressive deep neural networks (PDNNs) is proposed for low signal-to-noise ratio (SNR) and highly reverberant environments. It aims at assisting the complicated regression task of mapping noisy and reverberant speech to clean speech by utilizing two independent tasks, which suppress reverberation and noises respectively. Furthermore, a progressive learning approach is used for each task, which brings intermediate learning targets to enhance system performances. Experimental results reveal that the proposed method can achieve improvements in both objective and subjective evaluations in low SNR and high reverberation time 60 (RT60) environments when compared with the conventional deep neural network-based method.",

keywords = "PDNNs, Progressive learning, RT60, Regression task, SNR, Speech enhancement",

author = "Xiaofeng Shu and Yi Zhou and Yin Cao",

note = "Publisher Copyright: {\textcopyright} 2018 IEEE.; 23rd IEEE International Conference on Digital Signal Processing, DSP 2018 ; Conference date: 19-11-2018 Through 21-11-2018",

year = "2018",

month = jul,

day = "2",

doi = "10.1109/ICDSP.2018.8631860",

language = "English",

series = "International Conference on Digital Signal Processing, DSP",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2018 IEEE 23rd International Conference on Digital Signal Processing, DSP 2018",

}

Shu, X, Zhou, Y & Cao, Y 2018, A Progressive Enhancement Method for Noisy and Reverberant Speech. in 2018 IEEE 23rd International Conference on Digital Signal Processing, DSP 2018., 8631860, International Conference on Digital Signal Processing, DSP, vol. 2018-November, Institute of Electrical and Electronics Engineers Inc., 23rd IEEE International Conference on Digital Signal Processing, DSP 2018, Shanghai, China, 19/11/18. https://doi.org/10.1109/ICDSP.2018.8631860

A Progressive Enhancement Method for Noisy and Reverberant Speech. / Shu, Xiaofeng; Zhou, Yi; Cao, Yin.
2018 IEEE 23rd International Conference on Digital Signal Processing, DSP 2018. Institute of Electrical and Electronics Engineers Inc., 2018. 8631860 (International Conference on Digital Signal Processing, DSP; Vol. 2018-November).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - A Progressive Enhancement Method for Noisy and Reverberant Speech

AU - Shu, Xiaofeng

AU - Zhou, Yi

AU - Cao, Yin

PY - 2018/7/2

Y1 - 2018/7/2

N2 - In this paper, a speech enhancement method based on the framework of progressive deep neural networks (PDNNs) is proposed for low signal-to-noise ratio (SNR) and highly reverberant environments. It aims at assisting the complicated regression task of mapping noisy and reverberant speech to clean speech by utilizing two independent tasks, which suppress reverberation and noises respectively. Furthermore, a progressive learning approach is used for each task, which brings intermediate learning targets to enhance system performances. Experimental results reveal that the proposed method can achieve improvements in both objective and subjective evaluations in low SNR and high reverberation time 60 (RT60) environments when compared with the conventional deep neural network-based method.

AB - In this paper, a speech enhancement method based on the framework of progressive deep neural networks (PDNNs) is proposed for low signal-to-noise ratio (SNR) and highly reverberant environments. It aims at assisting the complicated regression task of mapping noisy and reverberant speech to clean speech by utilizing two independent tasks, which suppress reverberation and noises respectively. Furthermore, a progressive learning approach is used for each task, which brings intermediate learning targets to enhance system performances. Experimental results reveal that the proposed method can achieve improvements in both objective and subjective evaluations in low SNR and high reverberation time 60 (RT60) environments when compared with the conventional deep neural network-based method.

KW - PDNNs

KW - Progressive learning

KW - RT60

KW - Regression task

KW - SNR

KW - Speech enhancement

UR - http://www.scopus.com/inward/record.url?scp=85062783928&partnerID=8YFLogxK

U2 - 10.1109/ICDSP.2018.8631860

DO - 10.1109/ICDSP.2018.8631860

M3 - Conference Proceeding

AN - SCOPUS:85062783928

T3 - International Conference on Digital Signal Processing, DSP

BT - 2018 IEEE 23rd International Conference on Digital Signal Processing, DSP 2018

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 23rd IEEE International Conference on Digital Signal Processing, DSP 2018

Y2 - 19 November 2018 through 21 November 2018

ER -

A Progressive Enhancement Method for Noisy and Reverberant Speech

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this