TY - JOUR
T1 - Deep learning based speech separation technology and its developments
AU - Liu, Wen Ju
AU - Nie, Shuai
AU - Liang, Shan
AU - Zhang, Xue Liang
N1 - Publisher Copyright:
Copyright © 2016 Acta Automatica Sinica. All rights reserved.
PY - 2016/6/1
Y1 - 2016/6/1
N2 - Nowadays, speech interaction technology has been widely used in our daily life. However, due to the interfer- ences, the performances of speech interaction systems in real-world environments are far from being satisfactory. Speech separation technology has been proven to be an effective way to improve the performance of speech interaction in noisy environments. To this end, decades of efforts have been devoted to speech separation. There have been many methods proposed and a lot of success achieved. Especially with the rise of deep learning, deep learning-based speech separation has been proposed and extensively studied, which has been shown considerable promise and become a main research line. So far, there have been many deep learning-based speech separation methods proposed. However, there is little systematic analysis and summary on the deep learning-based speech separation technology. We try to give a detail analysis and summary on the general procedures and components of speech separation in this regard. Moreover, we survey a wide range of supervised speech separation techniques from three aspects: 1) features, 2) targets, 3) models. And finally we give some views on its developments.
AB - Nowadays, speech interaction technology has been widely used in our daily life. However, due to the interfer- ences, the performances of speech interaction systems in real-world environments are far from being satisfactory. Speech separation technology has been proven to be an effective way to improve the performance of speech interaction in noisy environments. To this end, decades of efforts have been devoted to speech separation. There have been many methods proposed and a lot of success achieved. Especially with the rise of deep learning, deep learning-based speech separation has been proposed and extensively studied, which has been shown considerable promise and become a main research line. So far, there have been many deep learning-based speech separation methods proposed. However, there is little systematic analysis and summary on the deep learning-based speech separation technology. We try to give a detail analysis and summary on the general procedures and components of speech separation in this regard. Moreover, we survey a wide range of supervised speech separation techniques from three aspects: 1) features, 2) targets, 3) models. And finally we give some views on its developments.
KW - Computational auditory scene analysis
KW - Machine learning
KW - Neural network
KW - Speech separation
UR - http://www.scopus.com/inward/record.url?scp=84977275703&partnerID=8YFLogxK
U2 - 10.16383/j.aas.2016.c150734
DO - 10.16383/j.aas.2016.c150734
M3 - Review article
AN - SCOPUS:84977275703
SN - 0254-4156
VL - 42
SP - 819
EP - 833
JO - Zidonghua Xuebao/Acta Automatica Sinica
JF - Zidonghua Xuebao/Acta Automatica Sinica
IS - 6
ER -