TY - GEN
T1 - Monaural voiced speech separation with multipitch tracking
AU - Jiang, Wei
AU - Hu, Pengfei
AU - Liang, Shan
AU - Liu, Wenju
AU - Yang, Zhanlei
PY - 2012
Y1 - 2012
N2 - Separating voiced speech from mixtures with interference under monaural conditions is an important but challenging task. Because multipitch tracking can substantially improve the speech separation performance of CASA systems, we propose a new multipitch determination algorithm that can be used under various noise conditions. For multipitch estimation, a new representation method is used together with a maximum support constraint and a harmonic completeness constraint. The proposed approach can reliably detect up to two pitches in each frame. Sequential grouping is performed based on a new target pitch tracking strategy. System evaluations show that our algorithm yields significantly better speech separation results than previous algorithms.
AB - Separating voiced speech from mixtures with interference under monaural conditions is an important but challenging task. Because multipitch tracking can substantially improve the speech separation performance of CASA systems, we propose a new multipitch determination algorithm that can be used under various noise conditions. For multipitch estimation, a new representation method is used together with a maximum support constraint and a harmonic completeness constraint. The proposed approach can reliably detect up to two pitches in each frame. Sequential grouping is performed based on a new target pitch tracking strategy. System evaluations show that our algorithm yields significantly better speech separation results than previous algorithms.
KW - computational auditory scene analysis
KW - multipitch determination
KW - voiced speech separation
UR - http://www.scopus.com/inward/record.url?scp=84867120637&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-33506-8_69
DO - 10.1007/978-3-642-33506-8_69
M3 - Conference Proceeding
AN - SCOPUS:84867120637
SN - 9783642335051
T3 - Communications in Computer and Information Science
SP - 564
EP - 571
BT - Pattern Recognition - Chinese Conference, CCPR 2012, Proceedings
T2 - 2012 5th Chinese Conference on Pattern Recognition, CCPR 2012
Y2 - 24 September 2012 through 26 September 2012
ER -