Monaural voiced speech separation with multipitch tracking

Wei Jiang; Pengfei Hu; Shan Liang; Wenju Liu; Zhanlei Yang

doi:10.1007/978-3-642-33506-8_69

Monaural voiced speech separation with multipitch tracking

Wei Jiang, Pengfei Hu, Shan Liang, Wenju Liu^*, Zhanlei Yang

^*Corresponding author for this work

CAS - Institute of Automation

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

Abstract

Separating voiced speech from its mixtures with interferences in monaural condition is not only an important but also challenging task. As multipitch tracking can enable much better performance of speech separation for CASA systems, we propose a new multipitch determination algorithm, which can be used under various kinds of noise conditions. In the process of multipitch estimation, a new representation method is utilized together with maximum support constraint and harmonic completeness constraint. The proposed approach can reliably detect up to two pitches in each frame. Sequential grouping is performed based on a new target pitch tracking strategy. System evaluations show that our algorithm leads to significantly better speech separation results than previous ones.

Original language	English
Title of host publication	Pattern Recognition - Chinese Conference, CCPR 2012, Proceedings
Pages	564-571
Number of pages	8
DOIs	https://doi.org/10.1007/978-3-642-33506-8_69
Publication status	Published - 2012
Externally published	Yes
Event	2012 5th Chinese Conference on Pattern Recognition, CCPR 2012 - Beijing, China Duration: 24 Sept 2012 → 26 Sept 2012

Publication series

Name	Communications in Computer and Information Science
Volume	321 CCIS
ISSN (Print)	1865-0929

Conference

Conference	2012 5th Chinese Conference on Pattern Recognition, CCPR 2012
Country/Territory	China
City	Beijing
Period	24/09/12 → 26/09/12

Keywords

computational auditory scene analysis
multipitch determination
voiced speech separation

Access to Document

10.1007/978-3-642-33506-8_69

Cite this

@inproceedings{98578ad54b044260bf7e41ca20d1a4c0,

title = "Monaural voiced speech separation with multipitch tracking",

abstract = "Separating voiced speech from its mixtures with interferences in monaural condition is not only an important but also challenging task. As multipitch tracking can enable much better performance of speech separation for CASA systems, we propose a new multipitch determination algorithm, which can be used under various kinds of noise conditions. In the process of multipitch estimation, a new representation method is utilized together with maximum support constraint and harmonic completeness constraint. The proposed approach can reliably detect up to two pitches in each frame. Sequential grouping is performed based on a new target pitch tracking strategy. System evaluations show that our algorithm leads to significantly better speech separation results than previous ones.",

keywords = "computational auditory scene analysis, multipitch determination, voiced speech separation",

author = "Wei Jiang and Pengfei Hu and Shan Liang and Wenju Liu and Zhanlei Yang",

year = "2012",

doi = "10.1007/978-3-642-33506-8_69",

language = "English",

isbn = "9783642335051",

series = "Communications in Computer and Information Science",

pages = "564--571",

booktitle = "Pattern Recognition - Chinese Conference, CCPR 2012, Proceedings",

note = "2012 5th Chinese Conference on Pattern Recognition, CCPR 2012 ; Conference date: 24-09-2012 Through 26-09-2012",

}

Jiang, W, Hu, P, Liang, S, Liu, W & Yang, Z 2012, Monaural voiced speech separation with multipitch tracking. in Pattern Recognition - Chinese Conference, CCPR 2012, Proceedings. Communications in Computer and Information Science, vol. 321 CCIS, pp. 564-571, 2012 5th Chinese Conference on Pattern Recognition, CCPR 2012, Beijing, China, 24/09/12. https://doi.org/10.1007/978-3-642-33506-8_69

TY - GEN

T1 - Monaural voiced speech separation with multipitch tracking

AU - Jiang, Wei

AU - Hu, Pengfei

AU - Liang, Shan

AU - Liu, Wenju

AU - Yang, Zhanlei

PY - 2012

Y1 - 2012

N2 - Separating voiced speech from its mixtures with interferences in monaural condition is not only an important but also challenging task. As multipitch tracking can enable much better performance of speech separation for CASA systems, we propose a new multipitch determination algorithm, which can be used under various kinds of noise conditions. In the process of multipitch estimation, a new representation method is utilized together with maximum support constraint and harmonic completeness constraint. The proposed approach can reliably detect up to two pitches in each frame. Sequential grouping is performed based on a new target pitch tracking strategy. System evaluations show that our algorithm leads to significantly better speech separation results than previous ones.

AB - Separating voiced speech from its mixtures with interferences in monaural condition is not only an important but also challenging task. As multipitch tracking can enable much better performance of speech separation for CASA systems, we propose a new multipitch determination algorithm, which can be used under various kinds of noise conditions. In the process of multipitch estimation, a new representation method is utilized together with maximum support constraint and harmonic completeness constraint. The proposed approach can reliably detect up to two pitches in each frame. Sequential grouping is performed based on a new target pitch tracking strategy. System evaluations show that our algorithm leads to significantly better speech separation results than previous ones.

KW - computational auditory scene analysis

KW - multipitch determination

KW - voiced speech separation

UR - http://www.scopus.com/inward/record.url?scp=84867120637&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-33506-8_69

DO - 10.1007/978-3-642-33506-8_69

M3 - Conference Proceeding

AN - SCOPUS:84867120637

SN - 9783642335051

T3 - Communications in Computer and Information Science

SP - 564

EP - 571

BT - Pattern Recognition - Chinese Conference, CCPR 2012, Proceedings

T2 - 2012 5th Chinese Conference on Pattern Recognition, CCPR 2012

Y2 - 24 September 2012 through 26 September 2012

ER -

Monaural voiced speech separation with multipitch tracking

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this