Supervised imbalanced multi-domain adaptation for text-independent speaker verification

Zhiyong Chen, Zongze Ren, Shugong Xu

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

Abstract

Speaker verification is an important recognition task in speech signal processing. Domain adaptation for speaker verification is challenging and it is one of the practical problems put forward in the INTERSPEECH2020 Short-duration Speaker Verification (SdSV) Challenge 2020. Although there are several previous researches focused on the domain mismatch problem of the speaker verification task, many methods are not easy to show effectiveness in the real conditions. This is due to the suboptimal loss design, as well as the real-world datasets could contain multiple domains and imbalanced data in each domain. We have explored various domain adaptation methods and proposed one that is both effective and robust in this task by optimizing loss design and explicitly considering the data-imbalance problem. The proposed method is also designed to fit the scenarios where the datasets contain multiple domains. Significant single-model performance improvements have been observed by evaluating on the SdSV20 challenge testbench with our proposed method.

Original languageEnglish
Title of host publicationICCPR 2020 - Proceedings of 2020 9th International Conference on Computing and Pattern Recognition
PublisherAssociation for Computing Machinery
Pages431-438
Number of pages8
ISBN (Electronic)9781450387835
DOIs
Publication statusPublished - 30 Oct 2020
Externally publishedYes
Event9th International Conference on Computing and Pattern Recognition, ICCPR 2020 - Virtual, Online, China
Duration: 30 Oct 20201 Nov 2020

Publication series

NameACM International Conference Proceeding Series

Conference

Conference9th International Conference on Computing and Pattern Recognition, ICCPR 2020
Country/TerritoryChina
CityVirtual, Online
Period30/10/201/11/20

Keywords

  • Automatic speaker verification
  • Domain adaptation
  • Transfer learning

Fingerprint

Dive into the research topics of 'Supervised imbalanced multi-domain adaptation for text-independent speaker verification'. Together they form a unique fingerprint.

Cite this