Transductive Feature Space Regularization for Few-shot Bioacoustic Event Detection

Yizhou Tan; Haojun Ai; Shengchen Li; Feng Zhang

doi:10.21437/Interspeech.2023-579

Transductive Feature Space Regularization for Few-shot Bioacoustic Event Detection

Yizhou Tan, Haojun Ai^*, Shengchen Li, Feng Zhang

^*Corresponding author for this work

Department of Intelligent Science

Wuhan University

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

2 Citations (Scopus)

Abstract

In few-shot bioacoustic event detection, besides interested target events, background noises and various uninterested sound events lead to complex decision boundaries, which require regularized feature distributions in feature space. Due to the low label availability of uncertain noise events, existing few-shot learning methods with entropy-based regularizers suffer from overfitting during optimization. In this paper, we propose a transductive inference model with a prior knowledge based regularizer (PKR) to overcome the overfitting problem. We use a task-adaptive feature extractor to reconstruct a regularized feature space. A PKR is proposed to minimize the divergence between the original and reconstructed feature space. The development set of DCASE 2022 Task 5 is adopted as the experimental dataset. With the increasing iterations, the proposed model performs with long-lasting results around 55.43 F-measure, and well solves the overfitting problem in transductive inference.

Original language	English
Title of host publication	Proceeding of INTERSPEECH 2023
Pages	571-575
Number of pages	5
Volume	2023-August
DOIs	https://doi.org/10.21437/Interspeech.2023-579
Publication status	Published - 20 Aug 2023
Event	24th International Speech Communication Association, Interspeech 2023 - Dublin, Ireland Duration: 20 Aug 2023 → 24 Aug 2023

Publication series

Name	Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
ISSN (Print)	2308-457X

Conference

Conference	24th International Speech Communication Association, Interspeech 2023
Country/Territory	Ireland
City	Dublin
Period	20/08/23 → 24/08/23

Keywords

Bioacoustic Event Detection
Few-shot Learning
Transductive Inference

Access to Document

10.21437/Interspeech.2023-579

Cite this

@inproceedings{01107befc6384daaa96a2fd5ef561fae,

title = "Transductive Feature Space Regularization for Few-shot Bioacoustic Event Detection",

abstract = "In few-shot bioacoustic event detection, besides interested target events, background noises and various uninterested sound events lead to complex decision boundaries, which require regularized feature distributions in feature space. Due to the low label availability of uncertain noise events, existing few-shot learning methods with entropy-based regularizers suffer from overfitting during optimization. In this paper, we propose a transductive inference model with a prior knowledge based regularizer (PKR) to overcome the overfitting problem. We use a task-adaptive feature extractor to reconstruct a regularized feature space. A PKR is proposed to minimize the divergence between the original and reconstructed feature space. The development set of DCASE 2022 Task 5 is adopted as the experimental dataset. With the increasing iterations, the proposed model performs with long-lasting results around 55.43 F-measure, and well solves the overfitting problem in transductive inference.",

keywords = "Bioacoustic Event Detection, Few-shot Learning, Transductive Inference",

author = "Yizhou Tan and Haojun Ai and Shengchen Li and Feng Zhang",

note = "Funding Information: The research project is supported partly by the National Natural Science Foundation of China (No: 62001038 and No: 61971316), and Gusu Innovation and Entrepreneurship Leading Talents Programme - Youth Innovation Leading Talent (ZXL2022472). *Corresponding Author Publisher Copyright: {\textcopyright} 2023 International Speech Communication Association. All rights reserved.; 24th International Speech Communication Association, Interspeech 2023 ; Conference date: 20-08-2023 Through 24-08-2023",

year = "2023",

month = aug,

day = "20",

doi = "10.21437/Interspeech.2023-579",

language = "English",

volume = "2023-August",

series = "Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH",

pages = "571--575",

booktitle = "Proceeding of INTERSPEECH 2023",

}

Tan, Y, Ai, H, Li, S & Zhang, F 2023, Transductive Feature Space Regularization for Few-shot Bioacoustic Event Detection. in Proceeding of INTERSPEECH 2023. vol. 2023-August, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 571-575, 24th International Speech Communication Association, Interspeech 2023, Dublin, Ireland, 20/08/23. https://doi.org/10.21437/Interspeech.2023-579

Transductive Feature Space Regularization for Few-shot Bioacoustic Event Detection. / Tan, Yizhou; Ai, Haojun; Li, Shengchen et al.
Proceeding of INTERSPEECH 2023. Vol. 2023-August 2023. p. 571-575 (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - Transductive Feature Space Regularization for Few-shot Bioacoustic Event Detection

AU - Tan, Yizhou

AU - Ai, Haojun

AU - Li, Shengchen

AU - Zhang, Feng

N1 - Funding Information: The research project is supported partly by the National Natural Science Foundation of China (No: 62001038 and No: 61971316), and Gusu Innovation and Entrepreneurship Leading Talents Programme - Youth Innovation Leading Talent (ZXL2022472). *Corresponding Author Publisher Copyright: © 2023 International Speech Communication Association. All rights reserved.

PY - 2023/8/20

Y1 - 2023/8/20

N2 - In few-shot bioacoustic event detection, besides interested target events, background noises and various uninterested sound events lead to complex decision boundaries, which require regularized feature distributions in feature space. Due to the low label availability of uncertain noise events, existing few-shot learning methods with entropy-based regularizers suffer from overfitting during optimization. In this paper, we propose a transductive inference model with a prior knowledge based regularizer (PKR) to overcome the overfitting problem. We use a task-adaptive feature extractor to reconstruct a regularized feature space. A PKR is proposed to minimize the divergence between the original and reconstructed feature space. The development set of DCASE 2022 Task 5 is adopted as the experimental dataset. With the increasing iterations, the proposed model performs with long-lasting results around 55.43 F-measure, and well solves the overfitting problem in transductive inference.

AB - In few-shot bioacoustic event detection, besides interested target events, background noises and various uninterested sound events lead to complex decision boundaries, which require regularized feature distributions in feature space. Due to the low label availability of uncertain noise events, existing few-shot learning methods with entropy-based regularizers suffer from overfitting during optimization. In this paper, we propose a transductive inference model with a prior knowledge based regularizer (PKR) to overcome the overfitting problem. We use a task-adaptive feature extractor to reconstruct a regularized feature space. A PKR is proposed to minimize the divergence between the original and reconstructed feature space. The development set of DCASE 2022 Task 5 is adopted as the experimental dataset. With the increasing iterations, the proposed model performs with long-lasting results around 55.43 F-measure, and well solves the overfitting problem in transductive inference.

KW - Bioacoustic Event Detection

KW - Few-shot Learning

KW - Transductive Inference

UR - http://www.scopus.com/inward/record.url?scp=85171551514&partnerID=8YFLogxK

U2 - 10.21437/Interspeech.2023-579

DO - 10.21437/Interspeech.2023-579

M3 - Conference Proceeding

AN - SCOPUS:85171551514

VL - 2023-August

T3 - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

SP - 571

EP - 575

BT - Proceeding of INTERSPEECH 2023

T2 - 24th International Speech Communication Association, Interspeech 2023

Y2 - 20 August 2023 through 24 August 2023

ER -

Transductive Feature Space Regularization for Few-shot Bioacoustic Event Detection

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this