TY - GEN
T1 - A Preliminary Case Study on Long-Form In-the-Wild Audio Spoofing Detection
AU - Liu, Xuechen
AU - Wang, Xin
AU - Yamagishi, Junichi
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Audio spoofing detection has become increasingly important due to the rise in real-world cases. Current spoofing detectors, referred to as spoofing countermeasures (CM), are mainly trained and focused on audio waveforms with a single speaker and short duration. This study explores spoofing detection in more realistic scenarios, where the audio is long in duration and features multiple speakers and complex acoustic conditions. We test the widely-acquired AASIST under this challenging scenario, looking at the impact of multiple variations such as duration, speaker presence, and acoustic complexities on CM performance. Our work reveals key issues with current methods and suggests preliminary ways to improve them. We aim to make spoofing detection more applicable in more in-the-wild scenarios. This research is served as an important step towards developing detection systems that can handle the challenges of audio spoofing in real-world applications.
AB - Audio spoofing detection has become increasingly important due to the rise in real-world cases. Current spoofing detectors, referred to as spoofing countermeasures (CM), are mainly trained and focused on audio waveforms with a single speaker and short duration. This study explores spoofing detection in more realistic scenarios, where the audio is long in duration and features multiple speakers and complex acoustic conditions. We test the widely-acquired AASIST under this challenging scenario, looking at the impact of multiple variations such as duration, speaker presence, and acoustic complexities on CM performance. Our work reveals key issues with current methods and suggests preliminary ways to improve them. We aim to make spoofing detection more applicable in more in-the-wild scenarios. This research is served as an important step towards developing detection systems that can handle the challenges of audio spoofing in real-world applications.
KW - Audio DeepFake
KW - Long-form Audio
KW - Spoof Detection
UR - https://www.scopus.com/pages/publications/85217281109
U2 - 10.1109/BIOSIG61931.2024.10786724
DO - 10.1109/BIOSIG61931.2024.10786724
M3 - Conference Proceeding
AN - SCOPUS:85217281109
T3 - BIOSIG 2024 - Proceedings of the 23rd International Conference of the Biometrics Special Interest Group
BT - BIOSIG 2024 - Proceedings of the 23rd International Conference of the Biometrics Special Interest Group
A2 - Boutros, Fadi
A2 - Damer, Naser
A2 - Fang, Meiling
A2 - Gomez-Barrero, Marta
A2 - Raja, Kiran
A2 - Rathgeb, Christian
A2 - Sequeira, Ana F.
A2 - Todisco, Massimiliano
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 23rd International Conference of the Biometrics Special Interest Group, BIOSIG 2024
Y2 - 25 September 2024 through 27 September 2024
ER -