Projects per year
Personal profile
Personal profile
My research concerns using signal processing and machine learning for acoustics, audio, and speech processing. In particular, audio event detection and classification, sound source localization, sound source separation, speech enhancement, speech echo cancellation, active noise control, sound field control, array signal processing, and vibration and structural acoustics.
Research interests
Acoustics, audio, and speech processing. In particular, audio event detection and classification, sound source localization, sound source separation, speech enhancement, speech echo cancellation, active noise control, sound field control, array signal processing, and vibration and structural acoustics.
Experience
Research Fellow, Centre for Vision, Speech, and Signal Processing, University of Surrey - 2020
Associate Research Scientist, Institute of Acoustics, Chinese Academy of Sciences - 2018
Postdoc, Acoustic Research Group, Brigham Young University - 2015
Teaching
INT402, Data Mining and Big Data Analytics
Awards and honours
Second top ranking in the challenge of DCASE 2022 Task 3.
Top ranking in the challenge of ICASSP 2022 L3DAS22 Task 2.
Top ranking in the challenge of DCASE 2020 Task 4.
Reproducible System Award in the workshop of DCASE 2020.
Judges Award in the workshop of DCASE 2020.
Reproducible System Award in the workshop of DCASE 2019.
Second top ranking in the challenge of DCASE 2019 Task 3.
Education/Academic qualification
Ph.D., Institute of Acoustics, Chinese Academy of Sciences -2013
BSc, Nanjing University - 2008
Person Types
- Staff
Fingerprint
Collaborations and top research areas from the last five years
Projects
- 1 Active
-
Methods Study on Multi-Task Learning for 3D Computational Environmental Audio Analysis
1/01/23 → 31/12/25
Project: Internal Research Project
-
EDTC: enhance depth of text comprehension in automated audio captioning
Tan, L. & Cao, Y., Feb 2024.Research output: Contribution to conference › Paper
-
Selective-Memory Meta-Learning with Environment Representations for Sound Event Localization and Detection
Hu, J. & Cao, Y., Aug 2024, In: IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP).Research output: Contribution to journal › Article › peer-review
-
Towards Out-of-Distribution Detection in Vocoder Recognition via Latent Feature Reconstruction
Du, R., Yao, J. & Cao, Y., Jun 2024.Research output: Contribution to conference › Paper
-
WavCraft: Audio Editing and Generation with Large Language Models
Liang, J., Zhang, H., Liu, H. & Cao, Y., Mar 2024, International Conference on Learning Representations 2024.Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review
-
META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection
Hu, J., Cao, Y., Wu, M., Yang, F., Yu, Z., Wang, W., Plumbley, M. & Yang, J., 2023, The proceedings of the DCASE2023 Workshop have been published as an electronic publication.Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review
-
Audio Deepfake Detection
Yin Cao (Supervisor)
2023 → …Activity: Supervision › Master Dissertation Supervision
-
Deep source separation for speech and music
Yin Cao (Supervisor)
2023 → …Activity: Supervision › Completed SURF Project