Audio-Radar SMC-PHD Filtering for Indoor Multi-Speaker Tracking

Yi Zhou, Miguel Lopez-Benitez, Limin Yu*, Yutao Yue*

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

Abstract

High-resolution millimetre-wave (mmWave) radar sensors have become increasingly popular in consumer markets. This study addresses the challenge of tracking multiple active speakers in indoor environments using high-resolution radar and microphone array. Through our experiments, we have observed that the Sequential Monte Carlo Probability Hypothesis Density (SMC-PHD) filter, when given point cloud data from a high-resolution radar as input, can provide promising tracking performance. In this work, we add another modality, i.e., audio, to the radar SMC-PHD filtering framework for the active speaker tracking task. Specifically, we use the audio Direction of Arrival (DoA) to guide the particle birth and relocation process in the SMC-PHD filtering framework. Furthermore, we propose a likelihood function that jointly considers the spatial and angular estimation from radar and audio. Experimental results on the RAV4D dataset demonstrate that our audio-radar SMC-PHD filtering approach produces reliable trajectories, especially in the challenging cases such as varying numbers of speakers.

Original languageEnglish
Title of host publication2024 9th International Conference on Signal and Image Processing, ICSIP 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages282-286
Number of pages5
ISBN (Electronic)9798350350920
DOIs
Publication statusPublished - 2024
Event9th International Conference on Signal and Image Processing, ICSIP 2024 - Hybrid, Nanjing, China
Duration: 12 Jul 202414 Jul 2024

Publication series

Name2024 9th International Conference on Signal and Image Processing, ICSIP 2024

Conference

Conference9th International Conference on Signal and Image Processing, ICSIP 2024
Country/TerritoryChina
CityHybrid, Nanjing
Period12/07/2414/07/24

Keywords

  • audio-radar fusion
  • object tracking
  • PHD filtering

Fingerprint

Dive into the research topics of 'Audio-Radar SMC-PHD Filtering for Indoor Multi-Speaker Tracking'. Together they form a unique fingerprint.

Cite this