RAV4D: A Radar-Audia-Visual Dataset for Indoor Multi - Person Tracking

Yi Zhou, Ningfei Song, Jieming Ma, Ka Lok Man, Miguel López-Benítez, Limin Yu*, Yutao Yue*

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

1 Citation (Scopus)

Abstract

Indoor multi-person tracking is a widely explored area of research. However, publicly available datasets are either oversimplified or provide only visual data. To fill this gap, our paper presents the RAV4D dataset, a novel multimodal dataset that includes data from radar, microphone arrays, and stereo cameras. This dataset is characterised by the provision of 3D positions, Euler angles and Doppler velocities. By integrating these different data types, RAV 4D aims to exploit the synergistic and complementary capabilities of these modalities to improve tracking performance. The development of RAV4D addresses two main challenges: sensor calibration and 3D annotation. A novel calibration target is designed to effectively calibrate the radar, stereo camera and microphone array. In addition, a visually guided annotation framework is proposed to address the challenge of annotating radar data. This framework uses head positions, heading orientation and depth information from stereo cameras and radar to establish accurate ground truth for multimodal tracking trajectories. The dataset is publicly available at https://zenodo.org/records/l0208199.

Original languageEnglish
Title of host publicationRadarConf 2024 - 2024 IEEE Radar Conference, Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350329209
DOIs
Publication statusPublished - May 2024
Event2024 IEEE Radar Conference, RadarConf 2024 - Denver, United States
Duration: 6 May 202410 May 2024

Publication series

NameProceedings of the IEEE Radar Conference
ISSN (Print)1097-5764
ISSN (Electronic)2375-5318

Conference

Conference2024 IEEE Radar Conference, RadarConf 2024
Country/TerritoryUnited States
CityDenver
Period6/05/2410/05/24

Keywords

  • Multiple Object Tracking
  • Radar Tracking
  • Sensor Fusion
  • Speaker Tracking

Fingerprint

Dive into the research topics of 'RAV4D: A Radar-Audia-Visual Dataset for Indoor Multi - Person Tracking'. Together they form a unique fingerprint.

Cite this