ASY-VRNet: Waterway Panoptic Driving Perception Model based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar

Runwei Guan; Shanliang Yao; Xiaohui Zhu; Ka Lok Man; Yong Yue; Jeremy S. Smith; Eng Gee Lim; Yutao Yue

doi:10.1109/IROS58592.2024.10802447

ASY-VRNet: Waterway Panoptic Driving Perception Model based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar

Runwei Guan, Shanliang Yao, Xiaohui Zhu, Ka Lok Man, Yong Yue, Jeremy S. Smith, Eng Gee Lim, Yutao Yue

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

1 Citation (Scopus)

Abstract

Panoptic Driving Perception (PDP) is critical for the autonomous navigation of Unmanned Surface Vehicles (USVs). A PDP model typically integrates multiple tasks, necessitating the simultaneous and robust execution of various perception tasks to facilitate downstream path planning. The fusion of visual and radar sensors is currently acknowledged as a robust and cost-effective approach. However, most existing research has primarily focused on fusing visual and radar features dedicated to object detection or utilizing a shared feature space for multiple tasks, neglecting the individual representation differences between various tasks. To address this gap, we propose a pair of Asymmetric Fair Fusion (AFF) modules with favorable explainability designed to efficiently interact with independent features from both visual and radar modalities, tailored to the specific requirements of object detection and semantic segmentation tasks. The AFF modules treat image and radar maps as irregular point sets and transform these features into a crossed-shared feature space for multitasking, ensuring equitable treatment of vision and radar point cloud features. Leveraging AFF modules, we propose a novel and efficient PDP model, ASY-VRNet, which processes image and radar features based on irregular super-pixel point sets. Additionally, we propose an effective multitask learning method specifically designed for PDP models. Compared to other lightweight models, ASY-VRNet achieves state-of-the-art performance in object detection, semantic segmentation, and drivable-area segmentation on the WaterScenes benchmark. Our project is publicly available at https://github.com/GuanRunwei/ASY-VRNet.

Original language	English
Title of host publication	2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	12479-12486
Number of pages	8
ISBN (Electronic)	9798350377705
DOIs	https://doi.org/10.1109/IROS58592.2024.10802447
Publication status	Published - 2024
Event	2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 - Abu Dhabi, United Arab Emirates Duration: 14 Oct 2024 → 18 Oct 2024

Publication series

Name	IEEE International Conference on Intelligent Robots and Systems
ISSN (Print)	2153-0858
ISSN (Electronic)	2153-0866

Conference

Conference	2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024
Country/Territory	United Arab Emirates
City	Abu Dhabi
Period	14/10/24 → 18/10/24

Access to Document

10.1109/IROS58592.2024.10802447

Cite this

Guan, R., Yao, S., Zhu, X., Man, K. L., Yue, Y., Smith, J. S., Lim, E. G., & Yue, Y. (2024). ASY-VRNet: Waterway Panoptic Driving Perception Model based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024) (pp. 12479-12486). (IEEE International Conference on Intelligent Robots and Systems). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/IROS58592.2024.10802447

Guan, Runwei ; Yao, Shanliang ; Zhu, Xiaohui et al. / ASY-VRNet: Waterway Panoptic Driving Perception Model based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar. 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024). Institute of Electrical and Electronics Engineers Inc., 2024. pp. 12479-12486 (IEEE International Conference on Intelligent Robots and Systems).

@inproceedings{c085ba1645ac4eb884dc098d371c9ecd,

title = "ASY-VRNet: Waterway Panoptic Driving Perception Model based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar",

abstract = "Panoptic Driving Perception (PDP) is critical for the autonomous navigation of Unmanned Surface Vehicles (USVs). A PDP model typically integrates multiple tasks, necessitating the simultaneous and robust execution of various perception tasks to facilitate downstream path planning. The fusion of visual and radar sensors is currently acknowledged as a robust and cost-effective approach. However, most existing research has primarily focused on fusing visual and radar features dedicated to object detection or utilizing a shared feature space for multiple tasks, neglecting the individual representation differences between various tasks. To address this gap, we propose a pair of Asymmetric Fair Fusion (AFF) modules with favorable explainability designed to efficiently interact with independent features from both visual and radar modalities, tailored to the specific requirements of object detection and semantic segmentation tasks. The AFF modules treat image and radar maps as irregular point sets and transform these features into a crossed-shared feature space for multitasking, ensuring equitable treatment of vision and radar point cloud features. Leveraging AFF modules, we propose a novel and efficient PDP model, ASY-VRNet, which processes image and radar features based on irregular super-pixel point sets. Additionally, we propose an effective multitask learning method specifically designed for PDP models. Compared to other lightweight models, ASY-VRNet achieves state-of-the-art performance in object detection, semantic segmentation, and drivable-area segmentation on the WaterScenes benchmark. Our project is publicly available at https://github.com/GuanRunwei/ASY-VRNet.",

author = "Runwei Guan and Shanliang Yao and Xiaohui Zhu and Man, {Ka Lok} and Yong Yue and Smith, {Jeremy S.} and Lim, {Eng Gee} and Yutao Yue",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 ; Conference date: 14-10-2024 Through 18-10-2024",

year = "2024",

doi = "10.1109/IROS58592.2024.10802447",

language = "English",

series = "IEEE International Conference on Intelligent Robots and Systems",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "12479--12486",

booktitle = "2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)",

}

Guan, R, Yao, S, Zhu, X, Man, KL , Yue, Y, Smith, JS, Lim, EG & Yue, Y 2024, ASY-VRNet: Waterway Panoptic Driving Perception Model based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar. in 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024). IEEE International Conference on Intelligent Robots and Systems, Institute of Electrical and Electronics Engineers Inc., pp. 12479-12486, 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024, Abu Dhabi, United Arab Emirates, 14/10/24. https://doi.org/10.1109/IROS58592.2024.10802447

ASY-VRNet: Waterway Panoptic Driving Perception Model based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar. / Guan, Runwei; Yao, Shanliang; Zhu, Xiaohui et al.
2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024). Institute of Electrical and Electronics Engineers Inc., 2024. p. 12479-12486 (IEEE International Conference on Intelligent Robots and Systems).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - ASY-VRNet: Waterway Panoptic Driving Perception Model based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar

AU - Guan, Runwei

AU - Yao, Shanliang

AU - Zhu, Xiaohui

AU - Man, Ka Lok

AU - Yue, Yong

AU - Smith, Jeremy S.

AU - Lim, Eng Gee

AU - Yue, Yutao

PY - 2024

Y1 - 2024

N2 - Panoptic Driving Perception (PDP) is critical for the autonomous navigation of Unmanned Surface Vehicles (USVs). A PDP model typically integrates multiple tasks, necessitating the simultaneous and robust execution of various perception tasks to facilitate downstream path planning. The fusion of visual and radar sensors is currently acknowledged as a robust and cost-effective approach. However, most existing research has primarily focused on fusing visual and radar features dedicated to object detection or utilizing a shared feature space for multiple tasks, neglecting the individual representation differences between various tasks. To address this gap, we propose a pair of Asymmetric Fair Fusion (AFF) modules with favorable explainability designed to efficiently interact with independent features from both visual and radar modalities, tailored to the specific requirements of object detection and semantic segmentation tasks. The AFF modules treat image and radar maps as irregular point sets and transform these features into a crossed-shared feature space for multitasking, ensuring equitable treatment of vision and radar point cloud features. Leveraging AFF modules, we propose a novel and efficient PDP model, ASY-VRNet, which processes image and radar features based on irregular super-pixel point sets. Additionally, we propose an effective multitask learning method specifically designed for PDP models. Compared to other lightweight models, ASY-VRNet achieves state-of-the-art performance in object detection, semantic segmentation, and drivable-area segmentation on the WaterScenes benchmark. Our project is publicly available at https://github.com/GuanRunwei/ASY-VRNet.

AB - Panoptic Driving Perception (PDP) is critical for the autonomous navigation of Unmanned Surface Vehicles (USVs). A PDP model typically integrates multiple tasks, necessitating the simultaneous and robust execution of various perception tasks to facilitate downstream path planning. The fusion of visual and radar sensors is currently acknowledged as a robust and cost-effective approach. However, most existing research has primarily focused on fusing visual and radar features dedicated to object detection or utilizing a shared feature space for multiple tasks, neglecting the individual representation differences between various tasks. To address this gap, we propose a pair of Asymmetric Fair Fusion (AFF) modules with favorable explainability designed to efficiently interact with independent features from both visual and radar modalities, tailored to the specific requirements of object detection and semantic segmentation tasks. The AFF modules treat image and radar maps as irregular point sets and transform these features into a crossed-shared feature space for multitasking, ensuring equitable treatment of vision and radar point cloud features. Leveraging AFF modules, we propose a novel and efficient PDP model, ASY-VRNet, which processes image and radar features based on irregular super-pixel point sets. Additionally, we propose an effective multitask learning method specifically designed for PDP models. Compared to other lightweight models, ASY-VRNet achieves state-of-the-art performance in object detection, semantic segmentation, and drivable-area segmentation on the WaterScenes benchmark. Our project is publicly available at https://github.com/GuanRunwei/ASY-VRNet.

UR - http://www.scopus.com/inward/record.url?scp=85201941735&partnerID=8YFLogxK

U2 - 10.1109/IROS58592.2024.10802447

DO - 10.1109/IROS58592.2024.10802447

M3 - Conference Proceeding

AN - SCOPUS:85201941735

T3 - IEEE International Conference on Intelligent Robots and Systems

SP - 12479

EP - 12486

BT - 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024

Y2 - 14 October 2024 through 18 October 2024

ER -

Guan R, Yao S, Zhu X, Man KL , Yue Y, Smith JS et al. ASY-VRNet: Waterway Panoptic Driving Perception Model based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024). Institute of Electrical and Electronics Engineers Inc. 2024. p. 12479-12486. (IEEE International Conference on Intelligent Robots and Systems). doi: 10.1109/IROS58592.2024.10802447

ASY-VRNet: Waterway Panoptic Driving Perception Model based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this