Enhancing Object Detection in Adverse Weather Conditions Through Entropy and Guided Multimodal Fusion

Zhenrong Zhang, Haoyan Gong, Yuzheng Feng, Zixuan Chu, Hongbin Liu*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference Proceeding › peer-review

Abstract

Integrating diverse representations from complementary sensing modalities is essential for robust scene interpretation in autonomous driving. Deep learning architectures that fuse vision and range data have advanced 2D and 3D object detection in recent years. However, these modalities often degrade in adverse weather or lighting conditions, reducing detection performance. While domain adaptation methods have been developed to bridge the gap between source and target domains, they typically fall short because of inherent discrepancies between the two domains, which can manifest as different data distributions and different feature spaces. This paper introduces a comprehensive domain-adaptive object detection framework. Built on deep transfer learning, the framework is designed to generalize robustly from labelled clear-weather data to unlabelled adverse-weather conditions, enhancing the performance of deep learning-based object detection models. Central to our approach is the innovative Patch Entropy Fusion Module (PEFM), which dynamically integrates sensor data, emphasizing critical information and suppressing background distractions. This is complemented by a novel Weighted Decision Module (WDM) that adjusts the contribution of each sensor according to its efficacy under specific environmental conditions, thereby optimizing detection accuracy. Additionally, we apply a domain align loss during transfer learning to ensure effective domain adaptation by regularizing feature-map discrepancies between clear- and adverse-weather datasets. We evaluate our model on diverse datasets, including ExDark (unimodal), Cityscapes (unimodal), and Dense (multimodal), where it ranked 1st on all datasets at the time of our evaluation.
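The paper's implementation is not reproduced in this record. As an illustrative sketch only (not the authors' PEFM), patch-wise entropy-weighted fusion of two aligned feature maps might look like the following, where patches with higher Shannon entropy — taken here as carrying more information — receive more weight; the patch size, normalization scheme, and function names are assumptions for illustration:

```python
import numpy as np

def patch_entropy(feat, patch=4, eps=1e-8):
    """Shannon entropy of each non-overlapping patch of a 2D feature map."""
    h, w = feat.shape
    ents = np.zeros((h // patch, w // patch))
    for i in range(h // patch):
        for j in range(w // patch):
            p = feat[i*patch:(i+1)*patch, j*patch:(j+1)*patch].ravel()
            p = np.abs(p) + eps
            p = p / p.sum()                      # normalise patch to a distribution
            ents[i, j] = -(p * np.log(p)).sum()  # Shannon entropy of the patch
    return ents

def entropy_fuse(feat_a, feat_b, patch=4):
    """Fuse two aligned feature maps, weighting each patch by its
    relative entropy (higher entropy -> larger weight)."""
    ea = patch_entropy(feat_a, patch)
    eb = patch_entropy(feat_b, patch)
    wa = ea / (ea + eb + 1e-8)                   # per-patch weight for modality A
    wa = np.kron(wa, np.ones((patch, patch)))    # upsample weights to pixel grid
    return wa * feat_a + (1 - wa) * feat_b

rng = np.random.default_rng(0)
cam = rng.random((8, 8))      # stand-in for a camera feature map
lidar = rng.random((8, 8))    # stand-in for a range-sensor feature map
fused = entropy_fuse(cam, lidar)
print(fused.shape)            # (8, 8)
```

Because the per-patch weights lie in (0, 1), the fused map is an element-wise convex combination of the two inputs; the actual PEFM operates on learned multi-channel features and is trained end to end.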

Original language: English
Title of host publication: Computer Vision – ACCV 2024 - 17th Asian Conference on Computer Vision, Proceedings
Editors: Minsu Cho, Ivan Laptev, Du Tran, Angela Yao, Hongbin Zha
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 22-38
Number of pages: 17
ISBN (Print): 9789819609710
DOIs
Publication status: Published - 2025
Event: 17th Asian Conference on Computer Vision, ACCV 2024 - Hanoi, Viet Nam
Duration: 8 Dec 2024 – 12 Dec 2024

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 15481 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 17th Asian Conference on Computer Vision, ACCV 2024
Country/Territory: Viet Nam
City: Hanoi
Period: 8/12/24 – 12/12/24

Keywords

  • Domain adaptation
  • Entropy fusion
  • Multimodal fusion
