TY - JOUR
T1 - Crack detection of masonry structure based on thermal and visible image fusion and semantic segmentation
AU - Huang, Hong
AU - Cai, Yuanzhi
AU - Zhang, Cheng
AU - Lu, Yiming
AU - Hammad, Amin
AU - Fan, Lei
N1 - Publisher Copyright:
© 2023 Elsevier B.V.
PY - 2024/2
Y1 - 2024/2
N2 - The integration of visible and thermal images has demonstrated the potential ability to enhance crack segmentation accuracy. However, due to the intricate texture of masonry structures and the challenges posed in precisely aligning these cross-modality images, it is necessary to explore pixel-level alignment and develop a comprehensive dataset to enable deep-learning based methods. Therefore, a dataset, Crack900, including five image types, is developed together with a proposed two-step registration to achieve highly accurate pixel-level alignment. In addition, both Train from Scratch and Transfer Learning (TL) strategies are applied on eleven models to investigate the impact of different fused image types. Our findings reveal that the concatenation strategy markedly improves segmentation accuracy, and the performance of TL depends on the compatibility of channel numbers and domain difference between pre-trained and target models. These findings pave the way for further development of cross-modality in masonry crack segmentation methodologies for structural health monitorin.
AB - The integration of visible and thermal images has demonstrated the potential ability to enhance crack segmentation accuracy. However, due to the intricate texture of masonry structures and the challenges posed in precisely aligning these cross-modality images, it is necessary to explore pixel-level alignment and develop a comprehensive dataset to enable deep-learning based methods. Therefore, a dataset, Crack900, including five image types, is developed together with a proposed two-step registration to achieve highly accurate pixel-level alignment. In addition, both Train from Scratch and Transfer Learning (TL) strategies are applied on eleven models to investigate the impact of different fused image types. Our findings reveal that the concatenation strategy markedly improves segmentation accuracy, and the performance of TL depends on the compatibility of channel numbers and domain difference between pre-trained and target models. These findings pave the way for further development of cross-modality in masonry crack segmentation methodologies for structural health monitorin.
KW - CNN-based networks
KW - Crack segmentation
KW - Masonry structure
KW - Semantic segmentation
KW - Thermal and visible image fusion
KW - Transformer-based networks
UR - http://www.scopus.com/inward/record.url?scp=85179123160&partnerID=8YFLogxK
U2 - 10.1016/j.autcon.2023.105213
DO - 10.1016/j.autcon.2023.105213
M3 - Article
AN - SCOPUS:85179123160
SN - 0926-5805
VL - 158
JO - Automation in Construction
JF - Automation in Construction
M1 - 105213
ER -