Crack detection of masonry structure based on thermal and visible image fusion and semantic segmentation

Hong Huang, Yuanzhi Cai, Cheng Zhang*, Yiming Lu, Amin Hammad, Lei Fan

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)


The integration of visible and thermal images has demonstrated the potential ability to enhance crack segmentation accuracy. However, due to the intricate texture of masonry structures and the challenges posed in precisely aligning these cross-modality images, it is necessary to explore pixel-level alignment and develop a comprehensive dataset to enable deep-learning based methods. Therefore, a dataset, Crack900, including five image types, is developed together with a proposed two-step registration to achieve highly accurate pixel-level alignment. In addition, both Train from Scratch and Transfer Learning (TL) strategies are applied on eleven models to investigate the impact of different fused image types. Our findings reveal that the concatenation strategy markedly improves segmentation accuracy, and the performance of TL depends on the compatibility of channel numbers and domain difference between pre-trained and target models. These findings pave the way for further development of cross-modality in masonry crack segmentation methodologies for structural health monitorin.

Original languageEnglish
Article number105213
JournalAutomation in Construction
Publication statusPublished - Feb 2024


  • CNN-based networks
  • Crack segmentation
  • Masonry structure
  • Semantic segmentation
  • Thermal and visible image fusion
  • Transformer-based networks


Dive into the research topics of 'Crack detection of masonry structure based on thermal and visible image fusion and semantic segmentation'. Together they form a unique fingerprint.

Cite this