IndoorMS: A Multispectral Dataset for Semantic Segmentation in Indoor Scene Understanding

Qinfeng Zhu; Jingjing Xiao; Lei Fan

doi:10.1109/JSEN.2025.3559348

IndoorMS: A Multispectral Dataset for Semantic Segmentation in Indoor Scene Understanding

Qinfeng Zhu, Jingjing Xiao, Lei Fan^*

^*Corresponding author for this work

Army Medical University

Research output: Contribution to journal › Article › peer-review

Abstract

Indoor scene understanding is a critical task in computer vision, traditionally relying on RGB data for deep learning-based semantic segmentation to achieve pixel-level understanding. However, indoor environments provide valuable information beyond the visible light spectrum, which has been largely overlooked in existing research. To address this gap, we introduce IndoorMS, a comprehensive multispectral dataset specifically designed for the semantic segmentation of indoor scenes. The dataset comprises images captured using a multispectral sensor in 17 buildings across diverse indoor settings, including meeting rooms, halls, lounges, offices, corridors, and classrooms. With 19 finely annotated semantic categories, IndoorMS enables robust evaluation of indoor scene segmentation. Benchmark experiments are performed using several leading semantic segmentation frameworks, followed by a thorough analysis of their performance. The results indicate that the optimal model combination, namely ConvNeXt-s with UperNet, achieved an mF1 score of 82.38 and an mIoU score of 72.90. Despite these promising results, IndoorMS's challenges on segmentation networks remain, such as class distribution imbalance and domain gaps between RGB and multispectral data. This work marks the first effort to support multispectral indoor scene understanding with a dedicated dataset, offering new opportunities for research in this domain. Potential avenues for future research are presented. The project page for the IndoorMS dataset is available at https://zhuqinfeng1999.github.io/IndoorMS/. (The dataset will be publicly available for download after peer review.).

Original language	English
Pages (from-to)	19837-19847
Number of pages	11
Journal	IEEE Sensors Journal
Volume	25
Issue number	11
DOIs	https://doi.org/10.1109/JSEN.2025.3559348
Publication status	Accepted/In press - 2025

Keywords

Dataset
Image
Indoor
Multispectral
Scene Understanding
Semantic Segmentation

Access to Document

10.1109/JSEN.2025.3559348

Cite this

@article{85f1c0491bdf402890c14467362898b1,

title = "IndoorMS: A Multispectral Dataset for Semantic Segmentation in Indoor Scene Understanding",

abstract = "Indoor scene understanding is a critical task in computer vision, traditionally relying on RGB data for deep learning-based semantic segmentation to achieve pixel-level understanding. However, indoor environments provide valuable information beyond the visible light spectrum, which has been largely overlooked in existing research. To address this gap, we introduce IndoorMS, a comprehensive multispectral dataset specifically designed for the semantic segmentation of indoor scenes. The dataset comprises images captured using a multispectral sensor in 17 buildings across diverse indoor settings, including meeting rooms, halls, lounges, offices, corridors, and classrooms. With 19 finely annotated semantic categories, IndoorMS enables robust evaluation of indoor scene segmentation. Benchmark experiments are performed using several leading semantic segmentation frameworks, followed by a thorough analysis of their performance. The results indicate that the optimal model combination, namely ConvNeXt-s with UperNet, achieved an mF1 score of 82.38 and an mIoU score of 72.90. Despite these promising results, IndoorMS's challenges on segmentation networks remain, such as class distribution imbalance and domain gaps between RGB and multispectral data. This work marks the first effort to support multispectral indoor scene understanding with a dedicated dataset, offering new opportunities for research in this domain. Potential avenues for future research are presented. The project page for the IndoorMS dataset is available at https://zhuqinfeng1999.github.io/IndoorMS/. (The dataset will be publicly available for download after peer review.).",

keywords = "Dataset, Image, Indoor, Multispectral, Scene Understanding, Semantic Segmentation",

author = "Qinfeng Zhu and Jingjing Xiao and Lei Fan",

note = "Publisher Copyright: {\textcopyright} 2001-2012 IEEE.",

year = "2025",

doi = "10.1109/JSEN.2025.3559348",

language = "English",

volume = "25",

pages = "19837--19847",

journal = "IEEE Sensors Journal",

issn = "1530-437X",

number = "11",

}

TY - JOUR

T1 - IndoorMS

T2 - A Multispectral Dataset for Semantic Segmentation in Indoor Scene Understanding

AU - Zhu, Qinfeng

AU - Xiao, Jingjing

AU - Fan, Lei

PY - 2025

Y1 - 2025

N2 - Indoor scene understanding is a critical task in computer vision, traditionally relying on RGB data for deep learning-based semantic segmentation to achieve pixel-level understanding. However, indoor environments provide valuable information beyond the visible light spectrum, which has been largely overlooked in existing research. To address this gap, we introduce IndoorMS, a comprehensive multispectral dataset specifically designed for the semantic segmentation of indoor scenes. The dataset comprises images captured using a multispectral sensor in 17 buildings across diverse indoor settings, including meeting rooms, halls, lounges, offices, corridors, and classrooms. With 19 finely annotated semantic categories, IndoorMS enables robust evaluation of indoor scene segmentation. Benchmark experiments are performed using several leading semantic segmentation frameworks, followed by a thorough analysis of their performance. The results indicate that the optimal model combination, namely ConvNeXt-s with UperNet, achieved an mF1 score of 82.38 and an mIoU score of 72.90. Despite these promising results, IndoorMS's challenges on segmentation networks remain, such as class distribution imbalance and domain gaps between RGB and multispectral data. This work marks the first effort to support multispectral indoor scene understanding with a dedicated dataset, offering new opportunities for research in this domain. Potential avenues for future research are presented. The project page for the IndoorMS dataset is available at https://zhuqinfeng1999.github.io/IndoorMS/. (The dataset will be publicly available for download after peer review.).

AB - Indoor scene understanding is a critical task in computer vision, traditionally relying on RGB data for deep learning-based semantic segmentation to achieve pixel-level understanding. However, indoor environments provide valuable information beyond the visible light spectrum, which has been largely overlooked in existing research. To address this gap, we introduce IndoorMS, a comprehensive multispectral dataset specifically designed for the semantic segmentation of indoor scenes. The dataset comprises images captured using a multispectral sensor in 17 buildings across diverse indoor settings, including meeting rooms, halls, lounges, offices, corridors, and classrooms. With 19 finely annotated semantic categories, IndoorMS enables robust evaluation of indoor scene segmentation. Benchmark experiments are performed using several leading semantic segmentation frameworks, followed by a thorough analysis of their performance. The results indicate that the optimal model combination, namely ConvNeXt-s with UperNet, achieved an mF1 score of 82.38 and an mIoU score of 72.90. Despite these promising results, IndoorMS's challenges on segmentation networks remain, such as class distribution imbalance and domain gaps between RGB and multispectral data. This work marks the first effort to support multispectral indoor scene understanding with a dedicated dataset, offering new opportunities for research in this domain. Potential avenues for future research are presented. The project page for the IndoorMS dataset is available at https://zhuqinfeng1999.github.io/IndoorMS/. (The dataset will be publicly available for download after peer review.).

KW - Dataset

KW - Image

KW - Indoor

KW - Multispectral

KW - Scene Understanding

KW - Semantic Segmentation

UR - http://www.scopus.com/inward/record.url?scp=105002805455&partnerID=8YFLogxK

U2 - 10.1109/JSEN.2025.3559348

DO - 10.1109/JSEN.2025.3559348

M3 - Article

AN - SCOPUS:105002805455

SN - 1530-437X

VL - 25

SP - 19837

EP - 19847

JO - IEEE Sensors Journal

JF - IEEE Sensors Journal

IS - 11

ER -

IndoorMS: A Multispectral Dataset for Semantic Segmentation in Indoor Scene Understanding

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this