TY - GEN
T1 - A MIMO Wireless Channel Foundation Model via CIR-CSI Consistency
AU - Jiang, Jun
AU - Yu, Wenjun
AU - Li, Yunfan
AU - Gao, Yuan
AU - Xu, Shugong
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - In the field of artificial intelligence, self-supervised learning has demonstrated superior generalization capabilities by leveraging large-scale unlabeled datasets for pretraining, which is especially critical for wireless communication models to adapt to a variety of scenarios. This paper innovatively treats Channel State Information (CSI) and Channel Impulse Response (CIR) as naturally aligned multi-modal data and proposes the first MIMO wireless channel foundation model, named CSI-CLIP. By effectively capturing the joint representations of both CIR and CSI, CSI-CLIP exhibits remarkable adaptability across scenarios and robust feature extraction capabilities. Experimental results show that in positioning task, CSI-CLIP reduces the mean error distance by 22%; in beam management task, it increases accuracy by 1% compared to traditional supervised methods, as well as in the channel identification task. These improvements not only highlight the potential and value of CSI-CLIP in integrating sensing and communication but also demonstrate its significant advantages over existing techniques. Moreover, viewing CSI and CIR as multi-modal pairs and contrastive learning for wireless channel foundation model open up new research directions in the domain of MIMO wireless communications.
AB - In the field of artificial intelligence, self-supervised learning has demonstrated superior generalization capabilities by leveraging large-scale unlabeled datasets for pretraining, which is especially critical for wireless communication models to adapt to a variety of scenarios. This paper innovatively treats Channel State Information (CSI) and Channel Impulse Response (CIR) as naturally aligned multi-modal data and proposes the first MIMO wireless channel foundation model, named CSI-CLIP. By effectively capturing the joint representations of both CIR and CSI, CSI-CLIP exhibits remarkable adaptability across scenarios and robust feature extraction capabilities. Experimental results show that in positioning task, CSI-CLIP reduces the mean error distance by 22%; in beam management task, it increases accuracy by 1% compared to traditional supervised methods, as well as in the channel identification task. These improvements not only highlight the potential and value of CSI-CLIP in integrating sensing and communication but also demonstrate its significant advantages over existing techniques. Moreover, viewing CSI and CIR as multi-modal pairs and contrastive learning for wireless channel foundation model open up new research directions in the domain of MIMO wireless communications.
KW - Beam Management
KW - Channel Identification
KW - Foundation Models
KW - Integrating Sensing And Communication (ISAC)
KW - Positioning
KW - Self-Supervised Learning
UR - https://www.scopus.com/pages/publications/105016779443
UR - https://arxiv.org/abs/2507.13637
U2 - 10.1109/ICMLCN64995.2025.11140262
DO - 10.1109/ICMLCN64995.2025.11140262
M3 - Conference Proceeding
AN - SCOPUS:105016779443
T3 - 2025 IEEE International Conference on Machine Learning for Communication and Networking, ICMLCN 2025
BT - 2025 IEEE International Conference on Machine Learning for Communication and Networking, ICMLCN 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd IEEE International Conference on Machine Learning for Communication and Networking, ICMLCN 2025
Y2 - 26 May 2025 through 29 May 2025
ER -