TY - JOUR
T1 - Cross-Scenario Foundation Localization Models
T2 - Architecture, Key Technologies, and Challenges
AU - Si, Haonan
AU - Guo, Xiansheng
AU - Boateng, Gordon Owusu
AU - Xia, Huang
N1 - Publisher Copyright:
© 2002-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - As a core technology in the realm of ubiquitous mobile intelligence, indoor localization is shifting from the traditional “single scenario-driven” and static localization to a “cross-scenario” and generalized dynamic localization paradigm. However, the high dynamics and uncertainties across different scenarios, sensor heterogeneity, and the lack of standardized training data pose severe challenges to the dynamic localization concept. Recently, foundation models (e.g., DeepSeek, GPT, and LLaMA, etc.) have raised an unprecedented wave of research fervor by virtue of their extraordinary capability to seamlessly adapt to multiple downstream tasks across diverse scenarios. Inspired by the capabilities of foundation models and the cross-scenario dynamic localization, we introduce the concept of Foundation Localization Models (FLMs), aiming to achieve cross-scenario and cross-modality generalized localization. Subsequently, we propose its architecture and three key techniques: geometry-aligned backbone components, multi-modality co-temporal-spatial attention, and multi-dimensional data generation framework, which are interwoven to lay a solid theoretical foundation for cross-scenario generalized localization, clarifying the coupling mechanisms between localization tasks and the environments in complex scenarios. Experimental results are presented to demonstrate the feasibility and efficiency of the proposed FLM concept. Finally, several challenges and future research directions for implementing the FLM framework are highlighted.
AB - As a core technology in the realm of ubiquitous mobile intelligence, indoor localization is shifting from the traditional “single scenario-driven” and static localization to a “cross-scenario” and generalized dynamic localization paradigm. However, the high dynamics and uncertainties across different scenarios, sensor heterogeneity, and the lack of standardized training data pose severe challenges to the dynamic localization concept. Recently, foundation models (e.g., DeepSeek, GPT, and LLaMA, etc.) have raised an unprecedented wave of research fervor by virtue of their extraordinary capability to seamlessly adapt to multiple downstream tasks across diverse scenarios. Inspired by the capabilities of foundation models and the cross-scenario dynamic localization, we introduce the concept of Foundation Localization Models (FLMs), aiming to achieve cross-scenario and cross-modality generalized localization. Subsequently, we propose its architecture and three key techniques: geometry-aligned backbone components, multi-modality co-temporal-spatial attention, and multi-dimensional data generation framework, which are interwoven to lay a solid theoretical foundation for cross-scenario generalized localization, clarifying the coupling mechanisms between localization tasks and the environments in complex scenarios. Experimental results are presented to demonstrate the feasibility and efficiency of the proposed FLM concept. Finally, several challenges and future research directions for implementing the FLM framework are highlighted.
UR - https://www.scopus.com/pages/publications/105020979544
U2 - 10.1109/MWC.2025.3614570
DO - 10.1109/MWC.2025.3614570
M3 - Article
AN - SCOPUS:105020979544
SN - 1536-1284
JO - IEEE Wireless Communications
JF - IEEE Wireless Communications
ER -