DSOSplat: Monocular 3D Gaussian SLAM with Direct Tracking

Yi Zhou, Zhetao Guo, Dong Li, Runwei Guan, Yuxiang Ren, Hongyu Wang, Mingrui Li*

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-review

Abstract

Simultaneous Localization and Mapping (SLAM) is a key technology in robot navigation, augmented reality, and autonomous driving. However, existing dense SLAM methods are constrained by their reliance on external depth observers and by high computational costs, which limits their application in human-computer interaction, particularly in AR and VR. To address these challenges, we propose DSOSplat, a monocular SLAM framework based on 3D Gaussian Splatting. We generate dense depth maps with absolute scale consistency using a self-calibrated adaptive multiview stereo (SC-AMVS) algorithm. The accuracy and robustness of depth estimation are significantly improved through dynamic weighted fusion, local constraints, and a scale calibration factor. Our visual odometry module leverages composite depth maps and a keyframe selection strategy to further enhance tracking and reconstruction performance. Furthermore, we propose a depth smoothing regularization (DSR) method that optimizes local gradients and global consistency, thereby improving the geometric expressiveness of Gaussian Splatting and the quality of scene reconstruction. Experimental results demonstrate that DSOSplat achieves efficient localization and high-accuracy scene reconstruction in dynamic and complex environments, offering new possibilities for the development of monocular SLAM. We also evaluate the algorithm in real-world scenarios, where it likewise shows noteworthy performance.
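
The abstract does not say how the scale calibration factor is computed. As a rough illustration only, the sketch below uses a common median-ratio alignment between up-to-scale depth estimates and sparse reference depths at the same pixels. This is an assumed stand-in, not the authors' SC-AMVS formulation, and the function names `scale_calibration_factor` and `apply_scale` as well as the sample values are hypothetical.

```python
from statistics import median


def scale_calibration_factor(estimated, reference, min_depth=1e-6):
    """Median of reference/estimated ratios over valid depth pairs.

    Assumption: both sequences hold depths sampled at the same pixels.
    The median keeps a few outlier pairs from dominating the scale.
    """
    ratios = [
        ref / est
        for est, ref in zip(estimated, reference)
        if est > min_depth and ref > min_depth  # skip invalid depths
    ]
    if not ratios:
        raise ValueError("no valid depth pairs to calibrate against")
    return median(ratios)


def apply_scale(depth_map, scale):
    """Multiply every depth in a row-major 2D list by the scale factor."""
    return [[d * scale for d in row] for row in depth_map]


# Hypothetical data: up-to-scale depths and sparse reference depths.
mvs_samples = [0.5, 1.0, 2.0, 4.0]
reference_samples = [1.0, 2.0, 4.0, 20.0]  # the last pair is an outlier

scale = scale_calibration_factor(mvs_samples, reference_samples)
calibrated = apply_scale([[0.5, 1.0], [2.0, 4.0]], scale)
```

With these sample values the ratios are 2, 2, 2, and 5, so the median ignores the outlier and the depth map is scaled by 2. A mean-based factor would instead be pulled toward the outlier, which is why a robust statistic is a common choice for this kind of alignment.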

Original language: English
Journal: IEEE Sensors Journal
Publication status: Accepted/In press - 2025
Externally published: Yes

Keywords

  • 3D Gaussian Splatting
  • Scene Reconstruction
  • Simultaneous Localization and Mapping (SLAM)
