Skip to main navigation Skip to search Skip to main content

Large Language Models as Causal Reasoners for Robust Medical Segmentation: CAUSAL-SAM-LLM

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

63 Downloads (Pure)

Abstract

The clinical utility of deep learning for medical image segmentation is hindered by poor generalization, as models often learn spurious correlations between anatomy and domain-specific styles. To address this, we introduce Causal-SAM-LLM, a framework that leverages Large Language Models(LLMs) for causal reasoning atop a frozen Segment Anything Model (SAM) encoder. Our approach introduces two synergistic components. First, Linguistic Adversarial Disentanglement (LAD) uses a Vision-Language Model to generate textual descriptions of confounding styles and contrastively trains the segmentation features to be style-invariant. Second, Test-Time Causal Intervention (TCI) allows an LLM to interpret a clinician’s natural language commands to modulate decoder features for real-time error correction. On a composite benchmark from four public datasets (BTCV, CHAOS, AMOS, BraTS), we evaluate out-of-distribution (OOD) generalization across scanners, modalities, and anatomies.Causal-SAM-LLM sets a new state-of-the-art, outperforming the strongest baseline by up to 5.2 points in Dice score and 10.0 mm in Hausdorff Distance, while fine-tuning under 8% of the total parameters. Our work charts a path toward robust, efficient, and interactively controllable medical AI.
Original languageEnglish
Title of host publication2026 IEEE International Conference on Acoustics, Speech, and Signal Processing
Subtitle of host publicationICASSP 2026
PublisherIEEE Press
Number of pages5
Publication statusAccepted/In press - 22 Jan 2026
Event2026 IEEE International Conference on Acoustics, Speech, and Signal Processing: ICASSP 2026 - Barcelona, Spain, Barcelona, Spain
Duration: 4 May 20268 May 2026
https://2026.ieeeicassp.org/

Conference

Conference2026 IEEE International Conference on Acoustics, Speech, and Signal Processing
Country/TerritorySpain
CityBarcelona
Period4/05/268/05/26
Internet address

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Cite this