Textual Analysis of Insurance Claims with Large Language Models

Dongchen Li; Zhuo Jin; Linyi Qian; Hailiang Yang

Textual Analysis of Insurance Claims with Large Language Models

Dongchen Li, Zhuo Jin, Linyi Qian^*, Hailiang Yang

^*Corresponding author for this work

Department of Financial and Actuarial Mathematics

Research output: Contribution to journal › Article › peer-review

Abstract

This study proposes a comprehensive and general framework designed for exam-ining discrepancies in textual content by large language models (LLMs), broading application scenarios in the ï¬�elds of insurtech and risk management and conduct-ing empirical research based on actual needs and real-world data. Our framework incorporates OpenAIâ€™s interface to embed texts and project them into external cat-egories, and utilizes distance metrics to fulï¬�ll discrepancy judgement. To exhibit signiï¬�cant disparities, we design prompts to analyse three relationships: the same information, logical relationship and potential relationship. In our empirical anal-ysis, ChatGPT reveals 22.1% of samples exhibit substantial semantic discrepancy in text statements and require further manual investigation, 38.1% of samples with large diï¬€erences contain at least one of the identiï¬�ed relationships. The average processing time for each sample does not exceed 4 seconds, and all processes can be adjusted and explained according to actual needs. The backtesting results and comparisons with traditional NLP methods further indicate that our method is both eï¬€ective and robust.

Original language	English
Number of pages	31
Journal	Journal of Risk and Insurance
Publication status	Accepted/In press - 5 Mar 2025

Keywords

Large language model;
insurance claim settlement, risk mangagement, discrepancy analysis, distance metrics

Cite this

@article{fe75f95b06a9405997164173517d149e,

title = "Textual Analysis of Insurance Claims with Large Language Models",

abstract = "This study proposes a comprehensive and general framework designed for exam-ining discrepancies in textual content by large language models (LLMs), broading application scenarios in the {\"i}¬�elds of insurtech and risk management and conduct-ing empirical research based on actual needs and real-world data. Our framework incorporates OpenAI{\^a}€{\texttrademark}s interface to embed texts and project them into external cat-egories, and utilizes distance metrics to ful{\"i}¬�ll discrepancy judgement. To exhibit signi{\"i}¬�cant disparities, we design prompts to analyse three relationships: the same information, logical relationship and potential relationship. In our empirical anal-ysis, ChatGPT reveals 22.1% of samples exhibit substantial semantic discrepancy in text statements and require further manual investigation, 38.1% of samples with large di{\"i}¬€erences contain at least one of the identi{\"i}¬�ed relationships. The average processing time for each sample does not exceed 4 seconds, and all processes can be adjusted and explained according to actual needs. The backtesting results and comparisons with traditional NLP methods further indicate that our method is both e{\"i}¬€ective and robust.",

keywords = "Large language model;, insurance claim settlement, risk mangagement, discrepancy analysis, distance metrics",

author = "Dongchen Li and Zhuo Jin and Linyi Qian and Hailiang Yang",

year = "2025",

month = mar,

day = "5",

language = "English",

journal = "Journal of Risk and Insurance",

}

TY - JOUR

T1 - Textual Analysis of Insurance Claims with Large Language Models

AU - Li, Dongchen

AU - Jin, Zhuo

AU - Qian, Linyi

AU - Yang, Hailiang

PY - 2025/3/5

Y1 - 2025/3/5

N2 - This study proposes a comprehensive and general framework designed for exam-ining discrepancies in textual content by large language models (LLMs), broading application scenarios in the ï¬�elds of insurtech and risk management and conduct-ing empirical research based on actual needs and real-world data. Our framework incorporates OpenAIâ€™s interface to embed texts and project them into external cat-egories, and utilizes distance metrics to fulï¬�ll discrepancy judgement. To exhibit signiï¬�cant disparities, we design prompts to analyse three relationships: the same information, logical relationship and potential relationship. In our empirical anal-ysis, ChatGPT reveals 22.1% of samples exhibit substantial semantic discrepancy in text statements and require further manual investigation, 38.1% of samples with large diï¬€erences contain at least one of the identiï¬�ed relationships. The average processing time for each sample does not exceed 4 seconds, and all processes can be adjusted and explained according to actual needs. The backtesting results and comparisons with traditional NLP methods further indicate that our method is both eï¬€ective and robust.

AB - This study proposes a comprehensive and general framework designed for exam-ining discrepancies in textual content by large language models (LLMs), broading application scenarios in the ï¬�elds of insurtech and risk management and conduct-ing empirical research based on actual needs and real-world data. Our framework incorporates OpenAIâ€™s interface to embed texts and project them into external cat-egories, and utilizes distance metrics to fulï¬�ll discrepancy judgement. To exhibit signiï¬�cant disparities, we design prompts to analyse three relationships: the same information, logical relationship and potential relationship. In our empirical anal-ysis, ChatGPT reveals 22.1% of samples exhibit substantial semantic discrepancy in text statements and require further manual investigation, 38.1% of samples with large diï¬€erences contain at least one of the identiï¬�ed relationships. The average processing time for each sample does not exceed 4 seconds, and all processes can be adjusted and explained according to actual needs. The backtesting results and comparisons with traditional NLP methods further indicate that our method is both eï¬€ective and robust.

KW - Large language model;

KW - insurance claim settlement, risk mangagement, discrepancy analysis, distance metrics

M3 - Article

JO - Journal of Risk and Insurance

JF - Journal of Risk and Insurance

ER -

Textual Analysis of Insurance Claims with Large Language Models

Abstract

Keywords

Fingerprint

Cite this