TY - JOUR
T1 - Textual analysis of insurance claims with large language models
AU - Li, Dongchen
AU - Jin, Zhuo
AU - Qian, Linyi
AU - Yang, Hailiang
N1 - Publisher Copyright:
© 2025 American Risk and Insurance Association.
PY - 2025/6
Y1 - 2025/6
N2 - This study proposes a comprehensive and general framework for examining discrepancies in textual content using large language models (LLMs), broadening application scenarios in the insurtech and risk management fields, and conducting empirical research based on actual needs and real-world data. Our framework integrates OpenAI's interface to embed texts and project them into external categories while utilizing distance metrics to evaluate discrepancies. To identify significant disparities, we design prompts to analyze three types of relationships: identical information, logical relationships and potential relationships. Our empirical analysis shows that 22.1% of samples exhibit substantial semantic discrepancies, and 38.1% of the samples with significant differences contain at least one of the identified relationships. The average processing time for each sample does not exceed 4 s, and all processes can be adjusted based on actual needs. Backtesting results and comparisons with traditional NLP methods further demonstrate that our proposed method is both effective and robust.
AB - This study proposes a comprehensive and general framework for examining discrepancies in textual content using large language models (LLMs), broadening application scenarios in the insurtech and risk management fields, and conducting empirical research based on actual needs and real-world data. Our framework integrates OpenAI's interface to embed texts and project them into external categories while utilizing distance metrics to evaluate discrepancies. To identify significant disparities, we design prompts to analyze three types of relationships: identical information, logical relationships and potential relationships. Our empirical analysis shows that 22.1% of samples exhibit substantial semantic discrepancies, and 38.1% of the samples with significant differences contain at least one of the identified relationships. The average processing time for each sample does not exceed 4 s, and all processes can be adjusted based on actual needs. Backtesting results and comparisons with traditional NLP methods further demonstrate that our proposed method is both effective and robust.
KW - discrepancy analysis
KW - distance metrics
KW - insurance claim settlement
KW - large language model
KW - risk management
UR - http://www.scopus.com/inward/record.url?scp=105000944564&partnerID=8YFLogxK
U2 - 10.1111/jori.70004
DO - 10.1111/jori.70004
M3 - Article
AN - SCOPUS:105000944564
SN - 0022-4367
VL - 92
SP - 505
EP - 535
JO - Journal of Risk and Insurance
JF - Journal of Risk and Insurance
IS - 2
ER -