Search papers, labs, and topics across Lattice.
The paper investigates the application of Large Language Models (LLMs) for automating corrective-action recommendations in insurance claim processing. They fine-tuned pre-trained LLMs using Low-Rank Adaptation (LoRA) on millions of historical warranty claims to generate structured recommendations from unstructured claim narratives. The results demonstrate that domain-specific fine-tuning significantly outperforms general-purpose and prompt-based LLMs, achieving near-identical matches to ground-truth corrective actions in approximately 80% of evaluated cases.
Domain-specific fine-tuning of LLMs on insurance claims data achieves 80% accuracy in generating corrective actions, outperforming general-purpose models and unlocking potential for automation in regulated industries.
While Large Language Models (LLMs) have achieved strong performance on general-purpose language tasks, their deployment in regulated and data-sensitive domains, including insurance, remains limited. Leveraging millions of historical warranty claims, we propose a locally deployed governance-aware language modeling component that generates structured corrective-action recommendations from unstructured claim narratives. We fine-tune pretrained LLMs using Low-Rank Adaptation (LoRA), scoping the model to an initial decision module within the claim processing pipeline to speed up claim adjusters' decisions. We assess this module using a multi-dimensional evaluation framework that combines automated semantic similarity metrics with human evaluation, enabling a rigorous examination of both practical utility and predictive accuracy. Our results show that domain-specific fine-tuning substantially outperforms commercial general-purpose and prompt-based LLMs, with approximately 80% of the evaluated cases achieving near-identical matches to ground-truth corrective actions. Overall, this study provides both theoretical and empirical evidence to prove that domain-adaptive fine-tuning can align model output distributions more closely with real-world operational data, demonstrating its promise as a reliable and governable building block for insurance applications.