The paper introduces AgenticSum, an agentic framework for clinical text summarization that decomposes the process into context selection, generation, verification via attention grounding, and targeted correction. This framework aims to improve factual consistency in LLM-generated summaries of clinical text by identifying and revising weakly supported spans. Experiments on two public datasets demonstrate that AgenticSum outperforms vanilla LLMs and other baselines across reference-based metrics, LLM-as-a-judge assessment, and human evaluation, indicating the effectiveness of structured, agentic design for clinical note summarization.
Clinical text summarization gets a boost in factual accuracy with AgenticSum, an agentic framework that selectively revises weakly supported spans identified via internal attention grounding.
Large language models (LLMs) offer substantial promise for automating clinical text summarization, yet maintaining factual consistency remains challenging due to the length, noise, and heterogeneity of clinical documentation. We present AgenticSum, an inference-time, agentic framework that separates context selection, generation, verification, and targeted correction to reduce hallucinated content. The framework decomposes summarization into coordinated stages that compress task-relevant context, generate an initial draft, identify weakly supported spans using internal attention grounding signals, and selectively revise flagged content under supervisory control. We evaluate AgenticSum on two public datasets, using reference-based metrics, LLM-as-a-judge assessment, and human evaluation. Across these measures, AgenticSum demonstrates consistent improvements compared to vanilla LLMs and other strong baselines. Our results indicate that structured, agentic design with targeted correction offers an effective inference-time solution for improving clinical note summarization using LLMs.
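The four stages named in the abstract (context selection, generation, verification, targeted correction) can be illustrated with a small control-flow sketch. This is a toy under stated assumptions, not the authors' implementation: every function name here is hypothetical, the generator and reviser are passed in as plain callables where a real system would call an LLM, and the verification step uses simple token overlap as a stand-in for the attention grounding signals the paper describes.

```python
import re
from typing import Callable, List, Set


def tokens(text: str) -> Set[str]:
    """Lowercased word tokens; a deliberately crude tokenizer."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def select_context(sentences: List[str], focus_terms: Set[str], k: int) -> List[str]:
    """Stage 1 (assumed form): keep the k sentences sharing the most terms
    with focus_terms, returned in their original order."""
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: len(tokens(sentences[i]) & focus_terms),
        reverse=True,
    )
    return [sentences[i] for i in sorted(ranked[:k])]


def support_score(span: str, context: List[str]) -> float:
    """Stage 3 proxy: fraction of the span's tokens that occur in the context.
    NOTE: lexical overlap is a stand-in, NOT the paper's attention grounding."""
    span_toks = tokens(span)
    if not span_toks:
        return 1.0
    ctx_toks = set().union(*(tokens(s) for s in context))
    return len(span_toks & ctx_toks) / len(span_toks)


def summarize(
    sentences: List[str],
    focus_terms: Set[str],
    generate: Callable[[List[str]], List[str]],
    revise: Callable[[str, List[str]], str],
    k: int = 3,
    threshold: float = 0.5,
) -> List[str]:
    """Run the four stages; only spans scoring below threshold are revised."""
    context = select_context(sentences, focus_terms, k)   # Stage 1
    draft = generate(context)                             # Stage 2
    final = []
    for span in draft:
        if support_score(span, context) < threshold:      # Stage 3
            span = revise(span, context)                  # Stage 4
        if span:  # an empty string means the reviser removed the span
            final.append(span)
    return final


# Illustrative data and stub callables (all invented for this example).
note = [
    "Patient admitted with chest pain.",
    "Family visited in the afternoon.",
    "Troponin was elevated on admission.",
    "Started on aspirin and heparin.",
]
focus = {"chest", "pain", "troponin", "aspirin", "heparin"}


def fake_generate(context: List[str]) -> List[str]:
    # Second span is deliberately unsupported by the note.
    return ["Patient admitted with chest pain.",
            "Underwent emergency bypass surgery."]


def drop_span(span: str, context: List[str]) -> str:
    # Simplest possible reviser: delete the weakly supported span.
    return ""


result = summarize(note, focus, fake_generate, drop_span, k=3)
print(result)
```

In this sketch the supported span passes through untouched and only the flagged span reaches the reviser, which mirrors the idea of targeted correction rather than regenerating the whole summary. The threshold of 0.5 and the delete-only reviser are arbitrary choices made for illustration.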