This paper introduces an automated classification framework for geriatric Cardiovascular Risk Management using unstructured Electronic Health Records (EHRs). The study benchmarks classical machine learning, specialized deep learning architectures, and general-purpose LLMs on longitudinal Dutch clinical narratives from 3,482 patients. Results show that a custom Transformer architecture with hierarchical attention outperforms both traditional methods and LLMs, achieving the highest F1-scores and Matthews Correlation Coefficients.
Forget zero-shot LLMs for medical risk stratification: a custom Transformer with hierarchical attention beats them at extracting insights from long-range clinical narratives.
To overcome the limitations of manual administrative coding in geriatric Cardiovascular Risk Management, this study introduces an automated classification framework leveraging unstructured Electronic Health Records (EHRs). Using a dataset of 3,482 patients, we benchmarked three distinct modeling paradigms on longitudinal Dutch clinical narratives: classical machine learning baselines, specialized deep learning architectures optimized for large-context sequences, and general-purpose generative Large Language Models (LLMs) in a zero-shot setting. Additionally, we evaluated a late fusion strategy to integrate unstructured text with structured medication embeddings and anthropometric data. Our analysis reveals that the custom Transformer architecture outperforms both traditional methods and generative LLMs, achieving the highest F1-scores and Matthews Correlation Coefficients. These findings underscore the critical role of specialized hierarchical attention mechanisms in capturing long-range dependencies within medical texts, presenting a robust, automated alternative to manual workflows for clinical risk stratification.
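For readers unfamiliar with the two reported metrics, the following is a minimal sketch of how the F1-score and the Matthews Correlation Coefficient (MCC) are computed from binary confusion-matrix counts. The function names and the counts are hypothetical illustrations, not values or code from the study; they only show why the two metrics can disagree when classes are imbalanced.

```python
import math


def f1_score(tp: int, fp: int, fn: int) -> float:
    """F1 = 2*TP / (2*TP + FP + FN); returns 0.0 when undefined."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    """MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN))."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Hypothetical counts (NOT from the paper): a classifier that labels
# almost every patient as positive on a 90/10 imbalanced sample.
tp, tn, fp, fn = 90, 1, 9, 0

f1 = f1_score(tp, fp, fn)
m = mcc(tp, tn, fp, fn)
print(f"F1 = {f1:.3f}, MCC = {m:.3f}")
```

In this made-up case F1 stays high because it ignores true negatives, while MCC is much lower because it accounts for all four cells of the confusion matrix, which is one common reason for reporting both.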