Search papers, labs, and topics across Lattice.
This paper introduces a novel end-to-end neural network architecture for jointly extracting concepts, assertions, and relations from clinical text, addressing the limitations of independent task settings in prior work. The model jointly optimizes the three stages, enabling direct comparison with existing pipeline approaches. Experiments demonstrate that the joint system significantly outperforms the pipeline baseline, achieving F1 score improvements of +0.3, +1.4, and +3.1 for concept, assertion, and relation extraction, respectively.
Jointly modeling concept, assertion, and relation extraction in clinical text yields substantial performance gains over traditional pipelines, paving the way for more accurate clinical information processing.
Clinical information extraction (e.g., 2010 i2b2/VA challenge) usually presents tasks of concept recognition, assertion classification, and relation extraction. Jointly modeling the multi-stage tasks in the clinical domain is an underexplored topic. The existing independent task setting (reference inputs given in each stage) makes the joint models not directly comparable to the existing pipeline work. To address these issues, we define a joint task setting and propose a novel end-to-end system to jointly optimize three-stage tasks. We empirically investigate the joint evaluation of our proposal and the pipeline baseline with various embedding techniques: word, contextual, and in-domain contextual embeddings. The proposed joint system substantially outperforms the pipeline baseline by +0.3, +1.4, +3.1 for the concept, assertion, and relation F1. This work bridges joint approaches and clinical information extraction. The proposed approach could serve as a strong joint baseline for future research. The code is publicly available.