Search papers, labs, and topics across Lattice.
DxEvolve, a self-evolving diagnostic agent, was developed to mimic the dynamic cue acquisition and continuous expertise accumulation inherent in clinical diagnosis. It autonomously requests examinations and externalizes clinical experience as diagnostic cognition primitives. Experiments on the MIMIC-CDM benchmark showed DxEvolve improved diagnostic accuracy by 11.2% over backbone models, reaching 90.4% on a reader-study subset, comparable to clinician performance.
Clinical AI can achieve clinician-level diagnostic accuracy and continuous improvement via a self-evolving framework that actively learns from clinical experience.
Clinical diagnosis is a complex cognitive process, grounded in dynamic cue acquisition and continuous expertise accumulation. Yet most current artificial intelligence (AI) systems are misaligned with this reality, treating diagnosis as single-pass retrospective prediction while lacking auditable mechanisms for governed improvement. We developed DxEvolve, a self-evolving diagnostic agent that bridges these gaps through an interactive deep clinical research workflow. The framework autonomously requisitions examinations and continually externalizes clinical experience from increasing encounter exposure as diagnostic cognition primitives. On the MIMIC-CDM benchmark, DxEvolve improved diagnostic accuracy by 11.2% on average over backbone models and reached 90.4% on a reader-study subset, comparable to the clinician reference (88.8%). DxEvolve improved accuracy on an independent external cohort by 10.2% (categories covered by the source cohort) and 17.1% (uncovered categories) compared to the competitive method. By transforming experience into a governable learning asset, DxEvolve supports an accountable pathway for the continual evolution of clinical AI.