Search papers, labs, and topics across Lattice.
This paper introduces PACT (Periodic Anchor Consensus Training), a novel framework that enhances the training of medical dialogue agents by integrating supervised multi-paradigm synthesis with consensus-based Branch training. By utilizing complete electronic medical records (EMRs) while restricting access to patient-visible information, PACT effectively generates validated dialogues across four diagnostic reasoning paradigms without compromising sensitive data. Experimental results demonstrate that PACT outperforms existing medical dialogue systems in both diagnostic outcomes and consultation processes, marking a significant advancement in the field of AI-driven clinical diagnosis.
PACT achieves state-of-the-art performance in medical dialogue systems by leveraging a unique combination of multi-paradigm synthesis and consensus training, all while safeguarding patient data.
Clinical diagnosis requires flexible use of multiple reasoning paradigms under incomplete patient information. Existing LLM-based medical agents show strong medical reasoning ability, but single-paradigm or naively mixed dialogue supervision makes these paradigms difficult to learn without interference. We propose \textbf{PACT} (Periodic Anchor Consensus Training), a framework that couples supervised multi-paradigm dialogue synthesis with consensus-based Branch training. At the data level, \textbf{DPS} (Doctor-Patient-Supervisor) uses complete electronic medical records (EMRs) for quality control while keeping the doctor agent restricted to patient-visible information. This produces validated dialogues under four diagnostic reasoning paradigms without leaking hidden clinical answers. At the training level, PACT trains one paradigm-specific LoRA Branch per paradigm and periodically aggregates Branches into a shared Anchor through sign consensus. We further construct a dynamic multi-turn Chinese medical diagnosis benchmark for interactive consultation. Experiments show that PACT achieves state-of-the-art performance among compared proprietary, medical-specialized, and task-adapted baselines on diagnostic outcome and consultation-process metrics.