Search papers, labs, and topics across Lattice.
The paper introduces TCM-DiffRAG, a novel retrieval-augmented generation (RAG) framework that integrates knowledge graphs (KG) and chain-of-thought (CoT) prompting to improve LLM performance in personalized TCM syndrome differentiation. TCM-DiffRAG leverages both universal and personalized knowledge graphs to align general knowledge with individual clinical reasoning. Experiments on three TCM datasets demonstrate that TCM-DiffRAG significantly outperforms native LLMs, supervised fine-tuned LLMs, and other RAG baselines, particularly for non-Chinese LLMs.
TCM diagnosis gets a boost: RAG enhanced with knowledge graphs and chain-of-thought reasoning leaps ahead of fine-tuning and other RAG methods, particularly for non-Chinese LLMs tackling individualized cases.
Background: Retrieval augmented generation (RAG) technology can empower large language models (LLMs) to generate more accurate, professional, and timely responses without fine tuning. However, due to the complex reasoning processes and substantial individual differences involved in traditional Chinese medicine (TCM) clinical diagnosis and treatment, traditional RAG methods often exhibit poor performance in this domain. Objective: To address the limitations of conventional RAG approaches in TCM applications, this study aims to develop an improved RAG framework tailored to the characteristics of TCM reasoning. Methods: We developed TCM-DiffRAG, an innovative RAG framework that integrates knowledge graphs (KG) with chains of thought (CoT). TCM-DiffRAG was evaluated on three distinctive TCM test datasets. Results: The experimental results demonstrated that TCM-DiffRAG achieved significant performance improvements over native LLMs. For example, the qwen-plus model achieved scores of 0.927, 0.361, and 0.038, which were significantly enhanced to 0.952, 0.788, and 0.356 with TCM-DiffRAG. The improvements were even more pronounced for non-Chinese LLMs. Additionally, TCM-DiffRAG outperformed directly supervised fine-tuned (SFT) LLMs and other benchmark RAG methods. Conclusions: TCM-DiffRAG shows that integrating structured TCM knowledge graphs with Chain of Thought based reasoning substantially improves performance in individualized diagnostic tasks. The joint use of universal and personalized knowledge graphs enables effective alignment between general knowledge and clinical reasoning. These results highlight the potential of reasoning-aware RAG frameworks for advancing LLM applications in traditional Chinese medicine.