Search papers, labs, and topics across Lattice.
The paper introduces MisEdu-RAG, a dual-hypergraph RAG framework designed to assist novice math teachers in diagnosing and remediating student misconceptions. It organizes pedagogical knowledge and student mistake cases into separate hypergraphs, enabling a two-stage retrieval process to gather relevant evidence for response generation. Experiments on the MisstepMath dataset demonstrate that MisEdu-RAG improves token-F1 by 10.95% and response quality by up to 15.3% compared to baseline models, with a pilot study confirming its practical applicability.
Novice math teachers can now get AI-powered help diagnosing and fixing student misconceptions, thanks to a new RAG framework that links pedagogical principles with real-world student errors.
Novice math teachers often encounter students' mistakes that are difficult to diagnose and remediate. Misconceptions are especially challenging because teachers must explain what went wrong and how to solve them. Although many existing large language model (LLM) platforms can assist in generating instructional feedback, these LLMs loosely connect pedagogical knowledge and student mistakes, which might make the guidance less actionable for teachers. To address this gap, we propose MisEdu-RAG, a dual-hypergraph-based retrieval-augmented generation (RAG) framework that organizes pedagogical knowledge as a concept hypergraph and real student mistake cases as an instance hypergraph. Given a query, MisEdu-RAG performs a two-stage retrieval to gather connected evidence from both layers and generates a response grounded in the retrieved cases and pedagogical principles. We evaluate on \textit{MisstepMath}, a dataset of math mistakes paired with teacher solutions, as a benchmark for misconception-aware retrieval and response generation across topics and error types. Evaluation results on \textit{MisstepMath} show that, compared with baseline models, MisEdu-RAG improves token-F1 by 10.95\% and yields up to 15.3\% higher five-dimension response quality, with the largest gains on \textit{Diversity} and \textit{Empowerment}. To verify its applicability in practical use, we further conduct a pilot study through a questionnaire survey of 221 teachers and interviews with 6 novices. The findings suggest that MisEdu-RAG provides diagnosis results and concrete teaching moves for high-demand misconception scenarios. Overall, MisEdu-RAG demonstrates strong potential for scalable teacher training and AI-assisted instruction for misconception handling. Our code is available on GitHub: https://github.com/GEMLab-HKU/MisEdu-RAG.