Search papers, labs, and topics across Lattice.
This paper introduces a probabilistic machine learning model to identify instances of mechanistic reasoning in student team conversations. The model uses an inductive bias to guide probabilistic dynamics, incorporating both individual utterances and group contributions. Results demonstrate that this inductive bias improves the model's generalization to new students and discussion contexts, suggesting inherent interpretability.
Inductive biases make machine learning models better at spotting mechanistic reasoning in student discussions, even when those students are tackling new problems.
STEM education researchers are often interested in identifying moments of students'mechanistic reasoning for deeper analysis, but have limited capacity to search through many team conversation transcripts to find segments with a high concentration of such reasoning. We offer a solution in the form of an interpretable machine learning model that outputs time-varying probabilities that individual students are engaging in acts of mechanistic reasoning, leveraging evidence from their own utterances as well as contributions from the rest of the group. Using the toolkit of intentionally-designed probabilistic models, we introduce a specific inductive bias that steers the probabilistic dynamics toward desired, domain-aligned behavior. Experiments compare trained models with and without the inductive bias components, investigating whether their presence improves the desired model behavior on transcripts involving never-before-seen students and a novel discussion context. Our results show that the inductive bias improves generalization -- supporting the claim that interpretability is built into the model for this task rather than imposed post hoc. We conclude with practical recommendations for STEM education researchers seeking to adopt the tool and for ML researchers aiming to extend the model's design. Overall, we hope this work encourages the development of mechanistically interpretable models that are understandable and controllable for both end users and model designers in STEM education research.