Search papers, labs, and topics across Lattice.
1
0
3
4
Socratic tutors can be effectively trained via RL by decoupling student cognitive states, using generative pedagogical rewards, and stabilizing multi-objective optimization.