Search papers, labs, and topics across Lattice.
Northwestern
3
0
7
Socratic tutors can be effectively trained via RL by decoupling student cognitive states, using generative pedagogical rewards, and stabilizing multi-objective optimization.
VLMs struggle to meaningfully ground numerical outputs in spatial contexts, often performing at chance levels in critical tasks.
Video diffusion models can now generate physically plausible 4D worlds thanks to a new pipeline that combines pretraining, supervised fine-tuning, and reinforcement learning.