Context-augmented RL lets smaller MLLMs punch *way* above their weight, rivaling much larger models on reasoning tasks while dodging reward hacking.
By decoupling coarse action consistency from fine-grained variations, PF-DAG achieves state-of-the-art imitation learning performance in robotic manipulation tasks.