Search papers, labs, and topics across Lattice.
2
0
4
Tabular reasoning gets a boost: decoupling high-level visual perception from granular symbolic reasoning yields better accuracy, especially on large tables, even with smaller models.
Offline RL can be made more robust to distribution shift by directly optimizing against worst-case transition dynamics within an uncertainty set, leading to policies that avoid unreliable out-of-distribution actions.