Search papers, labs, and topics across Lattice.
Shanghai Jiao Tong University
1
0
3
Image editing models can learn to solve visual planning puzzles with finetuning, but still lag far behind humans in zero-shot efficiency, revealing a key gap in neural visual reasoning.