Search papers, labs, and topics across Lattice.
1
0
3
2
Vision-language models struggle to adapt plans based on visual input alone, revealing a critical gap in their ability to use what they see when things don't go as expected.