VLMs may ace the color coverage test, but they flunk the "do as I say, not as I do" test, routinely ignoring their own stated reasoning rules in ways that humans don't.