Search papers, labs, and topics across Lattice.
Princeton University
1
0
2
Even the top-performing MLLMs struggle with visual reasoning, achieving only 64% accuracy on a benchmark designed to reflect real-world diversity.