Search papers, labs, and topics across Lattice.
2
0
5
Visual context barely affects human sentence acceptability judgments, but throws LLMs for a loop, widening the gap between their internal representations and acceptability predictions.
LLMs struggle with basic GPS coordinate reasoning, often failing at geometric computations despite showing some understanding of real-world geography.