The University of Western Australia
Current LLM benchmarks hide critical reasoning failures in long, multimodal documents, which BRIDGE exposes through step-level evaluation.
LVLMs struggle with causal reasoning not because they lack the capacity, but because they lack structured guidance on what's relevant.