Northeastern University
A hierarchical agent that separates visual and textual contexts drastically improves multi-step reasoning on complex charts, outperforming monolithic MLLMs.
LLMs struggle with causal reasoning when noise is introduced, but explicitly modeling causal graphs can dramatically improve performance and generalization.
Video LLMs don't just get details wrong; they fundamentally distort motion and fabricate entire events, which calls for a new approach to evaluation and mitigation.