Search papers, labs, and topics across Lattice.
University of Waterloo
4
0
8
DR-DCI achieves a remarkable 73.3% accuracy in agentic search tasks while efficiently scaling from 100K to 10M documents, outperforming traditional methods.
Even the top-performing MLLMs struggle with visual reasoning, achieving only 64% accuracy on a benchmark designed to reflect real-world diversity.
MiniMax-M2 proves that massive parameter counts don't always translate to better agentic performance; strategic activation of a smaller subset can unlock frontier-level intelligence.
Today's visual generation models are often evaluated on the wrong things, leading to inflated performance claims that mask critical failures in spatial reasoning, temporal consistency, and causal understanding.