Search papers, labs, and topics across Lattice.
Huawei Cloud BU ♢ Core Contributor ∗ Project Leader
1
0
3
4
Current multimodal agents are surprisingly bad at web browsing, achieving only 36% accuracy on a new benchmark designed to test deep, multi-modal reasoning across web pages.