Ant Digital Technologies
Visual-Seeker outperforms proprietary models on multimodal search by actively engaging with visual details.
Current multimodal browsing agents are surprisingly bad at using visual information on webpages, with even top models scoring below 50% accuracy on a new visual-native search benchmark.