Baidu Inc, Hong Kong University of Science and Technology (Guangzhou)
VLMs that ace digital document parsing benchmarks still stumble badly when faced with real-world scanned, warped, or photographed documents, revealing a significant "reality gap."