Search papers, labs, and topics across Lattice.
Shanghai AI Laboratory
1
0
3
4
Current MLLMs still struggle to connect the dots between images and text when they're interleaved, highlighting a critical gap in real-world multimodal understanding.