Southeast University, Shanghai AI Laboratory
Current multimodal large language models (MLLMs) still struggle to relate images and text when the two are interleaved, which points to a critical gap in real-world multimodal understanding.