Search papers, labs, and topics across Lattice.
2
0
4
Multimodal agents can now reason, plan, and execute actions more effectively by integrating perception as a core component, not just an auxiliary interface.
MLLMs that excel at medical image diagnosis may be relying on shortcuts, undermining their trustworthiness for clinical deployment.