Search papers, labs, and topics across Lattice.
Ant Group, Zhejiang University, Nanjing University
2
0
3
Complex visual queries can significantly elevate the reasoning capabilities of multi-modal large language models, revealing new dimensions in AI's understanding of abstract visual content.
CRAFTQA's ability to dynamically generate custom code functions allows it to tackle complex reasoning tasks that traditional methods cannot handle.