1 Zhejiang University, 2 Qwen Team, Alibaba Group, 3 Shanghai Jiao Tong University, 4 Tsinghua University; zuozhu.liu@zju.edu.cn
Tsinghua AI2
Forget real-world video datasets: training VLMs on just 7.7K synthetic videos with temporal primitives beats 165K real-world examples, unlocking surprisingly effective transfer learning for video reasoning.
Directly modeling 3D geometry in dental scans unlocks a 9.58% accuracy boost in multi-disease diagnosis compared to methods relying on 2D or multi-view image representations.