Songtao Jiang

Papers on Lattice

Total citations

Topics

h-index

Research focus

Multimodal Models (2)Eval Frameworks & Benchmarks (1)Reasoning & Chain-of-Thought (1)Computer Vision (1)Data Curation & Synthetic Data (1)

Frequent co-authors

Xiaotian Zhang (1)Jianhui Wei (1)Jie Tan (1)Yan Zhang (1)

Papers (2)

Apr 21, 2026

Institute for Genomic BiologyApr 21, 2026·also DeepMind, Department of Mechanical and Aerospace, Department of Mechanical Science and Engineering, HKUST +1

How Far Are Video Models from True Multimodal Reasoning?

Today's best video models achieve near-zero success rates on interactive video generation, revealing a stark gap in multimodal reasoning and physical grounding.

Xiaotian Zhang, Jianhui Wei, Jie Tan +7

Eval Frameworks & Benchmarks Multimodal Models Reasoning & Chain-of-Thought

Mar 18, 2026

Tsinghua AIMar 18, 2026·also DAMO

Learning Transferable Temporal Primitives for Video Reasoning via Synthetic Videos

Forget real-world video datasets: training VLMs on just 7.7K synthetic videos with temporal primitives beats 165K real-world examples, unlocking surprisingly effective transfer learning for video reasoning.

Songtao Jiang, Sibo Song, Chenyi Zhou +7

Computer Vision Data Curation & Synthetic Data Multimodal Models

Search

Songtao Jiang

Research focus

Frequent co-authors

Papers (2)