Sicong Jiang

McGill University

Papers on Lattice

Total citations

Topics

h-index

Research focus

Multimodal Models (3)Computer Vision (2)Eval Frameworks & Benchmarks (2)Robotics & Embodied AI (1)Data Curation & Synthetic Data (1)

Frequent co-authors

Kangan Qian (1)ChuChu Xie (1)Yang Zhong (1)Jingrui Pang (1)

Papers (3)

Apr 20, 2026

Apr 20, 2026·also Tsinghua AI, BAAI, Rimbot

XEmbodied: A Foundation Model with Enhanced Geometric and Physical Cues for Large-Scale Embodied Environments

Endowing VLMs with intrinsic 3D geometric awareness and physical interaction cues via XEmbodied substantially boosts performance on spatial reasoning and embodied tasks, surpassing existing 2D image-text pretrained models.

Kangan Qian, ChuChu Xie, Yang Zhong +12

Computer Vision Multimodal Models Robotics & Embodied AI

Apr 17, 2026

Apr 17, 2026·also McGill, UW-Madison

VEFX-Bench: A Holistic Benchmark for Generic Video Editing and Visual Effects

Current video editing AIs still struggle to balance visual quality, instruction adherence, and localized edits, as revealed by a new benchmark designed to disentangle these factors.

Xiangbo Gao, Sicong Jiang, Bangya Liu +9

Computer Vision Eval Frameworks & Benchmarks Multimodal Models

Mar 28, 2026

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

VLMs can now get a million-scale boost in chart-understanding abilities thanks to a new dataset with paired code, images, data, and reasoning.

Jovana Kondic, Pengyuan Li, Dhiraj Joshi +24

Data Curation & Synthetic Data Eval Frameworks & Benchmarks Multimodal Models

Search

Sicong Jiang

Research focus

Frequent co-authors

Papers (3)