Zirui Song

Mohamed bin Zayed University of Artificial Intelligence; University of Technology Sydney

Publication activity (papers/week, last 8 weeks)

Research focus

Eval Frameworks & Benchmarks (2), Multimodal Models (2), Computer Vision (1), Natural Language Processing (1)

Frequent co-authors

Xiuying Chen (2), Qian Jiang (1), Zhecheng Shi (1), Jingpu Yang (1)

Papers (3)

Jul 9, 2026

4d ago

OmniFood-Bench: Evaluating VLMs for Nutrient Reasoning and Personalized Health Advice

VLMs may ace dish recognition but often falter in delivering safe dietary advice, revealing a critical gap in their practical application for health management.

Qian Jiang, Zhecheng Shi, Jingpu Yang +2

Eval Frameworks & Benchmarks, Multimodal Models

Jul 2, 2026

Xianhui Meng +12 · 1w ago · also MBZUAI

Text-Driven 3D Indoor Scene Synthesis in Non-Manhattan Environments

SPG-Layout achieves a breakthrough in 3D scene synthesis by generating physically plausible layouts in non-Manhattan environments, outperforming existing methods.

Xianhui Meng, Zirui Song, Yuchen Zhang +10

Computer Vision, Natural Language Processing, World Models & Planning

Apr 30, 2026

FineState-Bench: Benchmarking State-Conditioned Grounding for Fine-grained GUI State Setting

Even the best vision-language models struggle to reliably set fine-grained GUI states, achieving only 33% accuracy on a new benchmark, but targeted visual hints suggest a clear path to improvement.

Fengxian Ji, Jingpu Yang, Zirui Song +4

Eval Frameworks & Benchmarks, Multimodal Models, Tool Use & Agents
