Shanghai Jiao Tong University
Omni-modal LLMs can ace captioning and QA, but AVID reveals they're surprisingly bad at spotting audio-visual inconsistencies in videos, a crucial skill for trustworthy AI.
Forget LLMs – generating realistic 3D scenes might just hinge on learning how objects relate to each other locally.
Forget hand-crafted LTV pipelines: AgentLTV uses LLM-driven agents to automatically search for and evolve high-performing models, adapting to diverse data patterns and improving prediction accuracy, especially for the critical high-value and negative-LTV segments.