Zixuan Chen

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (2)Computer Vision (1)Robotics & Embodied AI (1)Distributed Systems & Hardware (1)

Frequent co-authors

Hongyu Ding (1)Sizhuo Zhang (1)Ziming Xu (1)Jinwen Guo (1)

Papers (3)

May 26, 2026

Hongyu Ding +133w ago·also Tsinghua AI, Shanghai Qi Zhi Institute, Sydney

Uni-LaViRA: Language-Vision-Robot Actions Translation for Unified Embodied Navigation

Forget scaling laws – this zero-shot navigation agent beats million-sample trained models by structurally unifying language, vision, and robot actions within the reasoning capabilities of pre-trained MLLMs.

Hongyu Ding, Sizhuo Zhang, Ziming Xu +11

Computer Vision Multimodal Models Robotics & Embodied AI

May 5, 2026

Yixuan Mei +9May 5, 2026

Coral: Cost-Efficient Multi-LLM Serving over Heterogeneous Cloud GPUs

Save up to 2.79x on LLM serving costs by intelligently distributing models across a diverse fleet of cloud GPUs.

Yixuan Mei, Zikun Li, Zixuan Chen +7

Distributed Systems & Hardware Inference & Quantization

Apr 15, 2026

Apr 15, 2026·also Ant Group, CUHK, DUT, Joint Shantou International Eye Center +1

AVID: A Benchmark for Omni-Modal Audio-Visual Inconsistency Understanding via Agent-Driven Construction

Omni-modal LLMs can ace captioning and QA, but AVID reveals they're surprisingly bad at spotting audio-visual inconsistencies in videos, a crucial skill for trustworthy AI.

Zixuan Chen, Depeng Wang, Hao Lin +6

Eval Frameworks & Benchmarks Multimodal Models Speech & Audio

Search

Zixuan Chen

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)