Latticethe structure behind the noise

Papers Digest Topics Selected Labs Collections FAQ

Created by Flynn Lachendro

Papers Digest Topics Labs Saved

Search

Search papers, labs, and topics across Lattice.

Built by Flynn Lachendro·𝕏 / Twitter·RSS··FAQ·Glossary·Privacy

Hu Xu | Lattice

Hu Xu

Papers on Lattice

2

Total citations

0

Topics

4

h-index

0

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (2)Eval Frameworks & Benchmarks (1)Computer Vision (1)Robotics & Embodied AI (1)

Frequent co-authors

Yida Yin (1)H. Krishnakumar (1)Chung Peng Lee (1)Boya Zeng (1)

Papers (2)

Jun 4, 2026

1w ago·also Waterloo

WorldBench: A Challenging and Visually Diverse Multimodal Reasoning Benchmark

Even the top-performing MLLMs struggle with visual reasoning, achieving only 64% accuracy on a benchmark designed to reflect real-world diversity.

Yida Yin, H. Krishnakumar, Chung Peng Lee +9

Eval Frameworks & Benchmarks Multimodal Models

May 21, 2026

Meta AI3w ago·also BAIR, NYU

Cambrian-P: Pose-Grounded Video Understanding

Camera pose, largely ignored in video LLMs, unlocks significant gains in spatial reasoning and even improves general video QA when used as a lightweight supervisory signal.

Jihan Yang, Zifan Zhao, Xichen Pan +5

Computer Vision Multimodal Models Robotics & Embodied AI

Wenhao Chai (1)

Shengbang Tong (1)