Xinhu Zheng

Research focus

Multimodal Models (2), Robotics & Embodied AI (2), Computer Vision (2), World Models & Planning (1), Data Curation & Synthetic Data (1)

Frequent co-authors

Zhaonian Kuang (2), Rui Ding (2), Meng Yang (2), Gang Hua (2)

Papers (3)

Mar 18, 2026

P$^{3}$Nav: End-to-End Perception, Prediction and Planning for Vision-and-Language Navigation

VLN agents can navigate more effectively by predicting their future states and proactively planning based on forecasted semantic map cues, rather than relying solely on historical context.

Tianfu Li, Tian Li, Wenbo Chen +4

Multimodal Models, Robotics & Embodied AI, World Models & Planning

Mar 5, 2026

Amazon Science · Mar 5, 2026 · also D paradigms represented by BEVDepth

CoIn3D: Revisiting Configuration-Invariant Multi-Camera 3D Object Detection

MC3D models can now generalize to unseen camera configurations thanks to a new framework that explicitly accounts for spatial prior discrepancies.

Zhaonian Kuang, Rui Ding, Haotian Wang +3

Computer Vision, Multimodal Models, Robotics & Embodied AI

Feb 24, 2026

Amazon Science · Feb 24, 2026 · also D paradigms represented by BEVDepth

Object-Scene-Camera Decomposition and Recomposition for Data-Efficient Monocular 3D Object Detection

Stop training your M3OD models on the same old entangled data: this method decomposes and recomposes objects, scenes, and camera poses to generate diverse training examples on the fly, boosting performance without needing more real-world data.

Zhaonian Kuang, Rui Ding, Meng Yang +2

Computer Vision, Data Curation & Synthetic Data, Training Efficiency & Optimization
