Haoang Li

X. Meng, P. Hou, and H. Li are with the Thrust of Robotics and Autonomous Systems, Hong Kong University of Science and Technology (Guangzhou), Guangzhou, China. Z. Zhao and J. Civera are with the University of Zaragoza, Zaragoza, Spain. D. Cremers is with the School of Computation, Information and Technology, Technical University of Munich, Munich, Germany. H. Wang is with the School of Automation and IntelligentSensing, Shanghai Jiao Tong University, Shanghai 200240, China (e-mail: wanghesheng @sjtu.edu.cn)

Papers on Lattice

Total citations

Topics

h-index

Research focus

Robotics & Embodied AI (7)Multimodal Models (5)World Models & Planning (3)Computer Vision (3)

Frequent co-authors

Haoang Li (3)Wenxuan Song (2)Zhenjun Zhao (2)Tianfu Li (1)

Papers (7)

Mar 18, 2026

P$^{3}$Nav: End-to-End Perception, Prediction and Planning for Vision-and-Language Navigation

VLN agents can navigate more effectively by predicting their future states and proactively planning based on forecasted semantic map cues, rather than relying solely on historical context.

Tianfu Li, Tian Li, Wenbo Chen +4

Multimodal Models Robotics & Embodied AI World Models & Planning

Mar 18, 2026

AERR-Nav: Adaptive Exploration-Recovery-Reminiscing Strategy for Zero-Shot Object Navigation

Robots can now nimbly navigate complex, multi-floor environments without prior training, thanks to a new strategy that dynamically switches between exploration, recovery, and memory recall.

Jingzhi Huang, Jing Huang, Junkai Huang +4

Multimodal Models Robotics & Embodied AI Tool Use & Agents

Mar 5, 2026

Mar 5, 2026·also Dexmal

Omni-Manip: Beyond-FOV Large-Workspace Humanoid Manipulation with Omnidirectional 3D Perception

Humanoid robots can now nimbly manipulate objects across much larger workspaces thanks to a LiDAR-powered perception system that eliminates the need for constant repositioning.

Pei Qu, Yufei Jia, Ziyun Liu +2

Computer Vision Multimodal Models Robotics & Embodied AI

Feb 26, 2026

Feb 26, 2026·also Galbot, TU Munich, Xidian

Rethinking the Practicality of Vision-language-action Model: A Comprehensive Benchmark and An Improved Baseline

A practical VLA model, LLaVA-VLA, achieves strong generalization and versatility on a new benchmark, CEBench, while running on consumer-grade GPUs, eliminating the need for costly pre-training.

Wenxuan Song, Jiayi Chen, Xiaoquan Sun +11

Eval Frameworks & Benchmarks Multimodal Models Robotics & Embodied AI

Feb 25, 2026

Feb 25, 2026·also University of Zaragoza

Dream-SLAM: Dreaming the Unseen for Active SLAM in Dynamic Environments

By "dreaming" plausible scene completions, Dream-SLAM enables robots to navigate dynamic environments more effectively, achieving better localization, mapping, and exploration than existing methods.

Xiangqi Meng, Pengxu Hou, Zhenjun Zhao +1

Computer Vision Robotics & Embodied AI World Models & Planning

Feb 19, 2026

Tsinghua AIFeb 19, 2026·also NTU, PKU, The Fin AI, TU Munich +2

FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment

By aligning latent representations with multiple visual foundation models, FRAPPE offers a more scalable and data-efficient way to imbue generalist robotic policies with robust world-awareness.

Han Zhao, Jingbo Wang, Wenxuan Song +9

Multimodal Models Robotics & Embodied AI World Models & Planning

Feb 16, 2026

University of ZaragozaFeb 16, 2026·also Kwkfk, TU Munich, Westlake

Advances in Global Solvers for 3D Vision

Certifiably optimal solutions to 3D vision problems are now within reach, but choosing the right global solver (BnB, CR, or GNC) requires navigating a complex trade-off between optimality, robustness, and scalability.

Zhenjun Zhao, Bangyan Liao, Yingping Zeng +4

Computer Vision Robotics & Embodied AI

Search

Haoang Li

Research focus

Frequent co-authors

Papers (7)