Hang Li

Xiaomi Inc

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (4)Tool Use & Agents (3)Recommendation & Information Retrieval (2)Architecture Design (Transformers, SSMs, MoE) (1)

Frequent co-authors

Zhenbo Luo (2)Yuhan Liu (1)Pei Fu (1)Yukun Qi (1)

Papers (5)

Jun 18, 2026

1d ago·also Corresponding author, Xiaomi Inc

ELVA: Exploring Ranking-Driven Universal Multimodal Retrieval

Treating negative samples based on their similarity to positives leads to a 13.1% boost in retrieval performance, revealing the critical role of grain-level information.

Yuhan Liu, Pei Fu, Hang Li +8

Multimodal Models Recommendation & Information Retrieval

Jun 9, 2026

Kwai Keye Team +501w ago·also Cambridge, GNucleus AI, HKUST, HUST +5

Kwai Keye-VL-2.0 Technical Report

Achieving lossless processing of 256K contexts, Keye-VL-2.0 transforms how we approach long-video understanding and agentic intelligence.

Kwai Keye Team, Bin Wen, Changyi Liu +48

Architecture Design (Transformers, SSMs, MoE)Multimodal Models Tool Use & Agents

Jun 1, 2026

Xiaomi Inc2w ago·also UQ

Whole-Pool Setwise Reranking with Long-Context Language Models

Long-context LLMs can drastically reduce the number of model calls needed for passage re-ranking, achieving efficiency without sacrificing effectiveness.

Hang Li, Chuting Yu, Teerapong Leelanupab +2

Natural Language Processing Recommendation & Information Retrieval

May 29, 2026

3w ago·also ByteDance, Fudan, Xiaomi Inc

Task-Focused Memorization for Multimodal Agents

Forget everything you thought you knew about multimodal agent memory: TaskMem learns what to remember on the fly, boosting VQA accuracy by up to 7% without even looking at the raw video.

Tao Zou, Yichen He, Tian Qiu +2

Multimodal Models Robotics & Embodied AI Tool Use & Agents

Apr 15, 2026

Yuanlei Zheng +9Apr 15, 2026·also BIT, Xiaomi Inc

Doc-V*:Coarse-to-Fine Interactive Visual Reasoning for Multi-Page Document VQA

Doc-V* demonstrates that an agentic approach to multi-page document VQA, using active navigation and structured memory, can significantly outperform retrieval-augmented generation, especially in out-of-domain scenarios.

Yuanlei Zheng, Pei Fu, Hang Li +7

Multimodal Models Reasoning & Chain-of-Thought Tool Use & Agents

Search

Hang Li

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (5)