Xiang Wang

Institute for AI Industry Research (AIR), Tsinghua University

Tsinghua AI

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Reasoning & Chain-of-Thought (4)Multimodal Models (3)Recommendation & Information Retrieval (3)Tool Use & Agents (3)

Frequent co-authors

Huaxing Liu (2)Junfeng Fang (2)Yuxin Chen (2)Yi Zhang (2)

Papers (10)

Jun 7, 2026

1w ago·also Tsinghua AI, Shanghai Innovation, ShanghaiTech, Tencent AI

Two Bridges, One Pathway: From VLMs to Generalizable VLAs with Embodied Trajectory-Coupled Data

Gradual bridging with embodied trajectory-coupled data transforms VLMs into robust robot control policies, overcoming significant transfer challenges.

Linqi Yin, Shiduo Zhang, Shenling Qiu +11

Multimodal Models Robotics & Embodied AI

Tsinghua AI1w ago·also NJU, TJU, University of Electronic Science and Technology

ActProbe: Action-Space Probe for Early Failure Detection of Generative Robot Policies

ActProbe predicts robot policy failures before they become visually apparent, enhancing both detection accuracy and operational efficiency in real-world tasks.

Bingjia Huang, Xiang Wang, Liang Mi +5

Robotics & Embodied AI

Jun 4, 2026

OneRec Team +752w ago·also CMU ML, Tsinghua AI, Columbia, HKU +4

OneReason Technical Report

Surprisingly, the "think before answer" paradigm fails to enhance generative recommendation models, prompting a novel approach that redefines how reasoning is integrated into these systems.

OneRec Team, Boyang Ding, Chenglong Chu +73

Reasoning & Chain-of-Thought Recommendation & Information Retrieval

Jun 1, 2026

DAMO2w ago·also Tsinghua AI, CAS, PKU

Do Multimodal Agents Really Benefit from Tool Use? A Systematic Study of Capability Gains

Tool-augmented multimodal agents may appear to excel, but they often rely on learned tool-calling patterns rather than enhanced problem-solving abilities.

Garvin Guo, Donglei Yu, Xiang Wang +4

Multimodal Models Reasoning & Chain-of-Thought Tool Use & Agents

2w ago·also NUS, Tsinghua AI, SMU, USTC

Dynamic Spectral Denoising with Global-Context Attention for Multi-Behavior Recommendation

Noise in multi-behavior recommendation can be effectively mitigated through a novel spectral filtering approach that enhances representation purity and reliability.

Fangqi Zhu, Junfeng Fang, Zhijie Zhang +3

Recommendation & Information Retrieval

May 26, 2026

3w ago·also NUS, Tsinghua AI, BUPT, Meituan +2

VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions

Current LLM agents still struggle to infer and leverage user preferences from fragmented, real-world interactions, revealing a substantial gap between their capabilities and the demands of personalized decision-making.

Yuxin Chen, Yi Zhang, Zhengzhou Cai +8

Eval Frameworks & Benchmarks Recommendation & Information Retrieval Tool Use & Agents

Tsinghua AI3w ago·also HUST

What Makes Chain-of-Thought Work at Probe Time? Local Co-occurrence Rather Than Global Derivation

Chain-of-thought prompting works not because of deep reasoning, but because adjacent tokens nudge the model towards the right answer.

Xiang Wang

Interpretability & Mechanistic Interp Reasoning & Chain-of-Thought

3w ago·also NUS, Tsinghua AI, Meituan, TJU +1

Learning to Act under Noise: Enhancing Agent Robustness via Noisy Environments

LLM agents trained with simulated user and tool noise not only become more robust in messy real-world environments, but also surprisingly improve on clean, idealized benchmarks.

Yuxin Chen, Xiaodong Cai, Junfeng Fang +6

Eval Frameworks & Benchmarks Red-Teaming & Adversarial Robustness Tool Use & Agents

May 21, 2026

May 21, 2026·also Tsinghua AI, George Mason University, NTU

Faithful-MR1: Faithful Multimodal Reasoning via Anchoring and Reinforcing Visual Attention

MLLMs can learn to reason more faithfully by explicitly anchoring visual attention to relevant image regions and reinforcing the use of that evidence during reasoning via counterfactual interventions.

Changyuan Tian, Zhicong Lu, Huaxing Liu +5

Computer Vision Multimodal Models Reasoning & Chain-of-Thought

Apr 9, 2026

Jie Sun +10Apr 9, 2026·also Tsinghua AI

SepSeq: A Training-Free Framework for Long Numerical Sequence Processing in LLMs

LLMs choke on long numerical sequences, but a simple separator token trick can boost accuracy by 35% and cut token costs by 16%—without any training.

Jie Sun, Yu Liu, Lu Han +8

Architecture Design (Transformers, SSMs, MoE)Natural Language Processing

Search

Xiang Wang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (10)