Liang Wang

University of Chinese Academy of Sciences

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (5)Robotics & Embodied AI (5)RLHF & Preference Learning (3)Computer Vision (3)

Frequent co-authors

Peiyan Li (4)Jiabing Yang (3)Yixiang Chen (3)Qisen Ma (3)

Papers (14)

Jul 20, 2026

6d ago·also McGovern Medical School

Style over Substance: A Shortcut Audit of Emotion-Description Preference Evaluation

A simple logistic regression can match the performance of advanced models in evaluating emotion descriptions, raising questions about the validity of current multimodal benchmarks.

Jiabing Yang, Yixiang Chen, Yixiang Chen +7

Eval Frameworks & Benchmarks Multimodal Models RLHF & Preference Learning

Jul 14, 2026

1w ago·also FiveAges

FlowWAM: Optical Flow as a Unified Action Representation for World Action Models

FlowWAM achieves a remarkable 92.94% success rate in manipulation tasks by harnessing optical flow as a video-native action representation.

Yixiang Chen, Peiyan Li, Qisen Ma +12

Computer Vision World Models & Planning

Jul 13, 2026

Hanting Suo +31w ago·also CAS

Stop to Decide: Latency-Aware Proprioceptive Navigation Primitives for Mapping-Free Quadruped Inspection

Climb-settle cadence can eliminate overshoot errors in quadruped stair navigation, outperforming traditional methods even at lower loop rates.

Hanting Suo, Haonan Yan, Liang Wang +1

Robotics & Embodied AI

Jun 30, 2026

3w ago·also Tsinghua AI, CAS, OPPO, SJTU

PrISM-IQA: Image Quality Assessment Made Practical for Smartphone Photography

Transforming image quality assessment from a single score to a nuanced diagnosis of multiple quality issues could revolutionize smartphone ISP tuning.

Shuyan Zhai, Jiaqi He, Weixia Zhang +4

Computer Vision Multimodal Models

3w ago·also CAS

DrivingDepth: Sparse-Prompted Pixel-wise Scale Correction for Driving Depth Estimation

DrivingDepth achieves state-of-the-art depth estimation by leveraging sparse LiDAR to fine-tune pixel-wise scale without sacrificing geometric coherence.

Chi Huang, Wenhao Zhang, Hao Li +3

Computer Vision Robotics & Embodied AI

Jun 29, 2026

3w ago·also BitInf Ltd, Nanjing Tech University, WHU

Arko-T: A Foundation Model for Text-to-Structured 3D Generation

Arko-T achieves superior performance in text-to-structured 3D generation while being ten times more cost-effective than leading models.

Liang Wang, Zhaoyang Xi, Zekai Xiang +4

Code Generation & Program Synthesis Multimodal Models

Jun 25, 2026

Jun 25, 2026·also FiveAges

E-TTS: A New Embodied Test-Time Scaling Framework for Robotic Manipulation

E-TTS achieves up to a 33.14% performance boost in robotic manipulation by leveraging historical context and iterative refinement, redefining how we approach test-time scaling.

Peiyan Li, Tingyu Yuan, Xiangnan Wu +3

Reasoning & Chain-of-Thought Robotics & Embodied AI

Jun 25, 2026

Improving Vision-Language-Action Model Fine-Tuning with Structured Stage and Keyframe Supervision

Structured supervision can boost VLA model performance by over 50% in complex robotic tasks, transforming how we approach fine-tuning in manipulation.

Yixiang Chen, Kai Wang, Jiabing Yang +3

Multimodal Models Robotics & Embodied AI

Jun 24, 2026

Jun 24, 2026·also Microsoft Research, PKU

BitNet Text Embeddings

Achieving comparable performance to full-precision models, BITEMBED slashes storage costs and enhances embedding efficiency with extreme low-bit quantization.

Liang Wang, Ting Song, Shaohan Huang +1

Inference & Quantization Recommendation & Information Retrieval

Jun 9, 2026

Jun 9, 2026·also Bonn, of Artificial Intelligence (TeleAI), Oxford, Shenzhen +1

GUIDE: Goal-Initialized Directional Understanding for End-to-End Visual Navigation

Robots can now navigate complex environments without continuous goal updates, relying solely on their internal spatial memory.

Liang Wang, Jin Jin, KanZhong Yao +4

Multimodal Models Robotics & Embodied AI

Jun 8, 2026

Han Huang +3Jun 8, 2026·also CAS

CRANE: Knowledge Editing for Reasoning MLLMs

CRANE achieves a remarkable 96.9% Grounded Success in knowledge editing for reasoning MLLMs, overcoming traditional failure modes that plague existing methods.

Han Huang, Mengqi Zhang, Qiang Liu +1

Eval Frameworks & Benchmarks Reasoning & Chain-of-Thought

Jun 1, 2026

Jun 1, 2026·also Fudan, Shanghai Innovation

Beyond Isolated Behaviors: Hierarchical User Modeling for LLM Personalization

Personalizing LLMs through a sociologically grounded framework reveals the hierarchical nature of user behavior, leading to significant performance gains across tasks.

Liang Wang, Xiaoyou Liu, Tiannan Wang +1

Natural Language Processing RLHF & Preference Learning

Jun 1, 2026·also ZJU

Learning When Not to Act: Mitigating Tool Abuse in Agentic Reinforcement Learning

EAPO enables agents to learn when to forgo tool use, achieving a remarkable 10.45% performance boost while slashing tool calls by over 18%.

Liuji Chen, Dianxing Tang, Xing Shi +3

RLHF & Preference Learning Scalable Oversight & Alignment Theory Tool Use & Agents

May 28, 2026

May 28, 2026·also Ant Group, CAS, Fudan, SUSTech

GAPD: Gold-Action Policy Distillation for Agentic Reinforcement Learning in Knowledge Base Question Answering

Achieve state-of-the-art results in agentic knowledge base question answering by distilling gold-action policies into on-policy student rollouts, bridging the gap between sparse rewards and weakly supervised intermediate actions.

Xin Sun, Jian Xie, Zhongqi Chen +5

Natural Language Processing Reasoning & Chain-of-Thought Tool Use & Agents

Search

Liang Wang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (14)