Yang Li

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Tool Use & Agents (5)Robotics & Embodied AI (4)Reasoning & Chain-of-Thought (3)RLHF & Preference Learning (3)

Frequent co-authors

Zhichen Dong (1)Yuhan Sun (1)Zinian Peng (1)Taiheng Ye (1)

Papers (13)

Jun 9, 2026

DAMO1d ago·also Shanghai AI Lab, SJTU

How Does Reasoning Flow? Tracing Attention-Induced Information Flow for Targeted RL in LLMs

FlowTracer reveals that optimizing token-level rewards based on attention-induced information flow can dramatically enhance reasoning performance in LLMs.

Zhichen Dong, Yang Li, Yuhan Sun +6

Reasoning & Chain-of-Thought RLHF & Preference Learning

Jun 4, 2026

Yang Li +36d ago·also Barnard College, Columbia, NYU

AURA: Intent-Directed Probing for Implicit-Need Surfacing in Situated LLM Agents

AURA reveals that understanding implicit user intent can dramatically reduce the number of queries needed while enhancing the relevance of responses.

Yang Li, Jiaxiang Liu, Jiang Cai +1

Natural Language Processing Tool Use & Agents

May 26, 2026

2w ago·also CAS, CUHK, Shenzhen MSU-BIT University

Efficient Agentic Reinforcement Learning with On-Policy Intrinsic Knowledge Boundary Enhancement

LLM agents can learn to use tools more efficiently and accurately by explicitly learning when *not* to use them, leading to a 25% increase in tool productivity.

Dingwei Chen, Zefang Zong, Zhipeng Ma +4

RLHF & Preference Learning Tool Use & Agents Training Efficiency & Optimization

May 25, 2026

2w ago

When Search Becomes Memory: Turning Robot Design Trials into Transferable Skills

LLMs can convert expensive robot design evaluations into reusable, auditable design principles by distilling search traces into explicit natural-language skill libraries.

Yunfei Wang, Xiaohao Xu, Yang Li +1

Code Generation & Program Synthesis Robotics & Embodied AI Tool Use & Agents

May 22, 2026

Open-Sora Plan Team2w ago·also Tsinghua AI, AI Lab, Annenberg School of Communication and Journalism, Department of Foundation Model +5

StepAudio 2.5 Technical Report

Forget specialized architectures: StepAudio 2.5 proves a single audio-language foundation, shaped by RLHF, can dominate ASR, TTS, and real-time dialogue simultaneously.

Bin Lin, Bo Zhao, Boyong Wu +93

Architecture Design (Transformers, SSMs, MoE)Open-Source Models & Weights Speech & Audio

2w ago·also Stanford HAI, Tsinghua AI, Beijing Film Academy, CAS +5

EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation

Current video generation benchmarks miss the forest for the trees: EvalVerse actually measures cinematic quality, not just prompt adherence.

Songlin Yang, Haobin Zhong, Ruilin Zhang +20

Computer Vision Eval Frameworks & Benchmarks Multimodal Models

May 6, 2026

May 6, 2026·also Tsinghua AI, Ant Group, NJU, ZJU

PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World

Interactive 3D asset generation can now be driven by functional logic and hierarchical physics, thanks to a new framework that synthesizes simulation-ready assets.

Yunhan Yang, Chunshi Wang, Junliang Ye +6

Data Curation & Synthetic Data Robotics & Embodied AI World Models & Planning

Yihan Lin +5May 6, 2026

From Pixels to Tokens: A Systematic Study of Latent Action Supervision for Vision-Language-Action Models

Image-based latent actions are your secret weapon for long-horizon reasoning in VLAs, while action-based latent actions unlock complex motor coordination.

Yihan Lin, Yang Li, Haitao Shen +3

Computer Vision Multimodal Models Robotics & Embodied AI

May 5, 2026

Logical Consistency as a Bridge: Improving LLM Hallucination Detection via Label Constraint Modeling between Responses and Self-Judgments

LLMs' own self-judgments, when logically linked to their response features, can significantly improve hallucination detection.

Hao Mi, Qiang Sheng, Shaofei Wang +7

Eval Frameworks & Benchmarks Natural Language Processing Reasoning & Chain-of-Thought

Apr 22, 2026

Humanoid Robot (Shanghai) Co.Apr 22, 2026·also HIT, Tongji

VTouch++: A Multimodal Dataset with Vision-Based Tactile Enhancement for Bimanual Manipulation

Vision-based tactile signals in the VTOUCH dataset significantly enhance bimanual manipulation capabilities, paving the way for more effective robotic interactions.

Qianxi Hua, Xinyue Li, Zheng Yan +3

Computer Vision Multimodal Models Robotics & Embodied AI

Apr 13, 2026

Materials Artificial Intelligence CenterApr 13, 2026·also CAS, Chongqing, Shenzhen Institute of Advanced

A collaborative agent with two lightweight synergistic models for autonomous crystal materials research

Domain-specific reasoning and tool coordination in materials science no longer require massive LLMs: a lightweight, dual-model agent outperforms larger general-purpose models while slashing hardware costs.

Tongyu Shi, Yutang Li, Zhanyuan Li +6

Architecture Design (Transformers, SSMs, MoE)Scientific Discovery & Drug Design Tool Use & Agents

Apr 13, 2026·also Stanford HAI, K). On DeepSearch

Escaping the Context Bottleneck: Active Context Curation for LLM Agents via Reinforcement Learning

A lightweight, RL-trained context curator can match GPT-4o's context management abilities, slashing token consumption by 8x and opening the door to efficient long-horizon LLM agents.

Xiaozhe Li, Tianyi Lyu, Yizhao Yang +6

Reasoning & Chain-of-Thought RLHF & Preference Learning Tool Use & Agents

Apr 6, 2026

Beyond the Final Actor: Modeling the Dual Roles of Creator and Editor for Fine-Grained LLM-Generated Text Detection

LLM-generated text detection gets a major upgrade: RACE spots the difference between AI as author versus AI as editor, unlocking policy-aligned regulation.

Yang Li, Yehan Yang

Eval Frameworks & Benchmarks Natural Language Processing