Yang Liu

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (8)Computer Vision (7)Architecture Design (Transformers, SSMs, MoE) (6)Natural Language Processing (5)

Frequent co-authors

Weisong Sun (3)Minghang Zheng (3)Yuchen Chen (2)Haocheng Huang (2)

Papers (30)

Jun 11, 2026

Tsinghua AI4d ago·also China University of Mining Technology, Shanghai AI Lab, SJTU, ZJU

RoboProcessBench: Benchmarking Process-Aware Understanding in Vision-Language Robotic Manipulation

Current vision-language models struggle with process understanding in robotic manipulation, but targeted post-training can yield significant improvements.

Dayu Xia, Yue Shi, Yao Mu +8

Eval Frameworks & Benchmarks Multimodal Models Robotics & Embodied AI

Jun 9, 2026

Tsinghua AI6d ago·also NJU

Securing Code Understanding: Detecting Natural Backdoor Vulnerability in Code Language Models

Natural backdoor vulnerabilities are not just a theoretical concern; they are prevalent in CodeLMs and can significantly compromise code security.

Yuchen Chen, Weisong Sun, Haocheng Huang +11

Code Generation & Program Synthesis Red-Teaming & Adversarial Robustness

Jun 8, 2026

Tsinghua AI1w ago

Temporal-Aware Reasoning Optimization for Video Temporal Grounding

Superficial reasoning in video temporal grounding can be transformed into high-quality, time-aware insights with the right optimization framework.

Minghang Zheng, Zihao Yin, Yuxin Peng +1

Multimodal Models Reasoning & Chain-of-Thought RLHF & Preference Learning

Tsinghua AI1w ago·also Department of Systems Science, Faculty of Arts and Sciences, International Academic Center of Complex, MiniMax

ABot-Earth 0.5: Generative 3D Earth Model

Generating realistic 3D environments from satellite imagery in under 10 minutes could revolutionize how we visualize and interact with our planet.

Ming Qian, Tianjian Ouyang, Mingchao Sun +25

Computer Vision World Models & Planning

1w ago·also Tsinghua AI, CAS, DP Technology, Ningbo Key Laboratory of Advanced Manufacturing Simulation +5

Data-driven discovery of governing differential equations across physical systems

The new REO framework reveals that the true challenge in differential equation discovery lies not just in recovering equations, but in leveraging them to reshape scientific understanding.

Siyu Lou, Hao Xu, Wenguan Wang +5

Scientific Discovery & Drug Design

Tsinghua AI1w ago·also PKU, Tongji

Bridging the Agent-World Gap: Text World Models for LLM-based Agents

Text world models can transform LLM-based agents from reactive responders into proactive planners, enhancing their performance in complex interactive tasks.

Yixia Li, Hongru Wang, Peng Lai +14

Tool Use & Agents World Models & Planning

Jun 4, 2026

Tsinghua AI1w ago·also Huawei, PKU, Xiaohongshu

RedKnot: Efficient Long-Context LLM Serving with Head-Aware KV Reuse and SegPagedAttention

Transforming the KV cache from a monolithic structure into a dynamic, head-aware system could revolutionize LLM serving efficiency and scalability.

Yang Liu, ZhaoKai Luo, HuaYi Jin +5

Distributed Systems & Hardware Inference & Quantization

Jun 1, 2026

Tsinghua AI2w ago·also BAAI, CAS

Training-Free Composed Video Retrieval via Visual Representation-Guided Video-LLM Reasoning

Achieving nearly 50% Recall@1 in video retrieval without any training marks a significant leap in efficiency and effectiveness for complex user queries.

Yang Liu, Qianqian Xu, Peisong Wen +2

Multimodal Models Recommendation & Information Retrieval

2w ago·also Tsinghua AI, NWU, Xidian

Benign Inputs, Harmful Outputs: Cross-Modal Jailbreaking via Distributed Semantic Recomposition

MLLMs can be manipulated to produce harmful outputs from benign inputs, exposing a critical vulnerability in their safety mechanisms.

Yani Wang, Yilong Yang, Yang Liu +3

Constitutional AI & AI Ethics Multimodal Models Red-Teaming & Adversarial Robustness

May 31, 2026

Tsinghua AI2w ago·also NTU, WHU

Bridging Requirements and Architecture: Multi-Agent Orchestration with External Knowledge and Hierarchical Memory

MAAD not only automates architecture design but also enhances the quality of outputs through a collaborative agent framework and advanced LLM integration.

Ruiyin Li, Yiran Zhang, Xiyu Zhou +5

Architecture Design (Transformers, SSMs, MoE)

Tsinghua AI2w ago

Distilling Neuro-Symbolic Programs into 3D Multi-modal LLMs

APEIRIA bridges the gap between interpretable neuro-symbolic reasoning and the flexibility of multi-modal language models, achieving superior performance in 3D spatial reasoning.

Wentao Mo, Yang Liu

Multimodal Models Reasoning & Chain-of-Thought

May 26, 2026

Tsinghua AI2w ago·also Beihang, BUPT, CAS, Fullive Innovation (Beijing) AI Technology Co. +3

FinHarness: An Inline Lifecycle Safety Harness for Finance LLM Agents

Finance LLM agents can now block unauthorized actions mid-trajectory without sacrificing performance, thanks to a novel inline safety harness that adaptively routes verification between lightweight and advanced LLM judges.

Haoxuan Jia, Yang Liu, Yancheng Chen +4

Constitutional AI & AI Ethics Red-Teaming & Adversarial Robustness Tool Use & Agents

2w ago·also Tsinghua AI, Beihang, CAS, NJU +3

ExTax: Explainable Disinformation Detection via Persuasion, Emotion, and Narrative Role Taxonomies

Disinformation detection gets a major upgrade with ExTax, a framework that doesn't just flag fake news, but explains *how* it manipulates you through persuasion, emotion, and narrative.

Shang Luo, Zhenchen Sun, Yang Liu +4

Constitutional AI & AI Ethics Natural Language Processing Red-Teaming & Adversarial Robustness

May 25, 2026

Tsinghua AI3w ago·also BUPT, Fudan, PKU

Towards 3D heart mesh generation using contactless radar imaging and physics-informed neural network

Reconstructing high-fidelity 3D heart models from noisy radar data is now possible, thanks to a novel mesh deformation approach that leverages physics-informed learning.

Jinye Li, Chenxi Fu, Minghang Zheng +3

Computer Vision Scientific Discovery & Drug Design

Tsinghua AI3w ago

TriDP-PTM: a three-stage distortion-perception tradeoff guides the pre-training model for radar cardiac sensing

Counterintuitively, letting radar cardiac sensors learn to mimic ECGs first yields far better performance on downstream tasks like blood pressure regression and waveform segmentation than directly training on those tasks.

Jinye Li, Yang Liu, Qingchao Chen

Scientific Discovery & Drug Design Speech & Audio

3w ago·also Tsinghua AI, AI for Science Institute, School of Mechanics and Engineering, TJU

NPSolver: Neural Poisson Solver with Iterative Physics Supervision

Solving Poisson equations just got faster and more stable: NPSolver trains neural operators without solution labels by iteratively refining predictions with preconditioned conjugate gradient steps.

Bocheng Zeng, Runze Mao, Mengtao Yan +4

Architecture Design (Transformers, SSMs, MoE)Scientific Discovery & Drug Design Training Efficiency & Optimization

Apr 30, 2026

Tsinghua AIApr 30, 2026·also PolyU, SEU

FedHarmony: Harmonizing Heterogeneous Label Correlations in Federated Multi-Label Learning

Federated learning can overcome data silos, but struggles when clients have different label relationships; FedHarmony shows how to harmonize these differences, leading to better performance.

Zhi Kou, Zhiqiang Kou, Jun Wu +11

Data Curation & Synthetic Data Distributed Systems & Hardware Natural Language Processing

Tsinghua AIApr 30, 2026·also CAS, NJU, NTU, Soochow

PuzzleMark: Implicit Jigsaw Learning for Robust Code Dataset Watermarking in Neural Code Completion Models

Code dataset watermarking gets a stealthy upgrade: PuzzleMark hides watermarks in variable names based on code complexity, making them nearly undetectable while guaranteeing perfect verification.

Haocheng Huang, Yuchen Chen, Weisong Sun +6

Code Generation & Program Synthesis Data Curation & Synthetic Data

Tsinghua AIApr 30, 2026·also BUPT, Corresponding author

Decoding Scientific Experimental Images: The SPUR Benchmark for Perception, Understanding, and Reasoning

Today's best vision-language models are surprisingly bad at reading scientific figures, failing to match expert-level reasoning on a new benchmark of experimental images.

Junpeng Ding, Zichen Tang, Zichen Tang +21

Computer Vision Eval Frameworks & Benchmarks Multimodal Models

Tsinghua AIApr 30, 2026·also Microsoft Research

CasLayout: Cascaded 3D Layout Diffusion for Indoor Scene Synthesis with Implicit Relation Modeling

Forget fully connected relation graphs: CasLayout's sparse relation modeling unlocks enhanced controllability and realism in 3D indoor scene synthesis.

Yingrui Wu, Youkang Kong, Mingyang Zhao +5

Architecture Design (Transformers, SSMs, MoE)Computer Vision Data Curation & Synthetic Data

Tsinghua AIApr 30, 2026·also Microsoft Research

SQuadGen: Generating Simple Quad Layouts via Chart Distance Fields

Simple, artist-friendly quad meshes can now be automatically generated on 3D shapes using a diffusion model trained on a continuous surface representation, sidestepping the complexity of discrete mesh optimization.

Youkang Kong, Yang Liu, Yang Liu +4

Architecture Design (Transformers, SSMs, MoE)Computer Vision

Apr 29, 2026

Tsinghua AIApr 29, 2026·also Fudan, RUC, TRI, Xidian

CL-bench Life: Can Language Models Learn from Real-Life Context?

Today's best language models can barely make sense of your messy group chats and fragmented digital life, achieving only 19% accuracy on a new benchmark of real-world reasoning.

Shihan Dou, Yujiong Shen, Chenhao Huang +33

Eval Frameworks & Benchmarks Natural Language Processing

Apr 28, 2026

Tsinghua AIApr 28, 2026·also Huawei, PKU

OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Grounding

MLLMs are better at understanding videos than directly grounding text queries within them, and a self-correction training loop can close the gap.

Minghang Zheng, Zihao Yin, Yi Yang +3

Data Curation & Synthetic Data Multimodal Models Reasoning & Chain-of-Thought

Apr 20, 2026

Apr 20, 2026·also DAMO, Tsinghua AI, BUPT

UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models

RL fine-tuning of discrete diffusion models can be made dramatically more stable and effective by treating the final denoised sample as the action and reconstructing trajectories using the forward diffusion process.

Jiaqi Wang, Haoge Deng, Ting Pan +10

Architecture Design (Transformers, SSMs, MoE)Computer Vision RLHF & Preference Learning+1

Tsinghua AIApr 20, 2026·also Kyoto

Understanding the Prompt Sensitivity

LLMs disperse similar prompts instead of clustering them, leading to significant prompt sensitivity that challenges stability and reliability.

Yang Liu, Chenhui Chu

Interpretability & Mechanistic Interp Natural Language Processing

Apr 20, 2026·also Tsinghua AI, NTU, SMU, University of Massachusetts

Weaponizing the Commons: A Taxonomy and Detection Framework of Abuse on GitHub

GitHub abuse is more widespread and varied than previously thought, demanding a unified detection approach to safeguard software supply chains.

Yuli Cheng, Xiaoyu Zhang, Jiongchi Yu +3

Code Generation & Program Synthesis Data Curation & Synthetic Data Open-Source Models & Weights

Tsinghua AIApr 20, 2026

M100: An Orchestrated Dataflow Architecture Powering General AI Computing

Ditching caches for compiler-managed data streams, Li Auto's M100 architecture achieves higher utilization than GPUs on autonomous driving tasks, hinting at a new path for efficient AI inference.

Yan Xie, Changkui Mao, Chan-gui Wu +40

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Training Efficiency & Optimization

Apr 17, 2026

Tsinghua AIApr 17, 2026·also BIGAI, University of Science and Technology

Revisiting a Pain in the Neck: A Semantic Reasoning Benchmark for Language Models

LLMs still struggle to understand the meaning of common phrases, idioms, and compound words, revealing critical gaps in semantic reasoning.

Yang Liu, Hongming Li, Melissa Xiaohui Qin +2

Eval Frameworks & Benchmarks Natural Language Processing

Apr 15, 2026

Tsinghua AIApr 15, 2026·also Corresponding author are Bo Cheng and Soujanya, PhotoFlow, SCU, Tencent AI +1

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Imagine creating high-fidelity, navigable 3D worlds from just a text prompt or a single image – HY-World 2.0 makes it a reality.

Team HY-World, Chenjie Cao, Xuhui Zuo +40

Computer Vision Multimodal Models World Models & Planning

Apr 7, 2026

University of SannioApr 7, 2026·also Tsinghua AI, School of Computer Science and Technology, Veermata Jijabai Technological Institute

A Novel PID Design Method via Model-Based Reinforcement Learning Algorithms

PID controllers can now be enhanced with data-driven, adaptive gains learned directly from RL, preserving their simplicity while boosting performance in uncertain environments.

Hozefa Jesawada, A. Yerudkar, Yang Liu +2

RLHF & Preference Learning Robotics & Embodied AI Training Efficiency & Optimization+1

Search

Yang Liu

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (30)