LLMs can switch between reasoning and factual answering on the fly, without retraining, simply by conditioning on specific token prefixes.
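A minimal sketch of what prefix conditioning could look like, assuming the idea reduces to prepending a control string to the prompt of a frozen model; the prefixes `<reason>` and `<fact>` below are hypothetical placeholders, not the tokens from the paper:

```python
# Mode switching via prompt prefixes on a frozen model: no retraining,
# just different conditioning. Prefix strings are illustrative assumptions.
MODE_PREFIXES = {
    "reasoning": "<reason>\n",  # elicit step-by-step reasoning
    "factual": "<fact>\n",      # elicit a direct factual answer
}

def build_prompt(question: str, mode: str) -> str:
    """Prepend a mode prefix to steer the same model between behaviors."""
    return MODE_PREFIXES[mode] + question

print(build_prompt("What is the capital of Australia?", "factual"))
print(build_prompt("If x + 3 = 7, what is x?", "reasoning"))
```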
Multimodal models are often blind at birth: a new "Visual Attention Score" reveals they struggle to focus on visual inputs during cold-start, but a simple attention-guided fix can boost performance by 7%.
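The blurb does not define the "Visual Attention Score", but one plausible proxy is the share of attention mass that text queries place on visual tokens; the sketch below computes that share under assumed tensor shapes, purely for illustration:

```python
import numpy as np

def visual_attention_share(attn: np.ndarray, visual_idx: np.ndarray) -> float:
    """attn: (heads, queries, keys) row-normalized attention weights;
    visual_idx: key positions belonging to image tokens."""
    mass_on_visual = attn[:, :, visual_idx].sum(axis=-1)  # (heads, queries)
    return float(mass_on_visual.mean())                   # value in [0, 1]

heads, n_q, n_k = 8, 16, 48
attn = np.random.rand(heads, n_q, n_k)
attn /= attn.sum(axis=-1, keepdims=True)  # normalize rows like softmax
score = visual_attention_share(attn, np.arange(24))  # assume first 24 keys are image patches
print(f"visual attention share: {score:.3f}")  # low values suggest the model ignores the image
```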
An 80B model that runs like a 3B? Qwen3-Coder-Next shows you can get competitive coding agent performance with a fraction of the active parameters, thanks to smart training.
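Back-of-envelope arithmetic for how total and active parameter counts diverge in a sparse mixture-of-experts layer, where only the top-k routed experts run per token. All dimensions below are illustrative assumptions chosen to land near 80B/3B, not Qwen3-Coder-Next's actual configuration:

```python
def moe_params(d_model: int, d_ff: int, n_experts: int, top_k: int, n_layers: int):
    per_expert = 2 * d_model * d_ff         # up- and down-projections; gated variants add a third
    total = n_layers * n_experts * per_expert
    active = n_layers * top_k * per_expert  # only routed experts fire per token
    return total, active

# Illustrative config (attention and embedding params omitted):
total, active = moe_params(d_model=2048, d_ff=2560, n_experts=160, top_k=6, n_layers=48)
print(f"total expert params: {total / 1e9:.1f}B")   # ~80.5B
print(f"active per token:    {active / 1e9:.1f}B")  # ~3.0B
```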
LLM benchmark accuracy jumps 10% when evaluated on a cleaned-up version of Humanity's Last Exam, highlighting the significant impact of dataset noise on performance metrics.
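A toy illustration of how removing noisy items shifts a benchmark score; the `flagged` field is a stand-in for whatever mislabeled-or-ambiguous criterion the cleanup used, and the actual HLE revision process is more involved than this filter:

```python
def accuracy(items):
    return sum(it["correct"] for it in items) / len(items)

results = [
    {"correct": True,  "flagged": False},
    {"correct": False, "flagged": True},   # model scored "wrong" on a broken question
    {"correct": True,  "flagged": False},
    {"correct": False, "flagged": False},
]

raw = accuracy(results)
clean = accuracy([it for it in results if not it["flagged"]])
print(f"raw: {raw:.0%}  clean: {clean:.0%}")  # dropping noise lifts the measured score
```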
ToolRMs drastically improve tool-use accuracy in LLMs, outperforming existing models by up to 17.94%, while also reducing output token usage by over 66% through efficient inference-time scaling.
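One common shape for reward-model-guided inference-time scaling is best-of-n selection over candidate tool calls; the sketch below assumes that shape, and `score_tool_call` is a random placeholder for a real ToolRM, whose actual interface the blurb does not specify:

```python
import random

def score_tool_call(call: str) -> float:
    """Placeholder reward model: a real ToolRM would score schema validity,
    argument correctness, and fit to the task."""
    return random.random()

def best_of_n(candidates: list[str]) -> str:
    # Selecting one call early means only the winner's tokens flow into the
    # final output, which is one way such pipelines reduce token usage.
    return max(candidates, key=score_tool_call)

calls = [
    'search(query="weather Paris", units="metric")',
    'search(query="Paris")',
    'get_weather(city="Paris", units="metric")',
]
print(best_of_n(calls))
```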