Yu Cheng

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Reasoning & Chain-of-Thought (4)Eval Frameworks & Benchmarks (2)Training Efficiency & Optimization (2)Code Generation & Program Synthesis (2)

Frequent co-authors

Zhilin Wang (2)Runzhe Zhan (2)Jiacheng Chen (2)Yafu Li (2)

Papers (9)

Jul 2, 2026

Tsinghua AI3w ago·also UMacau, UT Austin

EvoPolicyGym: Evaluating Autonomous Policy Evolution in Interactive Environments

GPT-5.5 not only tops the leaderboard in policy evolution but also reveals critical insights into how agents can optimize performance through strategic feedback utilization.

Zhilin Wang, Hanxiao Song, Han Song +15

Eval Frameworks & Benchmarks Tool Use & Agents

Jun 29, 2026

Disen Lan +73w ago

Morphing into Hybrid Attention Models

FlashMorph reveals that optimizing layer selection in hybrid attention models can drastically improve efficiency while maintaining performance, outperforming existing heuristic methods.

Disen Lan, Jianbin Zheng, Yuxi Ren +5

Architecture Design (Transformers, SSMs, MoE)Training Efficiency & Optimization

Jun 16, 2026

Yu Cheng +4Jun 16, 2026·also College of Computer Science and Technology

PracRepair: LLM-Empowered Automated Program Repair Inspired by Human-Like Debugging Practices

PracRepair fixes 162 out of 171 bugs in a challenging benchmark, showcasing a leap in automated program repair capabilities through human-inspired debugging techniques.

Yu Cheng, Zhongxin Liu, Chao Ni +2

Code Generation & Program Synthesis

Jun 11, 2026

Tsinghua AIJun 11, 2026·also Birmingham, CUHK, Fudan, MiniMax +1

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

MaxProof's innovative test-time scaling enables an AI to outperform human champions in mathematical proof competitions.

Jiacheng Chen, Xinyu Zhang, Shunkai Zhang +23

Reasoning & Chain-of-Thought Scaling Laws & Emergent Abilities

Jun 9, 2026

DAMOJun 9, 2026·also HIT, Shanghai AI Lab, SJTU

How Does Reasoning Flow? Tracing Attention-Induced Information Flow for Targeted RL in LLMs

FlowTracer reveals that optimizing token-level rewards based on attention-induced information flow can dramatically enhance reasoning performance in LLMs.

Zhichen Dong, Yuhan Sun, Zinian Peng +5

Reasoning & Chain-of-Thought RLHF & Preference Learning

Microsoft ResearchJun 9, 2026·also CUHK, Oxford, SJTU

3D-CoS: A New 3D Reconstruction Paradigm Based on VLM Code Synthesis

Code-based 3D reconstruction achieves superior edit fidelity and locality, outperforming traditional point-cloud methods in preserving unedited regions.

Yuhao Wang, Puyi Wang, Linjie Li +3

Code Generation & Program Synthesis Multimodal Models

Jun 9, 2026·also Tsinghua AI, AI Laboratory, CUHK, HKU +3

ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics

Even the best LLMs struggle with Olympiad-level combinatorics, achieving only 65.4% on a benchmark designed to expose their reasoning limitations.

Shunkai Zhang, Yun Luo, Qianjia Cheng +14

Eval Frameworks & Benchmarks Reasoning & Chain-of-Thought

Apr 21, 2026

Kun Wang +7Apr 21, 2026

ProjLens: Unveiling the Role of Projectors in Multimodal Model Safety

Projector fine-tuning, commonly used for aligning MLLMs, unexpectedly introduces backdoor vulnerabilities with activation mechanisms distinct from those in text-only LLMs.

Kun Wang, Cheng Qian, Cheng Qian +5

Interpretability & Mechanistic Interp Multimodal Models Red-Teaming & Adversarial Robustness

Qingyang Zhang +9Apr 21, 2026

TEMPO: Scaling Test-time Training for Large Reasoning Models

Test-time training can finally scale for large reasoning models: TEMPO unlocks sustained performance gains by interleaving policy refinement with periodic critic recalibration, boosting accuracy by over 18% on challenging benchmarks.

Qingyang Zhang, Xinke Kong, Haitao Wu +7

Inference & Quantization Reasoning & Chain-of-Thought Training Efficiency & Optimization

Search

Yu Cheng

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (9)