Stop wasting compute: a learned policy can intelligently allocate LLM inference budgets, boosting accuracy by up to 12.8% compared to uniform allocation.
RL unlocks genuinely new tool-use capabilities in LLMs: it enables compositional strategies beyond what re-sampling alone can achieve, challenging the notion that RL merely improves reliability.