University of Chinese Academy of Sciences, Beijing, China
Forget text-dominance: Today's Omni-modal LLMs surprisingly favor visual inputs, creating new challenges for cross-modal reasoning.
LLMs exhibit a "Utopian bias" when simulating human behavior, converging towards an unrealistic "positive average person" and failing to capture individual differences and long-tail behaviors.
LLMs trained with reinforcement learning from verifiable rewards (RLVR) become overconfident in incorrect answers, but a simple fix, decoupling the reasoning and calibration objectives, can restore proper calibration without sacrificing accuracy.
By grounding reflection in the visual artifacts of presentation slides, DeepPresenter enables agents to iteratively refine presentations in a way that internal reasoning traces alone cannot.