LRMs can often recover from injected errors in their reasoning steps, revealing a hidden "critique" ability that can be harnessed to improve performance without additional training.
Intrinsic reward signals in unsupervised RL for LLMs inevitably collapse as the model's prior sharpens, but external rewards grounded in computational asymmetries offer a path to sustained scaling.