Lattice AI Research

Research focus

Recommendation & Information Retrieval (1)Interpretability & Mechanistic Interp (1)Reasoning & Chain-of-Thought (1)RLHF & Preference Learning (1)

Frequent co-authors

Yunlong Hou (1)Vincent Y. F. Tan (1)Yuhang He (1)Haodong Wu (1)

Papers (2)

May 25, 2026

NUS2w ago·also HKUST

On the Benefits of Free Exploration for Regret Minimization in Multi-Armed Bandits

Free exploration in multi-armed bandits can lead to sharp phase transitions in accumulated regret, offering significant savings compared to standard regret minimization.

Yunlong Hou, Zixin Zhong, Vincent Y. F. Tan

Recommendation & Information Retrieval

Apr 13, 2026

Apr 13, 2026·also HKUST

Rethinking Token-Level Credit Assignment in RLVR: A Polarity-Entropy Analysis

RLVR's reasoning gains hinge on high-entropy tokens, revealing a critical inefficiency in uniform reward broadcast that EAPO effectively addresses.

Yuhang He, Haodong Wu, Siyi Liu +7

Interpretability & Mechanistic Interp Reasoning & Chain-of-Thought RLHF & Preference Learning

Search

Zixin Zhong

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)