Baidu Inc.
LLMs can cut over 80% of their chain-of-thought tokens, with a small accuracy gain, using a new RL-based method that targets the "Minimal Sufficient Length" of reasoning.
LLMs can escape the trap of converging on popular but incorrect answers in unsupervised RLVR by temporarily "unlearning" and exploring diverse response options.