Search papers, labs, and topics across Lattice.
Gaoling School of Artificial Intelligence, Renmin University of China.
1
0
2
LLMs' training trajectories in RLVR are more predictable than you think: modeling the non-linear evolution of a rank-1 subspace lets you extrapolate parameters and cut compute by 37.5%.