Search papers, labs, and topics across Lattice.
1
0
3
Forget difficulty-based heuristics: InSight leverages weighted mutual information to select RL training data, boosting LLM reasoning and alignment with up to 2.2x speedup.