Search papers, labs, and topics across Lattice.
2
0
6
6
Optimal LLM pretraining actually requires *overtraining* when you account for inference costs, overturning conventional scaling wisdom.
Forget expensive human annotations: RubiCap uses LLM-generated rubrics to train image captioning models via RL, achieving superhuman performance and even improving VLM pretraining.