Search papers, labs, and topics across Lattice.
Shanghai AI Laboratory, Shanghai AI Lab
3
0
8
8
LLMs can now automatically evolve and optimize GPU kernels to beat hand-tuned and proprietary models like Gemini and Claude.
Fine-tuning smaller reasoning models on data from larger models can backfire spectacularly unless you carefully match the stylistic nuances of the student.
Multilingual reasoning in LLMs isn't just about translation鈥攊t's a powerful knob for improving RL training by expanding the exploration space and boosting exploitation.