Shanghai AI Laboratory
Automating LLM fine-tuning is now possible: a multi-agent system, TREX, matches or exceeds human performance on a diverse set of real-world tasks.
Fine-tuning smaller reasoning models on data from larger models can backfire spectacularly unless you carefully match the stylistic nuances of the student.