Search papers, labs, and topics across Lattice.
1
0
3
6
Replaying generic pre-training data during fine-tuning boosts target task performance by up to 2x, challenging the common practice of minimizing its use.