Recursive depth in masked diffusion models can substantially improve parameter efficiency, letting a model match much larger counterparts without their added computational cost.
PriFT reports state-of-the-art supervised fine-tuning performance by reweighting tokens with a stable signal from a frozen pretrained model, which improves generalization.