Search papers, labs, and topics across Lattice.
2
2
3
15
Diffusion language models can achieve faster decoding and better accuracy by learning directly from the token reveal order suggested by a lightweight autoregressive teacher, without expensive distillation.
Forget imbalanced LoRA usage: ReMix leverages reinforcement learning to route effectively among LoRAs, boosting performance in parameter-efficient fine-tuning.