Search papers, labs, and topics across Lattice.
2
0
5
2
Forget slow, expensive neural verifiers: this work shows a simple corpus lookup can provide faster, better rewards for RL fine-tuning of QA models.
LLMs can reason more robustly by fusing contextual hidden states with vocabulary embedding guidance, enabling dynamic switching between latent and explicit thinking.