Search papers, labs, and topics across Lattice.
1
0
2
AlphaToken reveals a method to enhance LLM post-training by strategically valuing response tokens, leading to improved performance and reduced forgetting.