LLMs can predict multiple tokens in parallel without any additional training, simply by probing their embedding space with dynamically generated mask tokens.