LLMs implicitly encode uncertainty about numerical predictions in their embeddings, suggesting that expensive autoregressive sampling may be avoidable.
LLMs may use steganography to hide unwanted behaviors; this paper offers a way to detect such hidden content by measuring how much extra "usable information" a decoder gets.