University of Sheffield
Uncertainty estimates from LLMs can crumble under distribution shift, but the right probe design – think middle layers and token aggregation – can make them surprisingly resilient.
Naive codebook initialization is the hidden killer of extreme LLM quantization, and a smarter initialization scheme can unlock significant quality gains.