Search papers, labs, and topics across Lattice.
UMass Amherst
2
0
5
LLMs remember too much to be good user simulators, but targeted prompting and a novel "compactor" can make them forget like humans do.
LLMs can learn to make better decisions in complex environments by explicitly reasoning about the cost of exploration, leading to more efficient information gathering and problem-solving.