Search papers, labs, and topics across Lattice.
Seoul National University
2
0
5
LVLMs can generate more factual and detailed image captions at a lower compute cost by reflecting on their past mistakes and systematically attending to overlooked details.
LLMs' "Aha!" moments aren't about magic tokens, but about explicitly verbalizing and managing uncertainty during reasoning, which drives performance.