The Pennsylvania State University, State College
By dynamically re-weighting abstention rewards based on trajectory consistency during training, LLMs can learn to abstain, with state-of-the-art accuracy, from answering questions they're unsure about.
Stop naively aggregating knowledge for KB-VQA: MaS-VQA's Mask-and-Select mechanism shows how to prune irrelevant image regions and knowledge fragments for better reasoning.