Zhejiang University
Stop drowning your MLLMs in irrelevant context: FES-RAG shows that carefully selecting multimodal fragments boosts factual accuracy by up to 27% and slashes context length.
Semantic grounding, not token probability, is the key to better multimodal RAG.