Search papers, labs, and topics across Lattice.
LLM-powered query reformulation, a hot topic in IR, often fails to translate gains from lexical to neural retrieval, and bigger models don't always help.
Forget hand-crafted prompts: RL can automatically unearth 36 new failure modes in VLMs that humans miss, revealing surprising blind spots in counting, spatial reasoning, and viewpoint understanding.
LLMs struggle to balance rational financial decisions with mimicking noisy user behavior, often overfitting to short-term market trends instead of aligning with long-term investment goals.