Search papers, labs, and topics across Lattice.
3
1
6
6
LLMs struggle to generate diverse and specific connections between concepts, even with high token budgets and "thinking" prompts, revealing a gap in creative associative reasoning.
LLMs can learn to make better decisions in complex environments by explicitly reasoning about the cost of exploration, leading to more efficient information gathering and problem-solving.
Forget fixed margins in RLHF: modeling the *strength* of human preferences with "preference-over-preference" learning boosts both discriminative accuracy and generative quality.