May
2
0
5
14
Multimodal agents can now continually improve their tool use and orchestration in open-ended settings without parameter updates, using a dual-stream framework that learns from both past experiences and structured skills.
LLMs can be taught to "think longer" and explore more diverse reasoning paths in-context via a simple length-incentivized reward, leading to improved generalization.