Even with theory-of-mind (ToM) prompting, today's LLMs are easily fooled in simple privacy games, but RL-trained "double agents" learn to mislead attackers effectively by modeling the attackers' beliefs.
LLMs can learn to solve previously intractable reasoning problems by training on adaptively reformulated, cognitively simpler versions of the same tasks.