Search papers, labs, and topics across Lattice.
University of North Carolina at Chapel Hill
1
0
3
6
Even with ToM prompting, today's LLMs can be easily fooled in simple privacy games, but RL-trained "double agents" learn to effectively mislead attackers by modeling their beliefs.