Search papers, labs, and topics across Lattice.
2
0
5
2
Even with ToM prompting, today's LLMs can be easily fooled in simple privacy games, but RL-trained "double agents" learn to effectively mislead attackers by modeling their beliefs.
Even state-of-the-art multimodal LLMs struggle to accurately cite their sources when reasoning across video, audio, and text, often hallucinating citations despite generating correct answers.