Search papers, labs, and topics across Lattice.
2
0
5
Even when explicitly warned about potential deception, LLMs can still be persuaded to make incorrect decisions, highlighting a critical gap between task performance and vigilance.
VLMs are nowhere near human-level general intelligence: they score less than 10% of human performance across a diverse set of human-designed games, especially struggling with world-model learning, memory, and planning.