Harvard University, Warsaw University of Technology, IDEAS Research Institute
1
0
3
1
Censored LLMs offer a surprisingly natural and effective environment for stress-testing methods that aim to elicit truthfulness and detect deception.