Search papers, labs, and topics across Lattice.
1
0
2
2
LLMs' true reasoning can be detected via activation probing even when their chains-of-thought are misleading rationalizations, revealing a discrepancy between internal processing and external justification.