University of Toronto, McGill University ♢
SIREN shows that tapping into an LLM's internal states can drastically improve harmfulness detection while cutting the parameter count by a factor of 250.