Search papers, labs, and topics across Lattice.
2
0
6
LLMs can internally compute the right answer to simple reasoning tasks like character counting, but then actively suppress that information before outputting the final (wrong) answer.
Hybrid models can sometimes beat BERT at hate speech detection, and neutralizing text transformations offer a new mitigation path.