Search papers, labs, and topics across Lattice.
Beijing Academy of Artificial Intelligence
2
0
5
Agent deception in autonomous systems is not just a theoretical concern; it鈥檚 a pressing reality that can undermine trust in AI applications.
Achieve multilingual LLM safety alignment without expensive language-specific training data by enforcing cross-lingual consistency during monolingual alignment.