Search papers, labs, and topics across Lattice.
UC Santa Cruz, Yuhan Wang and Ying-Jun Angela Zhang are with the Department of Information Engineering, The Chinese University of Hong Kong, Hong Kong (e-mail: wy023@ie.cuhk.edu.hk; yjzhang@ie.cuhk.edu.hk).Suzhi Bi is with the College of Electronic and Information Engineering, Shenzhen University, Shenzhen 518060, China (e-mail: bsz@szu.edu.cn)
1
39
3
2
Just 1,000 carefully curated examples can boost an LRM's safety by 40% without significantly sacrificing reasoning ability.