A more robust evaluation framework for jailbreak methods, built from a curated harmful-question dataset, detailed case-by-case evaluation guidelines, and a scoring system that applies those guidelines, delivers fairer and more stable evaluation.