GPT-5-Mini can be made 10% more robust to jailbreaks and prompt injections simply by RL fine-tuning on a new instruction hierarchy dataset, IH-Challenge.
LLMs can verify code more effectively by focusing on test case utility rather than sheer quantity, achieving a 28.5% higher mutation score with 19.3% fewer tests.
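To make the "utility rather than quantity" idea concrete, here is a minimal sketch of how a mutation score is computed and how a test suite can be pruned without lowering it. This is not the paper's method: the greedy selection rule, the function names, and the `KILLS` data are all hypothetical and chosen only for illustration. The one standard piece is the definition of mutation score as the fraction of mutants killed by at least one test.

```python
def mutation_score(kill_matrix, selected, num_mutants):
    """Fraction of mutants killed by at least one of the selected tests."""
    if num_mutants == 0:
        return 0.0
    killed = set()
    for test in selected:
        killed |= kill_matrix[test]
    return len(killed) / num_mutants


def select_by_utility(kill_matrix):
    """Greedily keep only tests that kill a mutant no earlier pick has killed.

    Assumption: "utility" here means marginal mutant kills, a stand-in
    for whatever utility measure the summarized work actually uses.
    """
    remaining = dict(kill_matrix)
    killed = set()
    selected = []
    while remaining:
        # Pick the test with the largest number of newly killed mutants.
        best = max(remaining, key=lambda t: len(remaining[t] - killed))
        gain = remaining[best] - killed
        if not gain:
            break  # every remaining test is redundant
        selected.append(best)
        killed |= gain
        del remaining[best]
    return selected


# Hypothetical data: the set of mutant ids each test kills.
KILLS = {
    "test_a": {1, 2, 3},
    "test_b": {2, 3},
    "test_c": {4},
    "test_d": {1},
}

if __name__ == "__main__":
    kept = select_by_utility(KILLS)
    print(kept, mutation_score(KILLS, kept, num_mutants=4))
```

In this toy data, two of the four tests are redundant: dropping them leaves the mutation score unchanged, which is the sense in which a smaller suite can be at least as useful as a larger one.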