Search papers, labs, and topics across Lattice.
Department of Systems and Computer Networks, Wrocław University of Science and Technology, Wrocław, Poland
1
0
3
2
AWS Guardrails and NeMo stand out, achieving 96.8% and 93.9% accuracy respectively, proving that effective defenses against jailbreaks are within reach for commercial LLM deployments.