University of Twente
LLMs struggle to identify software vulnerabilities: even top models achieve only ~90% accuracy on a new CVE-based benchmark, which suggests significant risks in applying them to software development.