CISPA Helmholtz Center for Information Security
Even the most advanced LLMs, such as GPT-5.2 and Gemini-3-Pro, often neither recognize nor refuse to process harmful content embedded within seemingly harmless tasks.
Shadow APIs promising access to top LLMs like GPT-5 and Gemini 2.5 often deliver significantly degraded performance (down to 47.21% accuracy) and fail identity verification, casting doubt on research relying on them.