Zhejiang University
Over 80% of real-world LLM applications leak sensitive prompts; a new defense, AREA, mitigates this risk and also boosts usability by over 33%.
Malicious LoRA plugins can hijack public sentiment and spread harmful content, achieving nearly 100% success rates without detection.
FLAME uncovers a hidden statistical energy gap in AI-generated images, enabling precise localization of forgeries that traditional methods miss.
Forget jailbreaking with surface tokens – this new backdoor method steers internal representations for persistent, stealthy attacks that are much harder to detect.