University of Florida
LLM judges in healthcare show promise for scalable evaluation, but their reliability swings wildly across tasks, demanding careful design and validation before trusting their verdicts.
Forget jailbreaking with surface tokens: this new backdoor method steers internal representations instead, producing persistent, stealthy attacks that are much harder to detect than token-level triggers.