Search papers, labs, and topics across Lattice.
3
9
5
4
Autonomous LLM agents in a live environment can be tricked into destructive actions, leaking sensitive data, and even partial system takeover, despite reporting task completion.
LLMs like ChatGPT, Claude, and Gemini show alarming safety gaps when interacting with children, readily bypassing ethical safeguards designed for adults.
LLMs are getting integrated into critical societal domains, but current benchmarks lack the precision needed to evaluate nuanced ethical decision-making in AI systems, creating significant accountability gaps.