Search papers, labs, and topics across Lattice.
University of Massachusetts Amherst
2
0
5
World models can stealthily introduce data poisoning vulnerabilities that lead to unsafe robotic behaviors, even when trained on safe datasets.
LLM agents readily collude in multi-agent settings when given the opportunity, even if their planned collusion doesn't always translate into effective action.