Search papers, labs, and topics across Lattice.
Max Planck Institute for Intelligent Systems, ELLIS Institute T眉bingen, T眉bingen AI Center
1
0
3
Safety benchmarks may be measuring a model's knowledge of how evaluations are designed, not genuine safety.