Scale AI
LLMs can't reliably predict the outcomes of scientific experiments and, more worryingly, can't tell when they're wrong, unlike human experts, whose accuracy rises sharply when they feel confident.
LLMs can boost novice performance on complex biosecurity tasks to surpass even expert-level benchmarks, but users struggle to fully leverage the models' capabilities.