The new InterveneBench benchmark shows that LLMs still fall short at reasoning about real-world policy interventions and causal study design.