Search papers, labs, and topics across Lattice.
脡cole Polytechnique, CREST (, ENSAE, CNRS)
1
0
3
LLM benchmarks are missing a critical ingredient: social science data, which could significantly improve model generalization and robustness across a wide range of disciplines.