Search papers, labs, and topics across Lattice.
Institute for the Study of Natural and Artificial Intelligence, Harvard AI and Robotics Lab, Harvard Medical School University of Southern California, Carnegie Mellon University, Stanford University Kempner, Harvard University
CMU Machine Learning2
0
4
LLMs struggle to synthesize scientific conclusions from structured biomedical evidence, and current metrics fail to capture nuanced differences in their reasoning abilities.
LLMs struggle even more when facing the double whammy of non-native English and typos, revealing that real-world performance is likely overestimated by standard English benchmarks.