Carnegie Mellon University, New York University
Healthcare LLM benchmarks can be misleading because they fail to capture critical assumptions about how users interact with models and how those interactions translate to real-world outcomes.