Search papers, labs, and topics across Lattice.
NASA Langley Research Center
1
1
3
0
LLM evaluation is missing the forest for the trees: automated metrics overlook critical errors that domain experts readily identify using nuanced, context-aware strategies.