Search papers, labs, and topics across Lattice.
University of Queensland
1
0
3
LLMs aren't ready to replace human judges in relevance assessment, as they consistently inflate relevance scores and are easily swayed by superficial cues like passage length.