Search papers, labs, and topics across Lattice.
1
0
3
LLM-generated labels for low-resource IR are surprisingly unreliable across languages, even with consistency checks and human evaluation, raising serious questions about cross-lingual dataset reuse.