Nanjing University of Science and Technology
Forget static relevance labels – RRPO uses LLM feedback to train RAG rerankers, boosting generation quality without expensive human annotations.