Search papers, labs, and topics across Lattice.
Baidu Inc., Beijing, China
2
0
4
LLMs can compress context better than dedicated compression modules, simply by prompting them to "think" about the task.
By reflecting on its own reasoning, ReflectRM achieves a +10.2 improvement in mitigating positional bias compared to leading generative reward models, making it a far more stable evaluator.