Search papers, labs, and topics across Lattice.
2
0
6
0
Fine-tuning LLMs to emulate specific cultural values improves average accuracy, but simultaneously widens the disparity in performance between demographic subgroups.
LLMs can achieve state-of-the-art reasoning accuracy with significantly fewer tokens by rewarding intermediate reasoning steps that maximize information gain and maintain monotonic progress.