INSAIT, Sofia University "St. Kliment Ohridski"
Scaling test-time compute can dramatically improve translations of LLM benchmarks, offering a surprisingly cheap way to get more reliable multilingual evaluations.