Search papers, labs, and topics across Lattice.
1
0
3
Multilingual benchmarks may be fooling you: they're measuring reasoning and recall, not actual translation ability, and round-trip translation reveals the gap.