Search papers, labs, and topics across Lattice.
1
0
3
9
Aggregate LLM benchmarks mislead on individual preferences: model rankings correlate near-zero for over half of users.