Search papers, labs, and topics across Lattice.
Universidade da Coruña
1
0
2
Stop trusting LLM-as-a-judge for persona simulation: Eval4Sim offers a grounded alternative by benchmarking against human conversational patterns across adherence, consistency and naturalness.