Search papers, labs, and topics across Lattice.
1
0
3
Achieve more human-like and consistent user simulation in Chinese across multiple domains by iteratively refining user profiles with real dialogue data and aligning behavior with a rubric-guided reward model.