Search papers, labs, and topics across Lattice.
Harbin Institute of Technology
2
0
3
8
Social intelligence may require more than just reasoning power: a 7B model trained with SAVOIR beats GPT-4o and Claude-3.5-Sonnet on social interaction tasks.
Unlock 2x faster reinforcement learning by distilling group feedback into actionable language refinements that guide exploration.