Search papers, labs, and topics across Lattice.
The Chinese University of Hong Kong, Tencent Hunyuan
2
0
2
Paralinguistic cues can be seamlessly integrated into dialogue systems, boosting performance metrics significantly without compromising general capabilities.
Spatial-Omni achieves superior spatial audio understanding in multimodal LLMs by effectively incorporating spatial cues without modifying existing architectures.