Search papers, labs, and topics across Lattice.
Meituan
3
0
4
AuRA achieves superior performance in speech-language tasks by seamlessly integrating audio understanding into LLMs, outperforming traditional methods in both speed and accuracy.
Complex visual queries can significantly elevate the reasoning capabilities of multi-modal large language models, revealing new dimensions in AI's understanding of abstract visual content.
Surprisingly, the "think before answer" paradigm fails to enhance generative recommendation models, prompting a novel approach that redefines how reasoning is integrated into these systems.