Search papers, labs, and topics across Lattice.
Shanghai Jiaotong University
2
0
5
Current video-to-audio models are surprisingly bad at generating speech and singing, exposing critical gaps in multimodal understanding.
LLMs with similar semantic skills show wildly different economic performance in simulated markets, revealing that reasoning about competition and resource allocation remains a major challenge.