Search papers, labs, and topics across Lattice.
Gaoling School of Artificial Intelligence, GSAI, Renmin University of China
1
0
3
15
Current multimodal models are stuck in bi-modal interactions, but OmniGAIA and OmniAtlas offer a path towards truly omni-modal AI assistants capable of reasoning and tool use across video, audio, and images.