Search papers, labs, and topics across Lattice.
Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China
1
0
3
10
Current multimodal models are stuck in bi-modal interactions, but OmniGAIA and OmniAtlas offer a path towards truly omni-modal AI assistants capable of reasoning and tool use across video, audio, and images.