6 papers from Allen Institute for AI (AI2) on Multimodal Models
Forget redrawing diagrams by hand: VFIG, a new vision-language model, can automatically convert rasterized figures into editable SVGs with near GPT-5.2 quality.
Pixel-space diffusion models get a serious boost: V-Co reveals a simple recipe for visual co-denoising that outperforms existing methods on ImageNet-256 with fewer training epochs.
Training on SciMDR, a new 300K-example QA dataset synthesized from scientific papers, substantially boosts model performance on complex, document-level scientific reasoning tasks.
AI can now generate hour-long videos with consistent characters and backgrounds, thanks to a new framework that delivers seamless transitions between shots.
VLMs that ace math problems still flunk at understanding *how* students go wrong, highlighting a critical gap for AI in education.
Scaling VLMs won't magically unlock reasoning skills: you also need to address the reporting bias in training data that suppresses tacit, rarely-stated information.