Lattice AI Research

Research focus

Multimodal Models (2) · Computer Vision (1) · Eval Frameworks & Benchmarks (1) · Open-Source Models & Weights (1) · Training Efficiency & Optimization (1)

Frequent co-authors

Bingli Wang (1) · Huanze Tang (1) · Haijun Lv (1) · Zhishan Lin (1)

Papers (2)

Apr 30, 2026 · also Shanghai AI Lab

COHERENCE: Benchmarking Fine-Grained Image-Text Alignment in Interleaved Multimodal Contexts

Current MLLMs still struggle to connect the dots between images and text when they're interleaved, highlighting a critical gap in real-world multimodal understanding.

Bingli Wang, Huanze Tang, Haijun Lv +3

Computer Vision · Eval Frameworks & Benchmarks · Multimodal Models

Tsinghua AI · Apr 14, 2025 · also NUS, CUHK, Deakin, Fudan +9

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Open-source multimodal models just leveled up: InternVL3 rivals closed-source titans like GPT-4o by pre-training vision and language together from the start.

Jinguo Zhu, Weiyun Wang, Zhe Chen +45901

Multimodal Models · Open-Source Models & Weights · Training Efficiency & Optimization

Lixin Gu
