Lattice: the structure behind the noise

Created by Flynn Lachendro

Caifeng Shan

Nanjing University

Papers on Lattice: 3
Total citations: 0
Topics: 6
h-index: 4

Research focus

Multimodal Models (3) · Computer Vision (2) · Eval Frameworks & Benchmarks (1) · RLHF & Preference Learning (1) · Tool Use & Agents (1)

Frequent co-authors

Yifan Zhang (2) · Yunhang Shen (2) · Haoyu Cao (2) · Ran He (2)

Papers (3)

Chaoyou Fu +22 · Apr 6, 2026 · also NJU

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Leaderboard-topping video models are still surprisingly brittle, failing on basic video reasoning tasks unless given the right textual cues.

Chaoyou Fu, Hao Yuan, Haozhi Yuan +20

Computer Vision · Eval Frameworks & Benchmarks · Multimodal Models

Mar 20, 2026

PersonaVLM: Long-Term Personalized Multimodal LLMs

Forget static, single-turn personalization – PersonaVLM unlocks long-term, evolving user alignment in MLLMs, even surpassing GPT-4o.

Chang Nie, Chaoyou Fu, Yifan Zhang +2

Multimodal Models · RLHF & Preference Learning · Tool Use & Agents

Lijiang Li +8 · Mar 6, 2026 · also NJU

Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion

Ditch autoregressive MLLMs: Omni-Diffusion proves that mask-based discrete diffusion models can unify multimodal understanding and generation across text, speech, and images with competitive performance.

Lijiang Li, Zuwei Long, Yunhang Shen +6

Architecture Design (Transformers, SSMs, MoE) · Computer Vision · Multimodal Models

Architecture Design (Transformers, SSMs, MoE) (1)

Chaoyou Fu (1)

Haozhi Yuan (1)