Huangxuan Wu

Papers on Lattice

Total citations

Topics

h-index

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)Computer Vision (1)Multimodal Models (1)

Frequent co-authors

Yifei She (1)

Papers (1)

Aug 31, 2025

Yifei She +1Aug 31, 2025

Fusion to Enhance: Fusion Visual Encoder to Enhance Multimodal Language Model

MLLMs can get a serious vision boost by fusing features from multiple specialized visual encoders, rather than relying on a single, semantically-focused one.

Yifei She, Huangxuan Wu

Architecture Design (Transformers, SSMs, MoE)Computer Vision Multimodal Models

Search

Huangxuan Wu

Research focus

Frequent co-authors

Papers (1)