Bohan Zhuang

Counterintuitively, VLMs can achieve higher VQA accuracy by intentionally degrading visual inputs, suggesting that high-resolution details can act as noise that hinders reasoning.

Haoxuan Han, Yefei He, Bohan Zhuang

Computer Vision Multimodal Models Reasoning & Chain-of-Thought

Weian Mao +9Apr 6, 2026·also Ministry of Education, SEU

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

LLMs can achieve 2.5x higher throughput and 10.7x KV memory reduction in long-context reasoning by compressing the KV cache using trigonometric functions derived from pre-RoPE query/key vector distributions.

Weian Mao, Weian Mao, Xi Lin +7

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Reasoning & Chain-of-Thought

Mar 29, 2026

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

The medical imaging AI community is being held back by a fragmented data landscape, but a new metadata-driven fusion paradigm offers a path to unlocking the power of foundation models.

Zhongying Deng, Cheng Tang, Ziyan Huang +120

Computer Vision Data Curation & Synthetic Data Scientific Discovery & Drug Design

Search

Bohan Zhuang

Research focus

Frequent co-authors

Papers (5)