Outlier-aware rotation can halve the rotation cost of LLM quantization while maintaining state-of-the-art performance.
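One plausible reading of the idea, as a minimal NumPy sketch: restrict the orthogonal rotation to the subset of channels that actually carry outliers, so the rotation shrinks from d×d to k×k (k = d/2 halves the rotated dimension). The channel-selection heuristic, the per-tensor INT4 quantizer, and all sizes below are illustrative assumptions, not the paper's actual method.

```python
import numpy as np

rng = np.random.default_rng(0)

def quantize_int4(x):
    # Symmetric per-tensor INT4: map to [-7, 7], round, dequantize back.
    scale = np.abs(x).max() / 7.0
    return np.round(x / scale) * scale

def random_orthogonal(n):
    # QR of a Gaussian matrix yields a random orthogonal rotation.
    q, _ = np.linalg.qr(rng.standard_normal((n, n)))
    return q

# Toy weight matrix with a few heavy "outlier" channels.
w = rng.standard_normal((64, 128))
w[:, :8] *= 20.0

# Baseline: no rotation, outliers dominate the quantization scale.
err_none = np.abs(quantize_int4(w) - w).mean()

# Full rotation: mix all d channels (d x d matmul).
w_full = w @ random_orthogonal(w.shape[1])
err_full = np.abs(quantize_int4(w_full) - w_full).mean()

# Outlier-aware rotation: mix only the k heaviest channels (k x k matmul),
# here k = d/2, so the rotated dimension is halved.
k = w.shape[1] // 2
order = np.argsort(np.abs(w).max(axis=0))[::-1]  # rank channels by peak magnitude
w_oar = w.copy()
w_oar[:, order[:k]] = w[:, order[:k]] @ random_orthogonal(k)
err_oar = np.abs(quantize_int4(w_oar) - w_oar).mean()

print(f"no rotation:            {err_none:.4f}")
print(f"full rotation:          {err_full:.4f}")
print(f"outlier-aware rotation: {err_oar:.4f}")
```

Spreading the outliers across a large enough subspace flattens the tensor's dynamic range almost as well as a full rotation, which is why the smaller rotation can recover most of the quantization benefit at a fraction of the cost.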
Existing image editing models fall short at precise spatial manipulation, but a new benchmark and dataset chart a path to closing the gap.
Vision-language-action (VLA) models can now be efficiently quantized *without* retraining, even surpassing full-precision performance, thanks to a new post-training method that carefully calibrates scales across attention and output heads.
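A hedged sketch of what per-head scale calibration could look like: grid-search a symmetric quantization scale separately for each attention head's slice of a projection weight, instead of one scale for the whole tensor. The names (`best_scale`, `calibrate_per_head`), the MSE objective, and the grid range are hypothetical illustrations, not the published method.

```python
import numpy as np

def best_scale(x, bits=4, n_grid=80):
    # Grid-search the symmetric quant scale that minimizes reconstruction MSE.
    qmax = 2 ** (bits - 1) - 1
    base = np.abs(x).max() / qmax            # naive max-abs scale as the anchor
    best_s, best_err = base, np.inf
    for frac in np.linspace(0.2, 1.0, n_grid):
        s = base * frac                       # shrink the scale to trade clipping
        x_hat = np.clip(np.round(x / s), -qmax, qmax) * s  # for finer resolution
        err = ((x_hat - x) ** 2).mean()
        if err < best_err:
            best_s, best_err = s, err
    return best_s

def calibrate_per_head(w, n_heads):
    # w: (d_model, n_heads * head_dim) attention or output projection weight.
    # Splitting the columns by head lets each head keep its own dynamic range.
    head_dim = w.shape[1] // n_heads
    return np.array([
        best_scale(w[:, h * head_dim:(h + 1) * head_dim])
        for h in range(n_heads)
    ])

rng = np.random.default_rng(1)
w_o = rng.standard_normal((512, 8 * 64))     # toy output-projection weight
print(calibrate_per_head(w_o, n_heads=8))    # one calibrated scale per head
```

Because this only searches over scales on a small calibration set, it needs no gradient updates or retraining, which is the appeal of post-training quantization for large VLA models.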