Difan Zou

University of Hong Kong

Papers on Lattice

Total citations

Topics

Publication activity (papers/week, last 8 weeks)

Research focus

Multimodal Models (3) · Computer Vision (2) · Data Curation & Synthetic Data (1) · Architecture Design (Transformers, SSMs, MoE) (1)

Frequent co-authors

Zichao Yu (2), Jinghong Lan (1), Wei Cheng (1), Yunuo Chen (1)

Papers (3)

Jun 18, 2026

3d ago·also HKU, Westlake

FreeStyle: Free Control of Style-Content Dual-Reference Generation from Community LoRA Mining

FreeStyle achieves a remarkable balance between style alignment and content preservation while effectively suppressing semantic leakage in dual-reference image generation.

Jinghong Lan, Wei Cheng, Yunuo Chen +10

Computer Vision · Data Curation & Synthetic Data · Multimodal Models

May 25, 2026

3w ago·also Tsinghua AI, AI Laboratory, HKU, Monash +3

Toward Native Multimodal Modeling: A Roadmap

Forget bolting vision onto language models – truly powerful multimodal AI demands rethinking architectures from the ground up.

Siyu An, Junru Lu, Junnan Dong +18

Architecture Design (Transformers, SSMs, MoE) · Multimodal Models

Apr 30, 2026

also HKU, Sydney

AesRM: Improving Video Aesthetics with Expert-Level Feedback

Expert-level video aesthetics can be captured and improved using a hierarchical rubric and reward models trained with a progressive learning scheme.

Yujin Han, Yujie Wei, Yefei He +11

Computer Vision · Multimodal Models
