Search papers, labs, and topics across Lattice.
Huawei Noah鈥檚 Ark Lab 2 Harbin Institute of Technology 3 Nankai University
2
0
4
3
Forget expensive human feedback loops: a VLM-powered reward function can efficiently align image editing diffusion models with human preferences.
Current image editing models, even closed-source ones, still fall short on complex and creative instruction-based tasks, as revealed by a new interpretable QA-based evaluation framework.