University of Hong Kong
FreeStyle achieves a remarkable balance between style alignment and content preservation while effectively suppressing semantic leakage in dual-reference image generation.
Forget bolting vision onto language models – truly powerful multimodal AI demands rethinking architectures from the ground up.
Expert-level video aesthetics can be captured and improved using a hierarchical rubric and reward models trained with a progressive learning scheme.