Search papers, labs, and topics across Lattice.
This paper introduces TokenLight, a conditional image generation framework for precise image relighting using attribute tokens to control lighting factors like intensity, color, and 3D position. Trained on a large synthetic dataset with real image augmentation, the model learns to manipulate lighting attributes without explicit inverse rendering supervision. TokenLight achieves state-of-the-art performance in both synthetic and real image relighting tasks, demonstrating an implicit understanding of light-scene interaction.
Control in-scene lighting fixtures and edit environment illumination using virtual light sources with unprecedented precision, thanks to attribute tokens.
This paper presents a method for image relighting that enables precise and continuous control over multiple illumination attributes in a photograph. We formulate relighting as a conditional image generation task and introduce attribute tokens to encode distinct lighting factors such as intensity, color, ambient illumination, diffuse level, and 3D light positions. The model is trained on a large-scale synthetic dataset with ground-truth lighting annotations, supplemented by a small set of real captures to enhance realism and generalization. We validate our approach across a variety of relighting tasks, including controlling in-scene lighting fixtures and editing environment illumination using virtual light sources, on synthetic and real images. Our method achieves state-of-the-art quantitative and qualitative performance compared to prior work. Remarkably, without explicit inverse rendering supervision, the model exhibits an inherent understanding of how light interacts with scene geometry, occlusion, and materials, yielding convincing lighting effects even in traditionally challenging scenarios such as placing lights within objects or relighting transparent materials plausibly. Project page: vrroom.github.io/tokenlight/