Search papers, labs, and topics across Lattice.
The paper introduces ConceptWeaver, a framework for disentangling and manipulating concepts within flow-based generative models from a single reference image. They first use a differential probing technique to reveal that the generative process occurs in three stages: Blueprint, Instantiation, and Refinement. ConceptWeaver then learns concept-specific semantic offsets during the Instantiation stage and injects them via ConceptWeaver Guidance (CWG) to enable high-fidelity, compositional synthesis and editing.
Flow-based generative models disentangle concepts naturally during a pivotal "Instantiation Stage," offering a sweet spot for targeted manipulation.
Pre-trained flow-based models excel at synthesizing complex scenes yet lack a direct mechanism for disentangling and customizing their underlying concepts from one-shot real-world sources. To demystify this process, we first introduce a novel differential probing technique to isolate and analyze the influence of individual concept tokens on the velocity field over time. This investigation yields a critical insight: the generative process is not monolithic but unfolds in three distinct stages. An initial \textbf{Blueprint Stage} establishes low-frequency structure, followed by a pivotal \textbf{Instantiation Stage} where content concepts emerge with peak intensity and become naturally disentangled, creating an optimal window for manipulation. A final concept-insensitive refinement stage then synthesizes fine-grained details. Guided by this discovery, we propose \textbf{ConceptWeaver}, a framework for one-shot concept disentanglement. ConceptWeaver learns concept-specific semantic offsets from a single reference image using a stage-aware optimization strategy that aligns with the three-stage framework. These learned offsets are then deployed during inference via our novel ConceptWeaver Guidance (CWG) mechanism, which strategically injects them at the appropriate generative stage. Extensive experiments validate that ConceptWeaver enables high-fidelity, compositional synthesis and editing, demonstrating that understanding and leveraging the intrinsic, staged nature of flow models is key to unlocking precise, multi-granularity content manipulation.