Search papers, labs, and topics across Lattice.
The paper introduces EasyControlEdge, a method for adapting image-generation foundation models to edge detection tasks, focusing on crispness and data efficiency. They fine-tune a foundation model using an edge-oriented objective with a pixel-space loss and introduce guidance based on unconditional dynamics to control edge density at inference. Experiments on standard edge detection datasets demonstrate improved performance, especially in crispness and data efficiency, compared to state-of-the-art methods.
Achieve state-of-the-art edge detection, especially in crispness and data efficiency, by creatively repurposing image-generation foundation models.
We propose EasyControlEdge, adapting an image-generation foundation model to edge detection. In real-world edge detection (e.g., floor-plan walls, satellite roads/buildings, and medical organ boundaries), crispness and data efficiency are crucial, yet producing crisp raw edge maps with limited training samples remains challenging. Although image-generation foundation models perform well on many downstream tasks, their pretrained priors for data-efficient transfer and iterative refinement for high-frequency detail preservation remain underexploited for edge detection. To enable crisp and data-efficient edge detection using these capabilities, we introduce an edge-specialized adaptation of image-generation foundation models. To better specialize the foundation model for edge detection, we incorporate an edge-oriented objective with an efficient pixel-space loss. At inference, we introduce guidance based on unconditional dynamics, enabling a single model to control the edge density through a guidance scale. Experiments on BSDS500, NYUDv2, BIPED, and CubiCasa compare against state-of-the-art methods and show consistent gains, particularly under no-post-processing crispness evaluation and with limited training data.