This paper introduces AFS-Search, a training-free closed-loop framework for text-to-image generation that addresses limitations in relational reasoning and error accumulation in open-loop sampling. AFS-Search uses a Vision-Language Model (VLM) as a semantic critic to diagnose intermediate latents and dynamically steer the velocity field, formulating T2I generation as a sequential decision-making process with parallel rollout search. Experiments demonstrate that AFS-Search-Pro achieves state-of-the-art results on multiple benchmarks, while AFS-Search-Fast maintains fast generation speed with significant performance improvements.
Closed-loop feedback using VLMs can dramatically improve text-to-image generation quality, even without additional training.
Precise Text-to-Image (T2I) generation has achieved great success but is hindered by the limited relational reasoning of static text encoders and by error accumulation in open-loop sampling. Without real-time feedback, initial semantic ambiguities along the ordinary differential equation (ODE) sampling trajectory inevitably escalate into stochastic deviations from spatial constraints. To bridge this gap, we introduce AFS-Search (Agentic Flow Steering and Parallel Rollout Search), a training-free closed-loop framework built upon FLUX.1-dev. AFS-Search combines parallel rollout search with a flow steering mechanism that uses a Vision-Language Model (VLM) as a semantic critic to diagnose intermediate latents and dynamically steer the velocity field via precise spatial grounding. Complementarily, we formulate T2I generation as a sequential decision-making process, exploring multiple trajectories through lookahead simulations and selecting the optimal path based on VLM-guided rewards. We further provide two variants: AFS-Search-Pro for higher performance and AFS-Search-Fast for quicker generation. Experimental results show that AFS-Search-Pro greatly boosts the performance of the original FLUX.1-dev, achieving state-of-the-art results across three benchmarks. Meanwhile, AFS-Search-Fast also significantly enhances performance while maintaining fast generation speed.
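The closed-loop idea in the abstract can be illustrated with a toy sketch: an Euler sampler that, at each step, proposes several steered velocities, simulates each one forward to the end of the trajectory, and keeps the step whose lookahead result the critic scores highest. Everything here is an assumption made for illustration, not the authors' implementation: the latent is a short list of floats rather than a FLUX.1-dev latent, `toy_critic` stands in for the VLM reward, and steering candidates are random offsets rather than the VLM's spatially grounded corrections. The names `steered_sample`, `rollout`, `toy_velocity`, and `toy_critic` are hypothetical.

```python
import random
from typing import Callable, List

Vector = List[float]

TARGET: Vector = [1.0, -1.0]   # hypothetical "prompt-consistent" latent
BIASED: Vector = [0.6, -0.2]   # where the unsteered flow actually ends up heading


def toy_velocity(x: Vector, t: float) -> Vector:
    """Stand-in velocity field: pulls the latent toward a slightly wrong point."""
    return [b - xi for xi, b in zip(x, BIASED)]


def toy_critic(x: Vector) -> float:
    """Stand-in for the VLM reward: higher means closer to TARGET."""
    return -sum((xi - ti) ** 2 for xi, ti in zip(x, TARGET))


def euler_step(x: Vector, v: Vector, dt: float) -> Vector:
    return [xi + dt * vi for xi, vi in zip(x, v)]


def rollout(x: Vector, t: float, dt: float, n_steps: int,
            velocity: Callable[[Vector, float], Vector]) -> Vector:
    """Open-loop lookahead: integrate the unsteered ODE for n_steps Euler steps."""
    for _ in range(n_steps):
        x = euler_step(x, velocity(x, t), dt)
        t += dt
    return x


def steered_sample(x0: Vector,
                   velocity: Callable[[Vector, float], Vector],
                   critic: Callable[[Vector], float],
                   n_steps: int = 10, n_candidates: int = 4,
                   steer_scale: float = 0.5, seed: int = 0) -> Vector:
    """Closed-loop sampling: pick each step by critic score of its lookahead rollout."""
    rng = random.Random(seed)
    dt = 1.0 / n_steps
    x, t = list(x0), 0.0
    for step in range(n_steps):
        base_v = velocity(x, t)
        # Candidate 0 keeps the unsteered velocity, so steering can never
        # score worse than continuing open-loop from the current state.
        candidates = [base_v] + [
            [vi + steer_scale * rng.gauss(0.0, 1.0) for vi in base_v]
            for _ in range(n_candidates - 1)
        ]
        best_x, best_score = None, float("-inf")
        for v in candidates:
            x_next = euler_step(x, v, dt)
            # Lookahead: finish the trajectory without steering, then score it.
            x_final = rollout(x_next, t + dt, dt, n_steps - step - 1, velocity)
            score = critic(x_final)
            if score > best_score:
                best_x, best_score = x_next, score
        x, t = best_x, t + dt
    return x


if __name__ == "__main__":
    start = [0.0, 0.0]
    open_loop = rollout(start, 0.0, 1.0 / 10, 10, toy_velocity)
    closed_loop = steered_sample(start, toy_velocity, toy_critic)
    print("open-loop score:  ", toy_critic(open_loop))
    print("closed-loop score:", toy_critic(closed_loop))
```

Including the unsteered velocity among the candidates is a deliberate choice in this sketch: it guarantees the closed-loop result scores at least as well as the open-loop one under the same critic. The cost grows with both the number of candidates and the lookahead length, which suggests the kind of quality-versus-speed trade-off that separate Pro and Fast variants would address.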