BrandFusion is a multi-agent framework for integrating advertiser brands into text-to-video (T2V) generated content while balancing three goals: prompt fidelity, brand recognizability, and natural integration. An offline phase builds a Brand Knowledge Base by probing model priors and fine-tuning on novel brands; an online phase uses five agents that iteratively refine the user's prompt. In experiments on 18 established and 2 custom brands across multiple T2V models, BrandFusion significantly outperforms baselines on semantic preservation, brand recognizability, and integration naturalness, and human evaluations confirm higher user satisfaction.
Finally, a way to inject brands into AI-generated videos without sacrificing quality, opening the door to T2V monetization.
The rapid advancement of text-to-video (T2V) models has revolutionized content creation, yet their commercial potential remains largely untapped. We introduce, for the first time, the task of seamless brand integration in T2V: automatically embedding advertiser brands into prompt-generated videos while preserving semantic fidelity to user intent. This task confronts three core challenges: maintaining prompt fidelity, ensuring brand recognizability, and achieving contextually natural integration. To address them, we propose BrandFusion, a novel multi-agent framework comprising two synergistic phases. In the offline phase (advertiser-facing), we construct a Brand Knowledge Base by probing model priors and adapting to novel brands via lightweight fine-tuning. In the online phase (user-facing), five agents jointly and iteratively refine user prompts, leveraging the shared knowledge base and real-time contextual tracking to ensure brand visibility and semantic alignment. Experiments on 18 established and 2 custom brands across multiple state-of-the-art T2V models demonstrate that BrandFusion significantly outperforms baselines in semantic preservation, brand recognizability, and integration naturalness. Human evaluations further confirm higher user satisfaction, establishing a practical pathway for sustainable T2V monetization.
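
The online phase described above can be pictured as a loop in which agents take turns editing a draft prompt until it both mentions the brand and still contains the user's request. The sketch below is illustrative only and is not the authors' implementation: the abstract does not name the five agents, so the two agents here (`insert_brand`, `restore_intent`), the `BrandEntry` and `Context` types, the substring-based acceptance check, and the brand "Acme" are all hypothetical stand-ins.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class BrandEntry:
    """Hypothetical knowledge-base record produced by the offline phase."""
    name: str
    known_to_model: bool      # assumed outcome of probing the T2V model's priors
    visual_cues: list[str]    # assumed descriptors that help the brand render


@dataclass
class Context:
    """Shared state the agents read and update (assumed structure)."""
    user_prompt: str
    brand: BrandEntry
    draft: str
    history: list[str] = field(default_factory=list)


Agent = Callable[[Context], Context]


def insert_brand(ctx: Context) -> Context:
    # Add the brand and its visual cues if the draft does not mention it yet.
    if ctx.brand.name.lower() not in ctx.draft.lower():
        cues = ", ".join(ctx.brand.visual_cues)
        ctx.draft = f"{ctx.draft}, with {ctx.brand.name} branding visible ({cues})"
        ctx.history.append("insert_brand")
    return ctx


def restore_intent(ctx: Context) -> Context:
    # Re-attach the original request if an earlier edit dropped it.
    if ctx.user_prompt.lower() not in ctx.draft.lower():
        ctx.draft = f"{ctx.user_prompt}. {ctx.draft}"
        ctx.history.append("restore_intent")
    return ctx


def accepted(ctx: Context) -> bool:
    # Toy stand-in for the fidelity and recognizability checks.
    draft = ctx.draft.lower()
    return ctx.brand.name.lower() in draft and ctx.user_prompt.lower() in draft


def refine(user_prompt: str, brand: BrandEntry,
           agents: list[Agent], max_rounds: int = 3) -> Context:
    # Run the agents in rounds until the draft is accepted or rounds run out.
    ctx = Context(user_prompt, brand, draft=user_prompt)
    for _ in range(max_rounds):
        if accepted(ctx):
            break
        for agent in agents:
            ctx = agent(ctx)
    return ctx


knowledge_base = {"acme": BrandEntry("Acme", False, ["red logo"])}
result = refine("a dog runs on a beach", knowledge_base["acme"],
                [insert_brand, restore_intent])
print(result.draft)
```

A real system would replace the substring checks with model-based scoring of fidelity, recognizability, and naturalness; the point here is only the shape of the loop: shared state, agents that each make one kind of edit, and a stopping condition.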