The paper introduces EverTale, a story world simulator that enables continuous character customization for story visualization. It uses an All-in-One-World Character Integrator with a unified LoRA module for efficient character adaptation, a Character Quality Gate using MLLM-as-Judge to ensure character fidelity, and a Character-Aware Region-Focus Sampling strategy to improve multi-character generation. Experiments demonstrate EverTale's superior performance in both single- and multi-character story visualization compared to existing methods.
Forget finetuning a new LoRA for every character: EverTale introduces a single LoRA that adapts to *all* characters in a story, enabling continuous character customization with improved fidelity and efficiency.
Story visualization has gained increasing attention in computer vision. However, current methods often fail to combine accurate character customization, semantic alignment, and continuous integration of new identities. To tackle this challenge, we present EverTale, a story world simulator for continuous story character customization. We first propose an All-in-One-World Character Integrator that achieves continuous character adaptation within a unified LoRA module, eliminating the per-character optimization modules required by previous methods. Then, we incorporate a Character Quality Gate via MLLM-as-Judge to ensure the fidelity of each character adaptation through chain-of-thought reasoning, determining whether the model can proceed to the next character or requires additional training on the current one. We also introduce a Character-Aware Region-Focus Sampling strategy to address the identity degradation and layout conflicts in existing multi-character visual storytelling, ensuring natural and more efficient multi-character generation by harmonizing local character-specific details with global scene context. Experimental results show that EverTale achieves superior performance against a wide range of compared methods on both single- and multi-character story visualization. Code will be made available.
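The gating behavior described in the abstract (train on one character, ask a judge whether fidelity is acceptable, then either move on or keep training) can be illustrated with a minimal control-flow sketch. This is not EverTale's implementation: `SharedAdapter`, `integrate_characters`, `train_round`, `judge_passes`, and the `max_rounds` retry budget are all hypothetical names and assumptions, and the LoRA training and MLLM judging are replaced by toy stubs so the loop can run on its own.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class SharedAdapter:
    """Hypothetical stand-in for the one LoRA module shared by every character."""
    rounds_seen: Dict[str, int] = field(default_factory=dict)


def integrate_characters(
    characters: List[str],
    train_round: Callable[[SharedAdapter, str], None],
    judge_passes: Callable[[SharedAdapter, str], bool],
    max_rounds: int = 3,  # assumed retry budget; the abstract does not specify one
) -> Tuple[SharedAdapter, Dict[str, Tuple[int, bool]]]:
    """Add characters one at a time to a single adapter, gated by a judge."""
    adapter = SharedAdapter()
    report: Dict[str, Tuple[int, bool]] = {}
    for name in characters:
        rounds, passed = 0, False
        # Keep training the current character until the judge accepts it
        # or the assumed budget runs out, then move to the next character.
        while rounds < max_rounds and not passed:
            train_round(adapter, name)
            rounds += 1
            passed = judge_passes(adapter, name)
        report[name] = (rounds, passed)
    return adapter, report


# Toy stubs: "training" only counts rounds, and the "judge" accepts a
# character once it has received a fixed, made-up number of rounds.
REQUIRED_ROUNDS = {"Ava": 1, "Bo": 2, "Cy": 5}


def toy_train_round(adapter: SharedAdapter, name: str) -> None:
    adapter.rounds_seen[name] = adapter.rounds_seen.get(name, 0) + 1


def toy_judge(adapter: SharedAdapter, name: str) -> bool:
    return adapter.rounds_seen[name] >= REQUIRED_ROUNDS[name]


adapter, report = integrate_characters(["Ava", "Bo", "Cy"], toy_train_round, toy_judge)
print(report)  # {'Ava': (1, True), 'Bo': (2, True), 'Cy': (3, False)}
```

In this sketch the same `adapter` object is reused for every character, mirroring the idea of one shared module rather than one module per character. The toy character "Cy" exhausts the assumed budget without passing; how the actual method handles a character that keeps failing the gate is not stated in the abstract.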