Home > Information > News
#News ·2025-01-07
This article is reprinted with the authorization of AIGC Studio public account, please contact the source for reprinting.
Together with NetEase, Xiamen University proposed StoryWeaver, which can achieve high-quality story visualization based on a given character within a unified model. An image can be generated to match the story text and ensure that each character is consistent in different scenes. The method in this paper mainly includes the following steps:
StoryWeaver can achieve high-quality story visualization based on a given role within a unified model.
StoryWeaver: A unified world model customized for knowledge enhanced story characters
Story visualization is getting more and more attention in the field of artificial intelligence. However, existing approaches still struggle to maintain a balance between character identity preservation and textual semantic alignment, mainly due to a lack of detailed semantic modeling of story scenes.
To address this challenge, the paper proposes a new knowledge graph, the Character graph (CG), which comprehensively represents a variety of story-related knowledge, including characters, attributes associated with characters, and relationships between characters. We then introduced StoryWeaver, a custom image generator through the Character Graph (CCG) that enables consistent story visualizations with rich text semantics. In order to further improve the performance of multi-role generation, this paper combines knowledge enhanced spatial guidance (KE-SG) into StoryWeaver to precisely inject role semantics into generation.
In order to verify the validity of the proposed method, an extensive experiment is performed using a new benchmark named TBC-Bench. Experiments have confirmed that StoryWeaver is not only good at creating vivid visual storylines, but also good at accurately conveying character identities in various scenes, and has quite high storage efficiency, for example, DINO-I has an average increase of 9.03%, and CLIP-T has an average increase of 13.44%. In addition, ablation experiments were performed to verify the superiority of the proposed module.
StoryWeaver's overall framework.
a. Character-Graph is proposed to represent the semantically rich knowledge in the story world.
b. StoryWeaver is enhanced by the proposed spatial guidance to further improve the performance of multi-role generation
Visual examples of the impact of customization through character diagrams (C-CG) and knowledge-enhanced Spatial guidance (KE-SG).
a. Without C-CG, the generator would have difficulty capturing the finer granular details of the character.
b. Without KESG, generators tend to distribute attention evenly across all areas, resulting in a mix of identities.
Visual comparison of different approaches in single-character and multi-character visual storytelling. StoryWeaver excites character identity customization and well-matched semantic alignment.
(a) Single character generation example
(b) Multi-character generation example
An example of a multi-character story visualization on the Pororo dataset.
The collection of characters and samples focuses on two animated films, Boruru and Frozen. The samples included detailed descriptions of individual characters and scenes showing interactions between multiple characters.
This paper proposes a unified model, StoryWeaver, which has complex character customization functions and can be used for story visualization. This paper first proposes a novel character diagram, which encapsulates the rich semantic knowledge in the story world to enhance StoryWeaver. Then, knowledge enhanced spatial guidance is introduced to improve the cross attention map to achieve accurate multi-role generation. The experimental results show that StoryWeaver achieves better fidelity in identity customization and achieves better semantic alignment than a set of single and multiple customization methods.
2025-02-17
2025-02-14
2025-02-13
13004184443
Room 607, 6th Floor, Building 9, Hongjing Xinhuiyuan, Qingpu District, Shanghai
gcfai@dongfangyuzhe.com
WeChat official account
friend link
13004184443
立即获取方案或咨询top