Dynamic Typography: Bringing Text to Life via Video Diffusion Prior

Liu, Zichen; Meng, Yihao; Ouyang, Hao; Yu, Yue; Zhao, Bolin; Cohen-Or, Daniel; Qu, Huamin

Computer Science > Computer Vision and Pattern Recognition

arXiv:2404.11614v2 (cs)

[Submitted on 17 Apr 2024 (v1), revised 18 Apr 2024 (this version, v2), latest version 5 Nov 2024 (v3)]

Title:Dynamic Typography: Bringing Text to Life via Video Diffusion Prior

Authors:Zichen Liu, Yihao Meng, Hao Ouyang, Yue Yu, Bolin Zhao, Daniel Cohen-Or, Huamin Qu

View PDF HTML (experimental)

Abstract:Text animation serves as an expressive medium, transforming static communication into dynamic experiences by infusing words with motion to evoke emotions, emphasize meanings, and construct compelling narratives. Crafting animations that are semantically aware poses significant challenges, demanding expertise in graphic design and animation. We present an automated text animation scheme, termed "Dynamic Typography", which combines two challenging tasks. It deforms letters to convey semantic meaning and infuses them with vibrant movements based on user prompts. Our technique harnesses vector graphics representations and an end-to-end optimization-based framework. This framework employs neural displacement fields to convert letters into base shapes and applies per-frame motion, encouraging coherence with the intended textual concept. Shape preservation techniques and perceptual loss regularization are employed to maintain legibility and structural integrity throughout the animation process. We demonstrate the generalizability of our approach across various text-to-video models and highlight the superiority of our end-to-end methodology over baseline methods, which might comprise separate tasks. Through quantitative and qualitative evaluations, we demonstrate the effectiveness of our framework in generating coherent text animations that faithfully interpret user prompts while maintaining readability. Our code is available at: this https URL.

Comments:	Our demo page is available at: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.11614 [cs.CV]
	(or arXiv:2404.11614v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.11614

Submission history

From: Yue Yu [view email]
[v1] Wed, 17 Apr 2024 17:59:55 UTC (4,317 KB)
[v2] Thu, 18 Apr 2024 06:06:29 UTC (3,279 KB)
[v3] Tue, 5 Nov 2024 13:16:54 UTC (5,756 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Dynamic Typography: Bringing Text to Life via Video Diffusion Prior

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Dynamic Typography: Bringing Text to Life via Video Diffusion Prior

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators