FLAME: Free-form Language-based Motion Synthesis & Editing

Kim, Jihoon; Kim, Jiseob; Choi, Sungjoon

Computer Science > Computer Vision and Pattern Recognition

arXiv:2209.00349 (cs)

[Submitted on 1 Sep 2022 (v1), last revised 1 Jan 2023 (this version, v2)]

Title:FLAME: Free-form Language-based Motion Synthesis & Editing

Authors:Jihoon Kim, Jiseob Kim, Sungjoon Choi

View PDF

Abstract:Text-based motion generation models are drawing a surge of interest for their potential for automating the motion-making process in the game, animation, or robot industries. In this paper, we propose a diffusion-based motion synthesis and editing model named FLAME. Inspired by the recent successes in diffusion models, we integrate diffusion-based generative models into the motion domain. FLAME can generate high-fidelity motions well aligned with the given text. Also, it can edit the parts of the motion, both frame-wise and joint-wise, without any fine-tuning. FLAME involves a new transformer-based architecture we devise to better handle motion data, which is found to be crucial to manage variable-length motions and well attend to free-form text. In experiments, we show that FLAME achieves state-of-the-art generation performances on three text-motion datasets: HumanML3D, BABEL, and KIT. We also demonstrate that editing capability of FLAME can be extended to other tasks such as motion prediction or motion in-betweening, which have been previously covered by dedicated models.

Comments:	AAAI 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
Cite as:	arXiv:2209.00349 [cs.CV]
	(or arXiv:2209.00349v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2209.00349

Submission history

From: Jihoon Kim [view email]
[v1] Thu, 1 Sep 2022 10:34:57 UTC (13,107 KB)
[v2] Sun, 1 Jan 2023 11:46:43 UTC (13,084 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:FLAME: Free-form Language-based Motion Synthesis & Editing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:FLAME: Free-form Language-based Motion Synthesis & Editing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators