BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation

Hosseyni, S. Rohollah; Rahmani, Ali Ahmad; Seyedmohammadi, S. Jamal; Seyedin, Sanaz; Mohammadi, Arash

arXiv:2409.10847 (cs)

[Submitted on 17 Sep 2024]

Abstract:Autoregressive models excel in modeling sequential dependencies by enforcing causal constraints, yet they struggle to capture complex bidirectional patterns due to their unidirectional nature. In contrast, mask-based models leverage bidirectional context, enabling richer dependency modeling. However, they often assume token independence during prediction, which undermines the modeling of sequential dependencies. Additionally, the corruption of sequences through masking or absorption can introduce unnatural distortions, complicating the learning process. To address these issues, we propose Bidirectional Autoregressive Diffusion (BAD), a novel approach that unifies the strengths of autoregressive and mask-based generative models. BAD utilizes a permutation-based corruption technique that preserves the natural sequence structure while enforcing causal dependencies through randomized ordering, enabling the effective capture of both sequential and bidirectional relationships. Comprehensive experiments show that BAD outperforms autoregressive and mask-based models in text-to-motion generation, suggesting a novel pre-training strategy for sequence modeling. The codebase for BAD is available on this https URL.
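
The abstract describes enforcing causal dependencies through a randomized ordering rather than a fixed left-to-right one. One generic way to picture that idea is an attention mask built from a random permutation: each position may see only the positions that come no later than itself in the sampled order. The sketch below is illustrative only and is not taken from the BAD codebase; the function name `permuted_causal_mask` and the choice to let each position attend to itself are assumptions made for this example.

```python
import random


def permuted_causal_mask(n, rng):
    """Sample a random generation order over n positions and build its mask.

    Hypothetical helper for illustration, not the authors' implementation.
    mask[i][j] is True when position i may attend to position j, i.e. when
    j is generated no later than i in the sampled order.
    """
    order = list(range(n))
    rng.shuffle(order)  # order[k] is the position generated at step k

    rank = [0] * n
    for step, pos in enumerate(order):
        rank[pos] = step  # rank[pos] is the step at which pos is generated

    mask = [[rank[j] <= rank[i] for j in range(n)] for i in range(n)]
    return order, mask


if __name__ == "__main__":
    order, mask = permuted_causal_mask(5, random.Random(0))
    print("generation order:", order)
    for row in mask:
        print("".join("x" if visible else "." for visible in row))
```

With the identity order this reduces to the usual lower-triangular causal mask; with a random order, a position can depend on tokens both to its left and to its right in the sequence, while the dependencies stay causal with respect to the sampled order.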

Subjects:	Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2409.10847 [cs.CL]
	(or arXiv:2409.10847v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2409.10847

Submission history

From: Arash Mohammadi
[v1] Tue, 17 Sep 2024 02:28:19 UTC (2,652 KB)
