Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
×
Oct 23, 2022 · In this work we introduce transformer based modality fusion techniques, which unify multi-modal data at an early stage. Our Anticipative Feature ...
We propose a transformer based feature fusion model to effectively fuse multiple modalities, and follow [20] to use a generative language model for fu- ture ...
Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation (WACV 2023). PWC. This repository contains the official source code and data for our ...
We propose a transformer based feature fusion model to effectively fuse multiple modalities, and follow [20] to use a generative language model for fu- ture ...
The Anticipative Feature Fusion Transformer (AFFT) proves to be superior to popular score fusion approaches and presents state-of-the-art results ...
While most of the works explore it in a unimodal setting by using the visual modality, other works also present a multi-modal approach for this task by using ...
Jan 23, 2024 · Therefore, we propose a Multi-modal Anticipative Transformer (MAT), an attention-based video transformer architecture that jointly learns from ...
18.5. Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation. 2022. 4. MeMViT-24. 17.7. MeMViT: Memory-Augmented Multiscale Vision ...
In this work we introduce transformer based modality fusion techniques, which unify multi-modal data at an early stage. Our Anticipative Feature Fusion ...
Action Anticipation on EPIC-KITCHENS-100 (test) ; 3. AFFT. 14.9. Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation ; 4. Abstract Goal.