Abstract—Aiming at the classification of indoor scene images, a multi-modal fusion model is proposed. Firstly, based on the scene image and its semantic ...
Dec 23, 2021 · We propose a model based on fusion of transcribed speech to text and visual features, which is used for classification on a novel dataset of ...
The focus of the paper is on studying five different methods to combine multi-view data from an uncalibrated smart camera network for human activity recognition ...
Indoor scene recognition based on RGBD cameras has attracted increasing attention due to its wide applications in computer vision and robotics and the ...
Dec 23, 2021 · We propose a model based on fusion of transcribed speech to text and visual features, ... Multi-modal Deep Learning for Indoor Scene Recognition ...
[17] propose a novel enhanced spectral fusion network for hyperspectral image classification. Based on the fusion of different spectral strides, the model is ...
Jul 8, 2024 · In this paper, we introduce a novel dataset specifically designed to explore the fusion of multiple modalities for various tasks: image ...
In this work, we propose an effective multi-modal RGB-D scene recognition model that integrates global or local multi-scale/multi-semantic features. The ...
Apr 24, 2024 · Traditionally, the methods of multimodal data fusion are classified into four categories, based on the conventional fusion taxonomy shown in ...
Dec 23, 2021 · ... scene recognition techniques and applications. We propose a model based on fusion of transcribed speech to text and visual features, which ...