scholar.google.com › citations
Abstract—Aiming at the classification of indoor scene im- ages, a multi-modal fusion model is proposed. Firstly, based on the scene image and its semantic ...
Dec 23, 2021 · We propose a model based on fusion of transcribed speech to text and visual features, which is used for classification on a novel dataset of ...
The focus of the paper is on studying five different methods to combine multi-view data from an uncalibrated smart camera network for human activity recognition ...
People also ask
What is multi modal fusion?
What is the scene based image classification?
Indoor scene recognition based on RGBD camera has at- tracted increasingly attention due to its wide applications in computer vision and robotics and the ...
Dec 23, 2021 · We propose a model based on fusion of transcribed speech to text and visual features, ... Multi-modal Deep Learning for Indoor Scene Recognition ...
[17] propose a novel enhanced spectral fusion network for hyperspectral image classification. Based on the fusion of different spectral strides, the model is ...
Jul 8, 2024 · In this paper, we introduce a novel dataset specifically designed to explore the fusion of multiple modalities for various tasks: image ...
In this work, we propose an effective multi-modal RGB-D scene recognition model that integrates global or local multi-scale/multi-semantic features. The ...
Apr 24, 2024 · Traditionally, the methods of multimodal data fusion are classified into four categories, based on the conventional fusion taxonomy shown in ...
Dec 23, 2021 · ... scene recognition techniques and applications. We propose a model based on fusion of transcribed speech to text and visual features, which ...