Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleAugust 2024
Detection of Osseous Metastasis From Bone Scintigrams Using a Combined Global and Local Patch-Based Deep Learning Model
- Swailem Neil Angelo Lumba,
- Emmanuel Linus Evangelista,
- Kyla Sydney Martin,
- Raphael Alampay,
- Patricia Angela Abu
RCVE '24: Proceedings of the 2024 2nd International Conference on Robotics, Control and Vision EngineeringPages 36–40https://doi.org/10.1145/3685073.3685080Osseous metastasis, or bone metastasis, refers to the spread of cancer cells from their primary site to the bones, often considered by professionals as an indication that cancer has advanced to a level in which it can no longer be cured. As such, early ...
- extended-abstractJune 2024
Promoting Green Fashion Consumption in Recommender Systems
UMAP Adjunct '24: Adjunct Proceedings of the 32nd ACM Conference on User Modeling, Adaptation and PersonalizationPages 50–54https://doi.org/10.1145/3631700.3664922The fashion industry is a significant contributor to global carbon emissions, water consumption, and waste generation. My Ph.D. project explores two approaches to promote sustainability in fashion recommender systems. First, I aim to develop algorithms ...
- research-articleApril 2024
Sniffer Faster R-CNN ++: An Efficient Camera-LiDAR Object Detector with Proposal Refinement on Fused Candidates
ACM Journal on Autonomous Transportation Systems (JATS), Volume 1, Issue 2Article No.: 6, Pages 1–18https://doi.org/10.1145/3631138In this article, we present Sniffer Faster R-CNN++, an efficient camera-LiDAR late fusion network for low complexity and accurate object detection in autonomous driving scenarios. The proposed detection network architecture operates on output candidates ...
- short-paperJuly 2023
Simplifying Content-Based Neural News Recommendation: On User Modeling and Training Objectives
SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information RetrievalPages 2384–2388https://doi.org/10.1145/3539618.3592062The advent of personalized news recommendation has given rise to increasingly complex recommender architectures. Most neural news recommenders rely on user click behavior and typically introduce dedicated user encoders that aggregate the content of ...
- ArticleSeptember 2023
On Comparing Early and Late Fusion Methods
AbstractThis paper presents a theoretical comparison of early and late fusion methods. An initial discussion on the conditions to apply early or late (soft or hard) fusion is introduced. The analysis show that, if large training sets are available, early ...
-
- research-articleJune 2023
Image–Text Multimodal Sentiment Analysis Framework of Assamese News Articles Using Late Fusion
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 22, Issue 6Article No.: 161, Pages 1–30https://doi.org/10.1145/3584861Before the arrival of the web as a corpus, people detected positive and negative news based on the understanding of the textual content from physical newspaper rather than an automatic identification approach from readily available e-newspapers. Thus, the ...
- research-articleJanuary 2023
Developing a late fusion of multi facial components for facial recognition with a voting method and global weights
International Journal of Computational Vision and Robotics (IJCVR), Volume 13, Issue 6Pages 619–640https://doi.org/10.1504/ijcvr.2023.134314With the development of deep learning, many solutions have achieved outstanding performance in solving facial recognition problems. Nevertheless, many challenges still stand, such as occluded face or illumination. This paper proposes a late fusion of many ...
- short-paperNovember 2022
Investigating Transformer Encoders and Fusion Strategies for Speech Emotion Recognition in Emergency Call Center Conversations.
ICMI '22 Companion: Companion Publication of the 2022 International Conference on Multimodal InteractionPages 144–153https://doi.org/10.1145/3536220.3558038There has been growing interest in using deep learning techniques to recognize emotions from speech. However, real-life emotion datasets collected in call centers are relatively rare and small, making the use of deep learning techniques quite ...
- research-articleOctober 2022
Influence of Late Fusion of High-Level Features on User Relevance Feedback for Videos
IMuR '22: Proceedings of the 2nd International Workshop on Interactive Multimedia RetrievalPages 17–24https://doi.org/10.1145/3552467.3554795Content-based media retrieval relies on multimodal data representations. For videos, these representations mainly focus on the textual, visual, and audio modalities. While the modality representations can be used individually, combining their ...
- research-articleOctober 2022
Efficient Anchor Learning-based Multi-view Clustering -- A Late Fusion Method
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 3685–3693https://doi.org/10.1145/3503161.3548124Anchor enhanced multi-view late fusion clustering has attracted numerous researchers' attention for its high clustering accuracy and promising efficiency. However, in the existing methods, the anchor points are usually generated through sampling or ...
- research-articleOctober 2022
Continual Multi-view Clustering
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 3676–3684https://doi.org/10.1145/3503161.3547864With the increase of multimedia applications, data are often collected from multiple sensors or modalities, encouraging the rapid development of multi-view (also called multi modal) clustering technique. As a representative, late fusion multi-view ...
- research-articleOctober 2021
Fusion of Acoustic and Linguistic Information using Supervised Autoencoder for Improved Emotion Recognition
MuSe '21: Proceedings of the 2nd on Multimodal Sentiment Analysis ChallengePages 51–59https://doi.org/10.1145/3475957.3484448Automatic recognition of human emotion has a wide range of applications and has always attracted increasing attention. Expressions of human emotions can apparently be identified across different modalities of communication, such as speech, text, mimics, ...
- research-articleOctober 2021
Multi-view Clustering via Deep Matrix Factorization and Partition Alignment
MM '21: Proceedings of the 29th ACM International Conference on MultimediaPages 4156–4164https://doi.org/10.1145/3474085.3475548Multi-view clustering (MVC) has been extensively studied to collect multiple source information in recent years. One typical type of MVC methods is based on matrix factorization to effectively perform dimension reduction and clustering. However, the ...
- research-articleOctober 2021
One-Stage Incomplete Multi-view Clustering via Late Fusion
MM '21: Proceedings of the 29th ACM International Conference on MultimediaPages 2717–2725https://doi.org/10.1145/3474085.3475204As a representative of multi-view clustering (MVC), late fusion MVC (LF-MVC) algorithm has attracted intensive attention due to its superior clustering accuracy and high computational efficiency. One common assumption adopted by existing LF-MVC ...
- research-articleJune 2021
Sequential Late Fusion Technique for Multi-modal Sentiment Analysis
PETRA '21: Proceedings of the 14th PErvasive Technologies Related to Assistive Environments ConferencePages 264–265https://doi.org/10.1145/3453892.3461009Multi-modal sentiment analysis plays an important role for providing better interactive experiences to users. Each modality in multi-modal data can provide different viewpoints or reveal unique aspects of a user’s emotional state. In this work, we use ...
- research-articleOctober 2020
You Have a Point There: Object Selection Inside an Automobile Using Gaze, Head Pose and Finger Pointing
ICMI '20: Proceedings of the 2020 International Conference on Multimodal InteractionPages 595–603https://doi.org/10.1145/3382507.3418836Sophisticated user interaction in the automotive industry is a fast emerging topic. Mid-air gestures and speech already have numerous applications for driver-car interaction. Additionally, multimodal approaches are being developed to leverage the use of ...
- short-paperJune 2020
System Fusion with Deep Ensembles
ICMR '20: Proceedings of the 2020 International Conference on Multimedia RetrievalPages 256–260https://doi.org/10.1145/3372278.3390720Deep neural networks (DNNs) are universal estimators that have achieved state-of-the-art performance in a broad spectrum of classification tasks, opening new perspectives for many applications. One of them is addressing ensemble learning. In this paper, ...
- abstractOctober 2019
Multimodal Driver Interaction with Gesture, Gaze and Speech
ICMI '19: 2019 International Conference on Multimodal InteractionPages 487–492https://doi.org/10.1145/3340555.3356093The ever-growing research in computer vision has created new avenues for user interaction. Speech commands and gesture recognition are already being applied in various touch-based inputs. It is, therefore, foreseeable, that the use of multimodal input ...
- research-articleAugust 2019
Fusing of Medical Images and Reports in Diagnostics of Brain Diseases
- Aleksandra Vatian,
- Natalia Gusarova,
- Natalia Dobrenko,
- Anton Klochkov,
- Niyaz Nigmatullin,
- Artem Lobantsev,
- Anatoly Shalyto
PRAI '19: Proceedings of the 2019 the International Conference on Pattern Recognition and Artificial IntelligencePages 102–108https://doi.org/10.1145/3357777.3357793The combination of MRI images with textual clinical records, has a great potential since the former contains a raw information about study area of the human body, and the latter contains a human integral assessment of the image performed by doctor. In ...
- research-articleJune 2018
A Context-Aware Late-Fusion Approach for Disaster Image Retrieval from Social Media
ICMR '18: Proceedings of the 2018 ACM on International Conference on Multimedia RetrievalPages 266–273https://doi.org/10.1145/3206025.3206047Natural disasters, especially those related to flooding, are global issues that attract a lot of attention in many parts of the world. A series of research ideas focusing on combining heterogeneous data sources to monitor natural disasters have been ...