Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJune 2023
A Recurrent Neural Network based Generative Adversarial Network for Long Multivariate Time Series Forecasting
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 181–189https://doi.org/10.1145/3591106.3592306Some multimedia data from real life can be collected as multivariate time series data, such as community-contributed social data or sensor data. Many methods have been proposed for multivariate time series forecasting. In light of its importance in wide ...
- abstractJune 2023
Algorithms for Generating and Evaluating Visually Sorted Grid Layouts
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 672–673https://doi.org/10.1145/3591106.3592305The increasing amount of visual data shared online highlights the importance of organizing and finding related content. However, current efforts to improve visual search and image classification lack support for exploratory image search. Sorting images ...
- abstractJune 2023
MAD ’23 Workshop: Multimedia AI against Disinformation
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 676–677https://doi.org/10.1145/3591106.3592303With recent advancements in synthetic media manipulation and generation, verifying multimedia content posted online has become increasingly difficult. Additionally, the malicious exploitation of AI technologies by actors to disseminate disinformation on ...
- abstractJune 2023
ICDAR’23: Intelligent Cross-Data Analysis and Retrieval
- Guillaume Habault,
- Minh-Son Dao,
- Michael Alexander Riegler,
- Duc Tien Dang Nguyen,
- Yuta Nakashima,
- Cathal Gurrin
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 674–675https://doi.org/10.1145/3591106.3592302Recently, there has been an increased interest in cross-data research problems, such as predicting air quality using life logging images, predicting congestion using weather and tweets data, and predicting sleep quality using daily exercises and meals. ...
- keynoteJune 2023
Recognizing Actions in Videos under Domain Shift
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPage 671https://doi.org/10.1145/3591106.3592301Action recognition, which consists in automatically recognizing the action being performed in a video sequence, is a fundamental task in computer vision and multimedia. Supervised action recognition has been widely studied because of the growing need for ...
-
- research-articleJune 2023
Learning with Adaptive Knowledge for Continual Image-Text Modeling
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 472–480https://doi.org/10.1145/3591106.3592297In realistic application scenarios, existing methods for image-text modeling have limitations in dealing with data stream: training on all data needs too much computation/storage resources, and even the full access to previous data is invalid. In this ...
- research-articleJune 2023
A Robust Deep Learning Enhanced Monocular SLAM System for Dynamic Environments
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 508–515https://doi.org/10.1145/3591106.3592295Simultaneous Localization and Mapping (SLAM) has developed as a fundamental method for intelligent robot perception over the past decades. Most of the existing feature-based SLAM systems relied on traditional hand-crafted visual features and a strong ...
- short-paperJune 2023
Escaping local minima in deep reinforcement learning for video summarization
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 530–534https://doi.org/10.1145/3591106.3592288State-of-the-art deep neural unsupervised video summarization methods mostly fall under the adversarial reconstruction framework. This employs a Generative Adversarial Network (GAN) structure and Long Short-Term Memory (LSTM) autoencoders during its ...
- research-articleJune 2023
Zero-shot Sketch-based Image Retrieval with Adaptive Balanced Discriminability and Generalizability
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 407–415https://doi.org/10.1145/3591106.3592287Zero-shot sketch-based image retrieval (ZS-SBIR) is a task that learns semantic knowledge and embedding extraction to retrieve similar images using a sketch without any training examples of unseen classes. Existing methods have attempted to address the ...
- short-paperJune 2023
Offensive Tactics Recognition in Broadcast Basketball Videos Based on 2D Camera View Player Heatmaps
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 571–575https://doi.org/10.1145/3591106.3592285It is essential for sports teams to review their offensive and defensive tactical execution performance as well as understand their opponents’ tactics in order to identify effective counterattack strategies. This study focuses on basketball offensive ...
- research-articleJune 2023
Dual-Modality Co-Learning for Unveiling Deepfake in Spatio-Temporal Space
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 85–94https://doi.org/10.1145/3591106.3592284The emergence of photo-realistic deepfakes on a large scale has become a significant societal concern, which has garnered considerable attention from the research community. Several recent studies have identified the critical issue of “temporal ...
- research-articleJune 2023
SIGMA-DF: Single-Side Guided Meta-Learning for Deepfake Detection
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 153–161https://doi.org/10.1145/3591106.3592282The current challenge of Deepfake detection is the cross-domain performance on unseen Deepfake data. Instead of extracting forgery artifacts that are robust to the cross-domain scenarios as most previous works, we propose a novel method named Single-...
- research-articleJune 2023
Framing the News: From Human Perception to Large Language Model Inferences
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 627–635https://doi.org/10.1145/3591106.3592278Identifying the frames of news is important to understand the articles’ vision, intention, message to be conveyed, and which aspects of the news are emphasized. Framing is a widely studied concept in journalism, and has emerged as a new topic in ...
- research-articleJune 2023
Multi-channel Convolutional Neural Network for Precise Meme Classification
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 190–198https://doi.org/10.1145/3591106.3592275This paper proposes a multi-channel convolutional neural network (MC-CNN) for classifying memes and non-memes. Our architecture is trained and validated on a challenging dataset that includes non-meme formats with textual attributes, which are also ...
- research-articleJune 2023
Knowledge-Aware Causal Inference Network for Visual Dialog
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 253–261https://doi.org/10.1145/3591106.3592272The effective knowledge and interaction within multi-modalities are key to Visual Dialog. Classic graph-based framework with the direct connection between history dialog and answer fails to give the right answer for the spurious guidance and strong bias ...
- research-articleJune 2023
Multi-modal Fake News Detection on Social Media via Multi-grained Information Fusion
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 343–352https://doi.org/10.1145/3591106.3592271The easy sharing of multimedia content on social media has caused a rapid dissemination of fake news, which threatens society’s stability and security. Therefore, fake news detection has garnered extensive research interest in the field of social ...
- research-articleJune 2023
TsP-Tran: Two-Stage Pure Transformer for Multi-Label Image Retrieval
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 425–433https://doi.org/10.1145/3591106.3592269Image retrieval aims to find similar images given the query. Most of existing retrieval works are based on the pre-trained model of single-label image classification. In practice, the query usually contains more than one instance, and the single label ...
- research-articleJune 2023
TDEC: Deep Embedded Image Clustering with Transformer and Distribution Information
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 280–288https://doi.org/10.1145/3591106.3592268Image clustering is a crucial but challenging task in multimedia machine learning. Recently the combination of clustering with deep learning has achieved promising performance against conventional methods on high-dimensional image data. Unfortunately, ...
- research-articleJune 2023
Multi-Label Meta Weighting for Long-Tailed Dynamic Scene Graph Generation
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 39–47https://doi.org/10.1145/3591106.3592267This paper investigates the problem of scene graph generation in videos with the aim of capturing semantic relations between subjects and objects in the form of ⟨ subject, predicate, object⟩ triplets. Recognizing the predicate between subject and object ...
- research-articleJune 2023
Improving Image Encoders for General-Purpose Nearest Neighbor Search and Classification
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 57–66https://doi.org/10.1145/3591106.3592266Recent advances in computer vision research led to large vision foundation models that generalize to a broad range of image domains and perform exceptionally well in various image based tasks. However, content-based image-to-image retrieval is often ...