Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJanuary 2022
Visible-Infrared Cross-Modal Person Re-identification based on Positive Feedback
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 73, Pages 1–6https://doi.org/10.1145/3469877.3497693Visible-infrared person re-identification (VI-ReID) is undoubtedly a challenging cross-modality person retrieval task with increasing appreciation. Compared to traditional person ReID that focuses on person images in a single RGB mode, VI-ReID suffers ...
- research-articleJanuary 2022
Deep Adaptive Attention Triple Hashing
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 78, Pages 1–5https://doi.org/10.1145/3469877.3495646Recent studies have verified that learning compact hash codes can facilitate big data retrieval processing. In particular, learning the deep hash function can greatly improve the retrieval performance. However, the existing deep supervised hashing ...
- research-articleJanuary 2022
Generation of Variable-Length Time Series from Text using Dynamic Time Warping-Based Method
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 76, Pages 1–7https://doi.org/10.1145/3469877.3495644This study is aimed at finding a suitable method for generating time-series data such as video clips or avatar motions from text stating multiple events. This paper addresses the generation of variable-length time-series data considering the order and ...
- research-articleJanuary 2022
Focusing Attention across Multiple Images for Multimodal Event Detection
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 74, Pages 1–6https://doi.org/10.1145/3469877.3495642Multimodal social event detection has been attracting tremendous research attention in recent years, due to that it provides comprehensive and complementary understanding of social events and is important to public security and administration. Most ...
- research-articleJanuary 2022
CFCR: A Convolution and Fusion Model for Cross-platform Recommendation
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 65, Pages 1–6https://doi.org/10.1145/3469877.3495639With the emergence of various online platforms, associating different platforms is playing an increasingly important role in many applications. Cross-platform recommendation aims to improve recommendation accuracy through associating information from ...
-
- short-paperJanuary 2022
Discovering Social Connections using Event Images
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 68, Pages 1–5https://doi.org/10.1145/3469877.3493699Social events are very common activities, where people can interact with each other. During an event, the organizer often hires photographers to take images, which provide rich information about the participants’ behaviour. In this work, we propose a ...
- short-paperJanuary 2022
SangeetXML: An XML Format for Score Retrieval for Indic Music
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 66, Pages 1–5https://doi.org/10.1145/3469877.3493697Efficient retrieval of score information from a large set of XML-encoded scores and lyrics in an XML database requires such music data to be stored in a well-structured and systematic technique. Current search engines for Indic music (Tagore songs in ...
- short-paperJanuary 2022
Hybrid Improvements in Multimodal Analysis for Deep Video Understanding
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 69, Pages 1–5https://doi.org/10.1145/3469877.3493599The Deep Video Understanding Challenge (DVU) is a task that focuses on comprehending long duration videos which involve many entities. Its main goal is to build relationship and interaction knowledge graph between entities to answer relevant questions. ...
- short-paperJanuary 2022
An Embarrassingly Simple Approach to Discrete Supervised Hashing
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 56, Pages 1–5https://doi.org/10.1145/3469877.3493595Prior hashing works typically learn a projection function from high-dimensional visual feature space to low-dimensional latent space. However, such a projection function remains several crucial bottlenecks: 1) information loss and coding redundancy are ...
- short-paperJanuary 2022
Conditioned Image Retrieval for Fashion using Contrastive Learning and CLIP-based Features
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 54, Pages 1–5https://doi.org/10.1145/3469877.3493593Building on the recent advances in multimodal zero-shot representation learning, in this paper we explore the use of features obtained from the recent CLIP model to perform conditioned image retrieval. Starting from a reference image and an additive ...
- short-paperJanuary 2022
Deep Multiple Length Hashing via Multi-task Learning
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 52, Pages 1–5https://doi.org/10.1145/3469877.3493591Hashing can compress heterogeneous high-dimensional data into compact binary codes. For most existing hash methods, they first predetermine a fixed length for the hash code and then train the model based on this fixed length. However, when the task ...
- demonstrationJanuary 2022
RoadAtlas: Intelligent Platform for Automated Road Defect Detection and Asset Management
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 62, Pages 1–3https://doi.org/10.1145/3469877.3493589With the rapid development of intelligent detection algorithms based on deep learning, much progress has been made in automatic road defect recognition and road marking parsing. This can effectively address the issue of an expensive and time-consuming ...
- demonstrationJanuary 2022
Private-Share: A Secure and Privacy-Preserving De-Centralized Framework for Large Scale Data Sharing
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 61, Pages 1–3https://doi.org/10.1145/3469877.3493588The various data and privacy regulations introduced around the globe, require data to be stored in a secure and privacy-preserving fashion. Non-compliance with these regulations come with major consequences. This has led to the formation of huge data ...
- demonstrationJanuary 2022
An Efficient Bus Crowdedness Classification System
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 60, Pages 1–2https://doi.org/10.1145/3469877.3493587We propose an efficient bus crowdedness classification system that can be used in daily life. In particular, we analyze and study the data collected from real bus, aiming to deal with the difficulty of bus congestion classification. Besides, we combine ...
- short-paperJanuary 2022
PLM-IPE: A Pixel-Landmark Mutual Enhanced Framework for Implicit Preference Estimation
- Federico Becattini,
- Xuemeng Song,
- Claudio Baecchi,
- Shi-Ting Fang,
- Claudio Ferrari,
- Liqiang Nie,
- Alberto Del Bimbo
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 42, Pages 1–5https://doi.org/10.1145/3469877.3490621In this paper, we are interested in understanding how customers perceive fashion recommendations, in particular when observing a proposed combination of garments to compose an outfit. Automatically understanding how a suggested item is perceived, ...
- research-articleJanuary 2022
Inter-modality Discordance for Multimodal Fake News Detection
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 33, Pages 1–7https://doi.org/10.1145/3469877.3490614The paradigm shift in the consumption of news via online platforms has cultivated the growth of digital journalism. Contrary to traditional media, lowering entry barriers and enabling everyone to be part of content creation have disabled the concept of ...
- research-articleJanuary 2022
Zero-shot Recognition with Image Attributes Generation using Hierarchical Coupled Dictionary Learning
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 32, Pages 1–7https://doi.org/10.1145/3469877.3490613Zero-shot learning (ZSL) aims to recognize images from unseen (novel) classes with the training images from seen classes. The attributes of each class is exploited as auxiliary semantic information. Recently most ZSL approaches focus on learning visual-...
- research-articleJanuary 2022
Score Transformer: Generating Musical Score from Note-level Representation
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 31, Pages 1–7https://doi.org/10.1145/3469877.3490612In this paper, we explore the tokenized representation of musical scores using the Transformer model to automatically generate musical scores. Thus far, sequence models have yielded fruitful results with note-level (MIDI-equivalent) symbolic ...
- research-articleJanuary 2022
Efficient Proposal Generation with U-shaped Network for Temporal Sentence Grounding
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 26, Pages 1–7https://doi.org/10.1145/3469877.3490606Temporal Sentence Grounding aims to localize the relevant temporal region in a given video according to the query sentence. It is a challenging task due to the semantic gap between different modalities and diversity of the event duration. Proposal ...
- research-articleJanuary 2022
Visual Storytelling with Hierarchical BERT Semantic Guidance
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 24, Pages 1–7https://doi.org/10.1145/3469877.3490604Visual storytelling, which aims at automatically producing a narrative paragraph for photo album, remains quite challenging due to the complexity and diversity of photo album content. In addition, open-domain photo albums cover a broad range of topics ...