Information systems

Applied Filters

People

Conferences

Publication Date

Past 5 years

35 Results for: Book/Issue: MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,771,164 records)|Limit your search to The ACM Full-Text Collection (760,672 records)

Showing 1 - 20of35 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
January 2022
Visible-Infrared Cross-Modal Person Re-identification based on Positive Feedback
- Lingyi Lu,
- Xin Xu
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 73, Pages 1–6https://doi.org/10.1145/3469877.3497693

Visible-infrared person re-identification (VI-ReID) is undoubtedly a challenging cross-modality person retrieval task with increasing appreciation. Compared to traditional person ReID that focuses on person images in a single RGB mode, VI-ReID suffers ...
1
121
Metrics
Total Citations1
Total Downloads121
Last 12 Months21
Last 6 weeks7
Get Access
research-article
January 2022
Deep Adaptive Attention Triple Hashing
- Yang Shi,
- Xiushan Nie,
- Quan Zhou,
- Li Zou,
- Yilong Yin
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 78, Pages 1–5https://doi.org/10.1145/3469877.3495646

Recent studies have verified that learning compact hash codes can facilitate big data retrieval processing. In particular, learning the deep hash function can greatly improve the retrieval performance. However, the existing deep supervised hashing ...
1
70
Metrics
Total Citations1
Total Downloads70
Last 12 Months16
Last 6 weeks1
Get Access
research-article
Open Access
January 2022
Generation of Variable-Length Time Series from Text using Dynamic Time Warping-Based Method
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 76, Pages 1–7https://doi.org/10.1145/3469877.3495644

This study is aimed at finding a suitable method for generating time-series data such as video clips or avatar motions from text stating multiple events. This paper addresses the generation of variable-length time-series data considering the order and ...
0
674
Metrics
Total Citations0
Total Downloads674
Last 12 Months328
Last 6 weeks45
View online with eReader
View this article in HTML format
PDF
research-article
January 2022
Focusing Attention across Multiple Images for Multimodal Event Detection
- Yangyang Li,
- Jun Li,
- Hao Jin,
- Liang Peng
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 74, Pages 1–6https://doi.org/10.1145/3469877.3495642

Multimodal social event detection has been attracting tremendous research attention in recent years, due to that it provides comprehensive and complementary understanding of social events and is important to public security and administration. Most ...
2
189
Metrics
Total Citations2
Total Downloads189
Last 12 Months47
Last 6 weeks6
Get Access
research-article
Open Access
January 2022
CFCR: A Convolution and Fusion Model for Cross-platform Recommendation
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 65, Pages 1–6https://doi.org/10.1145/3469877.3495639

With the emergence of various online platforms, associating different platforms is playing an increasingly important role in many applications. Cross-platform recommendation aims to improve recommendation accuracy through associating information from ...
1
180
Metrics
Total Citations1
Total Downloads180
Last 12 Months66
Last 6 weeks16
View online with eReader
View this article in HTML format
PDF
short-paper
January 2022
Discovering Social Connections using Event Images
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 68, Pages 1–5https://doi.org/10.1145/3469877.3493699

Social events are very common activities, where people can interact with each other. During an event, the organizer often hires photographers to take images, which provide rich information about the participants’ behaviour. In this work, we propose a ...
0
96
Metrics
Total Citations0
Total Downloads96
Last 12 Months13
Last 6 weeks0
Get Access
short-paper
January 2022
SangeetXML: An XML Format for Score Retrieval for Indic Music
- Chandan Misra
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 66, Pages 1–5https://doi.org/10.1145/3469877.3493697

Efficient retrieval of score information from a large set of XML-encoded scores and lyrics in an XML database requires such music data to be stored in a well-structured and systematic technique. Current search engines for Indic music (Tagore songs in ...
0
64
Metrics
Total Citations0
Total Downloads64
Last 12 Months10
Last 6 weeks0
1
Supplementary Material
Supplemental files
Get Access
short-paper
January 2022
Hybrid Improvements in Multimodal Analysis for Deep Video Understanding
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 69, Pages 1–5https://doi.org/10.1145/3469877.3493599

The Deep Video Understanding Challenge (DVU) is a task that focuses on comprehending long duration videos which involve many entities. Its main goal is to build relationship and interaction knowledge graph between entities to answer relevant questions. ...
2
92
Metrics
Total Citations2
Total Downloads92
Last 12 Months18
Last 6 weeks3
Get Access
short-paper
January 2022
An Embarrassingly Simple Approach to Discrete Supervised Hashing
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 56, Pages 1–5https://doi.org/10.1145/3469877.3493595

Prior hashing works typically learn a projection function from high-dimensional visual feature space to low-dimensional latent space. However, such a projection function remains several crucial bottlenecks: 1) information loss and coding redundancy are ...
1
50
Metrics
Total Citations1
Total Downloads50
Last 12 Months12
Last 6 weeks1
Get Access
short-paper
January 2022
Conditioned Image Retrieval for Fashion using Contrastive Learning and CLIP-based Features
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 54, Pages 1–5https://doi.org/10.1145/3469877.3493593

Building on the recent advances in multimodal zero-shot representation learning, in this paper we explore the use of features obtained from the recent CLIP model to perform conditioned image retrieval. Starting from a reference image and an additive ...
8
676
Metrics
Total Citations8
Total Downloads676
Last 12 Months121
Last 6 weeks4
Get Access
short-paper
January 2022
Deep Multiple Length Hashing via Multi-task Learning
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 52, Pages 1–5https://doi.org/10.1145/3469877.3493591

Hashing can compress heterogeneous high-dimensional data into compact binary codes. For most existing hash methods, they first predetermine a fixed length for the hash code and then train the model based on this fixed length. However, when the task ...
1
57
Metrics
Total Citations1
Total Downloads57
Last 12 Months8
Last 6 weeks0
Get Access
demonstration
January 2022
RoadAtlas: Intelligent Platform for Automated Road Defect Detection and Asset Management
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 62, Pages 1–3https://doi.org/10.1145/3469877.3493589

With the rapid development of intelligent detection algorithms based on deep learning, much progress has been made in automatic road defect recognition and road marking parsing. This can effectively address the issue of an expensive and time-consuming ...
1
126
Metrics
Total Citations1
Total Downloads126
Last 12 Months25
Last 6 weeks1
1
Supplementary Material
Presentation slides
Get Access
demonstration
January 2022
Private-Share: A Secure and Privacy-Preserving De-Centralized Framework for Large Scale Data Sharing
- Arun Zachariah,
- Maha Alrasheed
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 61, Pages 1–3https://doi.org/10.1145/3469877.3493588

The various data and privacy regulations introduced around the globe, require data to be stored in a secure and privacy-preserving fashion. Non-compliance with these regulations come with major consequences. This has led to the formation of huge data ...
0
91
Metrics
Total Citations0
Total Downloads91
Last 12 Months17
Last 6 weeks2
Get Access
demonstration
January 2022
An Efficient Bus Crowdedness Classification System
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 60, Pages 1–2https://doi.org/10.1145/3469877.3493587

We propose an efficient bus crowdedness classification system that can be used in daily life. In particular, we analyze and study the data collected from real bus, aiming to deal with the difficulty of bus congestion classification. Besides, we combine ...
0
42
Metrics
Total Citations0
Total Downloads42
Last 12 Months6
Last 6 weeks0
1
Supplementary Material
Presentation slides
Get Access
short-paper
January 2022
PLM-IPE: A Pixel-Landmark Mutual Enhanced Framework for Implicit Preference Estimation
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 42, Pages 1–5https://doi.org/10.1145/3469877.3490621

In this paper, we are interested in understanding how customers perceive fashion recommendations, in particular when observing a proposed combination of garments to compose an outfit. Automatically understanding how a suggested item is perceived, ...
9
96
Metrics
Total Citations9
Total Downloads96
Last 12 Months20
Last 6 weeks0
Get Access
research-article
January 2022
Inter-modality Discordance for Multimodal Fake News Detection
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 33, Pages 1–7https://doi.org/10.1145/3469877.3490614

The paradigm shift in the consumption of news via online platforms has cultivated the growth of digital journalism. Contrary to traditional media, lowering entry barriers and enabling everyone to be part of content creation have disabled the concept of ...
8
415
Metrics
Total Citations8
Total Downloads415
Last 12 Months120
Last 6 weeks11
Get Access
research-article
January 2022
Zero-shot Recognition with Image Attributes Generation using Hierarchical Coupled Dictionary Learning
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 32, Pages 1–7https://doi.org/10.1145/3469877.3490613

Zero-shot learning (ZSL) aims to recognize images from unseen (novel) classes with the training images from seen classes. The attributes of each class is exploited as auxiliary semantic information. Recently most ZSL approaches focus on learning visual-...
2
72
Metrics
Total Citations2
Total Downloads72
Last 12 Months14
Last 6 weeks1
Get Access
research-article
January 2022
Score Transformer: Generating Musical Score from Note-level Representation
- Masahiro Suzuki
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 31, Pages 1–7https://doi.org/10.1145/3469877.3490612

In this paper, we explore the tokenized representation of musical scores using the Transformer model to automatically generate musical scores. Thus far, sequence models have yielded fruitful results with note-level (MIDI-equivalent) symbolic ...
1
145
Metrics
Total Citations1
Total Downloads145
Last 12 Months43
Last 6 weeks8
1
Supplementary Material
Converter tools, Metric
Get Access
research-article
January 2022
Efficient Proposal Generation with U-shaped Network for Temporal Sentence Grounding
- Ludan Ruan,
- Qin Jin
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 26, Pages 1–7https://doi.org/10.1145/3469877.3490606

Temporal Sentence Grounding aims to localize the relevant temporal region in a given video according to the query sentence. It is a challenging task due to the semantic gap between different modalities and diversity of the event duration. Proposal ...
0
69
Metrics
Total Citations0
Total Downloads69
Last 12 Months9
Last 6 weeks2
Get Access
research-article
January 2022
Visual Storytelling with Hierarchical BERT Semantic Guidance
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in AsiaArticle No.: 24, Pages 1–7https://doi.org/10.1145/3469877.3490604

Visual storytelling, which aims at automatically producing a narrative paragraph for photo album, remains quite challenging due to the complexity and diversity of photo album content. In addition, open-domain photo albums cover a broad range of topics ...
2
198
Metrics
Total Citations2
Total Downloads198
Last 12 Months32
Last 6 weeks4
Get Access

Applied Filters

People

Names

Institutions

Authors

Editors

Publications

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Caption

Visible-Infrared Cross-Modal Person Re-identification based on Positive Feedback

Deep Adaptive Attention Triple Hashing

Generation of Variable-Length Time Series from Text using Dynamic Time Warping-Based Method

Focusing Attention across Multiple Images for Multimodal Event Detection

CFCR: A Convolution and Fusion Model for Cross-platform Recommendation

Discovering Social Connections using Event Images

SangeetXML: An XML Format for Score Retrieval for Indic Music

Hybrid Improvements in Multimodal Analysis for Deep Video Understanding

An Embarrassingly Simple Approach to Discrete Supervised Hashing

Conditioned Image Retrieval for Fashion using Contrastive Learning and CLIP-based Features

Deep Multiple Length Hashing via Multi-task Learning

RoadAtlas: Intelligent Platform for Automated Road Defect Detection and Asset Management

Private-Share: A Secure and Privacy-Preserving De-Centralized Framework for Large Scale Data Sharing

An Efficient Bus Crowdedness Classification System

PLM-IPE: A Pixel-Landmark Mutual Enhanced Framework for Implicit Preference Estimation

Inter-modality Discordance for Multimodal Fake News Detection

Zero-shot Recognition with Image Attributes Generation using Hierarchical Coupled Dictionary Learning

Score Transformer: Generating Musical Score from Note-level Representation

Efficient Proposal Generation with U-shaped Network for Temporal Sentence Grounding

Visual Storytelling with Hierarchical BERT Semantic Guidance