Artificial intelligence

Applied Filters

People

Conferences

Publication Date

41 Results for: Book/Issue: ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,801,766 records)|Limit your search to The ACM Full-Text Collection (771,395 records)

Showing 1 - 20of41 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
June 2023
A Recurrent Neural Network based Generative Adversarial Network for Long Multivariate Time Series Forecasting
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 181–189https://doi.org/10.1145/3591106.3592306

Some multimedia data from real life can be collected as multivariate time series data, such as community-contributed social data or sensor data. Many methods have been proposed for multivariate time series forecasting. In light of its importance in wide ...
2
196
Metrics
Total Citations2
Total Downloads196
Last 12 Months90
Last 6 weeks2
Get Access
abstract
June 2023
Algorithms for Generating and Evaluating Visually Sorted Grid Layouts
- Kai Uwe Barthel
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 672–673https://doi.org/10.1145/3591106.3592305

The increasing amount of visual data shared online highlights the importance of organizing and finding related content. However, current efforts to improve visual search and image classification lack support for exploratory image search. Sorting images ...
0
44
Metrics
Total Citations0
Total Downloads44
Last 12 Months18
Last 6 weeks0
Get Access
abstract
June 2023
MAD ’23 Workshop: Multimedia AI against Disinformation
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 676–677https://doi.org/10.1145/3591106.3592303

With recent advancements in synthetic media manipulation and generation, verifying multimedia content posted online has become increasingly difficult. Additionally, the malicious exploitation of AI technologies by actors to disseminate disinformation on ...
0
90
Metrics
Total Citations0
Total Downloads90
Last 12 Months37
Last 6 weeks1
Get Access
abstract
June 2023
ICDAR’23: Intelligent Cross-Data Analysis and Retrieval
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 674–675https://doi.org/10.1145/3591106.3592302

Recently, there has been an increased interest in cross-data research problems, such as predicting air quality using life logging images, predicting congestion using weather and tweets data, and predicting sleep quality using daily exercises and meals. ...
1
76
Metrics
Total Citations1
Total Downloads76
Last 12 Months32
Last 6 weeks2
Get Access
keynote
June 2023
Recognizing Actions in Videos under Domain Shift
- Elisa Ricci
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPage 671https://doi.org/10.1145/3591106.3592301

Action recognition, which consists in automatically recognizing the action being performed in a video sequence, is a fundamental task in computer vision and multimedia. Supervised action recognition has been widely studied because of the growing need for ...
0
53
Metrics
Total Citations0
Total Downloads53
Last 12 Months14
Last 6 weeks0
Get Access
research-article
June 2023
Learning with Adaptive Knowledge for Continual Image-Text Modeling
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 472–480https://doi.org/10.1145/3591106.3592297

In realistic application scenarios, existing methods for image-text modeling have limitations in dealing with data stream: training on all data needs too much computation/storage resources, and even the full access to previous data is invalid. In this ...
0
131
Metrics
Total Citations0
Total Downloads131
Last 12 Months53
Last 6 weeks3
1
Supplementary Material
ICMR2023_DHA-suppl.pdf ICMR2023_DHA-suppl.pdf
Get Access
research-article
June 2023
A Robust Deep Learning Enhanced Monocular SLAM System for Dynamic Environments
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 508–515https://doi.org/10.1145/3591106.3592295

Simultaneous Localization and Mapping (SLAM) has developed as a fundamental method for intelligent robot perception over the past decades. Most of the existing feature-based SLAM systems relied on traditional hand-crafted visual features and a strong ...
0
190
Metrics
Total Citations0
Total Downloads190
Last 12 Months74
Last 6 weeks5
Get Access
short-paper
Open Access
June 2023
Escaping local minima in deep reinforcement learning for video summarization
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 530–534https://doi.org/10.1145/3591106.3592288

State-of-the-art deep neural unsupervised video summarization methods mostly fall under the adversarial reconstruction framework. This employs a Generative Adversarial Network (GAN) structure and Long Short-Term Memory (LSTM) autoencoders during its ...
1
248
Metrics
Total Citations1
Total Downloads248
Last 12 Months190
Last 6 weeks11
View online with eReader
View this article in HTML format
PDF
research-article
June 2023
Zero-shot Sketch-based Image Retrieval with Adaptive Balanced Discriminability and Generalizability
- Jialin Tian,
- Xing Xu,
- Zuo Cao,
- Gong Zhang,
- Fumin Shen,
- Yang Yang
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 407–415https://doi.org/10.1145/3591106.3592287

Zero-shot sketch-based image retrieval (ZS-SBIR) is a task that learns semantic knowledge and embedding extraction to retrieve similar images using a sketch without any training examples of unseen classes. Existing methods have attempted to address the ...
1
188
Metrics
Total Citations1
Total Downloads188
Last 12 Months97
Last 6 weeks7
Get Access
short-paper
June 2023
Offensive Tactics Recognition in Broadcast Basketball Videos Based on 2D Camera View Player Heatmaps
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 571–575https://doi.org/10.1145/3591106.3592285

It is essential for sports teams to review their offensive and defensive tactical execution performance as well as understand their opponents’ tactics in order to identify effective counterattack strategies. This study focuses on basketball offensive ...
1
104
Metrics
Total Citations1
Total Downloads104
Last 12 Months45
Last 6 weeks6
1
Supplementary Material
[ICMR 2023] Offensive Tactics Recognition in Broadcast Basketball Videos.zip
Get Access
research-article
Open Access
June 2023
Dual-Modality Co-Learning for Unveiling Deepfake in Spatio-Temporal Space
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 85–94https://doi.org/10.1145/3591106.3592284

The emergence of photo-realistic deepfakes on a large scale has become a significant societal concern, which has garnered considerable attention from the research community. Several recent studies have identified the critical issue of “temporal ...
2
695
Metrics
Total Citations2
Total Downloads695
Last 12 Months440
Last 6 weeks60
View online with eReader
View this article in HTML format
PDF
research-article
June 2023
SIGMA-DF: Single-Side Guided Meta-Learning for Deepfake Detection
- Bing Han,
- Jianshu Li,
- Wenqi Ren,
- Man Luo,
- Jian Liu,
- Xiaochun Cao
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 153–161https://doi.org/10.1145/3591106.3592282

The current challenge of Deepfake detection is the cross-domain performance on unseen Deepfake data. Instead of extracting forgery artifacts that are robust to the cross-domain scenarios as most previous works, we propose a novel method named Single-...
2
335
Metrics
Total Citations2
Total Downloads335
Last 12 Months161
Last 6 weeks8
Get Access
research-article
June 2023
Framing the News: From Human Perception to Large Language Model Inferences
- David Alonso del Barrio,
- Daniel Gatica-Perez
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 627–635https://doi.org/10.1145/3591106.3592278

Identifying the frames of news is important to understand the articles’ vision, intention, message to be conveyed, and which aspects of the news are emphasized. Framing is a widely studied concept in journalism, and has emerged as a new topic in ...
5
251
Metrics
Total Citations5
Total Downloads251
Last 12 Months157
Last 6 weeks20
Get Access
research-article
Open Access
June 2023
Multi-channel Convolutional Neural Network for Precise Meme Classification
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 190–198https://doi.org/10.1145/3591106.3592275

This paper proposes a multi-channel convolutional neural network (MC-CNN) for classifying memes and non-memes. Our architecture is trained and validated on a challenging dataset that includes non-meme formats with textual attributes, which are also ...
1
675
Metrics
Total Citations1
Total Downloads675
Last 12 Months475
Last 6 weeks62
1
Supplementary Material
Supplemental Materials.zip
View online with eReader
View this article in HTML format
PDF
research-article
June 2023
Knowledge-Aware Causal Inference Network for Visual Dialog
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 253–261https://doi.org/10.1145/3591106.3592272

The effective knowledge and interaction within multi-modalities are key to Visual Dialog. Classic graph-based framework with the direct connection between history dialog and answer fails to give the right answer for the spurious guidance and strong bias ...
3
166
Metrics
Total Citations3
Total Downloads166
Last 12 Months92
Last 6 weeks8
Get Access
research-article
June 2023
Multi-modal Fake News Detection on Social Media via Multi-grained Information Fusion
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 343–352https://doi.org/10.1145/3591106.3592271

The easy sharing of multimedia content on social media has caused a rapid dissemination of fake news, which threatens society’s stability and security. Therefore, fake news detection has garnered extensive research interest in the field of social ...
14
568
Metrics
Total Citations14
Total Downloads568
Last 12 Months323
Last 6 weeks39
Get Access
research-article
June 2023
TsP-Tran: Two-Stage Pure Transformer for Multi-Label Image Retrieval
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 425–433https://doi.org/10.1145/3591106.3592269

Image retrieval aims to find similar images given the query. Most of existing retrieval works are based on the pre-trained model of single-label image classification. In practice, the query usually contains more than one instance, and the single label ...
2
168
Metrics
Total Citations2
Total Downloads168
Last 12 Months74
Last 6 weeks5
Get Access
research-article
June 2023
TDEC: Deep Embedded Image Clustering with Transformer and Distribution Information
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 280–288https://doi.org/10.1145/3591106.3592268

Image clustering is a crucial but challenging task in multimedia machine learning. Recently the combination of clustering with deep learning has achieved promising performance against conventional methods on high-dimensional image data. Unfortunately, ...
0
175
Metrics
Total Citations0
Total Downloads175
Last 12 Months79
Last 6 weeks14
Get Access
research-article
June 2023
Multi-Label Meta Weighting for Long-Tailed Dynamic Scene Graph Generation
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 39–47https://doi.org/10.1145/3591106.3592267

This paper investigates the problem of scene graph generation in videos with the aim of capturing semantic relations between subjects and objects in the form of ⟨ subject, predicate, object⟩ triplets. Recognizing the predicate between subject and object ...
1
238
Metrics
Total Citations1
Total Downloads238
Last 12 Months124
Last 6 weeks1
Get Access
research-article
June 2023
Improving Image Encoders for General-Purpose Nearest Neighbor Search and Classification
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 57–66https://doi.org/10.1145/3591106.3592266

Recent advances in computer vision research led to large vision foundation models that generalize to a broad range of image domains and perform exceptionally well in various image based tasks. However, content-based image-to-image retrieval is often ...
5
181
Metrics
Total Citations5
Total Downloads181
Last 12 Months84
Last 6 weeks8
Get Access

Applied Filters

People

Names

Institutions

Authors

Publications

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Caption

A Recurrent Neural Network based Generative Adversarial Network for Long Multivariate Time Series Forecasting

Algorithms for Generating and Evaluating Visually Sorted Grid Layouts

MAD ’23 Workshop: Multimedia AI against Disinformation

ICDAR’23: Intelligent Cross-Data Analysis and Retrieval

Recognizing Actions in Videos under Domain Shift

Learning with Adaptive Knowledge for Continual Image-Text Modeling

A Robust Deep Learning Enhanced Monocular SLAM System for Dynamic Environments

Escaping local minima in deep reinforcement learning for video summarization

Zero-shot Sketch-based Image Retrieval with Adaptive Balanced Discriminability and Generalizability

Offensive Tactics Recognition in Broadcast Basketball Videos Based on 2D Camera View Player Heatmaps

Dual-Modality Co-Learning for Unveiling Deepfake in Spatio-Temporal Space

SIGMA-DF: Single-Side Guided Meta-Learning for Deepfake Detection

Framing the News: From Human Perception to Large Language Model Inferences

Multi-channel Convolutional Neural Network for Precise Meme Classification

Knowledge-Aware Causal Inference Network for Visual Dialog

Multi-modal Fake News Detection on Social Media via Multi-grained Information Fusion

TsP-Tran: Two-Stage Pure Transformer for Multi-Label Image Retrieval

TDEC: Deep Embedded Image Clustering with Transformer and Distribution Information

Multi-Label Meta Weighting for Long-Tailed Dynamic Scene Graph Generation

Improving Image Encoders for General-Purpose Nearest Neighbor Search and Classification