Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleJanuary 2025
SnapSeek 2.0 at Video Browser Showdown 2025
- Minh-Quan Ho-Le,
- Duy-Khang Ho,
- Huy-Hoang Do-Huu,
- Nhut-Thanh Le-Hinh,
- Hoa-Vien Vo-Hoang,
- Van-Tu Ninh,
- Cathal Gurrin,
- Minh-Triet Tran
AbstractSnapSeek 2.0 is a novel system designed for information retrieval from videos participating in the Video Browser Showdown 2025. The system aims to enhance interactivity and accelerate query input speed by employing a human feedback procedure and ...
- ArticleJanuary 2025
Exquisitor at the Video Browser Showdown 2025: Unifying Conversational Search and User Relevance Feedback
AbstractExquisitor is a multimedia retrieval system designed to combine conversational search and user relevance feedback to enhance interactive video search. In this work, we present recent improvements to Exquisitor, focusing on integrating these two ...
- research-articleSeptember 2024
Supervised Semantic-Embedded Hashing for Multimedia Retrieval
AbstractWith the rapid development of multimedia technologies, the efficient retrieval of large-scale multimedia information is regarded as a critical area to research. Hashing methods have achieved superiority as an effective solution for multimedia ...
Highlights- Proposed a novel supervised semantic-embedding hashing method.
- Designed a Semantic-Enhanced Representation module.
- Proposed a Class Structure Preservation to extract the semantic relationships.
- Evaluated the effectiveness of ...
- review-articleAugust 2024
Unsupervised affinity learning based on manifold analysis for image retrieval: A survey
AbstractDespite the advances in machine learning techniques, similarity assessment among multimedia data remains a challenging task of broad interest in computer science. Substantial progress has been achieved in acquiring meaningful data representations,...
Highlights- Comprehensive survey on unsupervised post-processing methods for image retrieval.
- Discussion including a brief history, organization, and evolution of the area.
- Analysis of literature trends through keywords network analysis.
- A ...
- research-articleJune 2024
Structure-aware contrastive hashing for unsupervised cross-modal retrieval
AbstractCross-modal hashing has attracted a lot of attention and achieved remarkable success in large-scale cross-media similarity retrieval applications because of its superior computational efficiency and low storage overhead. However, constructing ...
Highlights- We propose an informative multimodal correlation matrix construction approach.
- We propose a multimodal structure-aware alignment network to bridge heterogeneous gaps.
- Extensive experiments show SACH’s superiority in cross-modal ...
-
- research-articleMarch 2024
Fast metric multi-view hashing for multimedia retrieval
AbstractThe acquisition of multi-view hash representation for heterogeneous data holds paramount importance in the domain of multimedia retrieval. The limited retrieval precision observed in current approaches stems from their inadequate integration of ...
Highlights- We propose a novel multi-view hash method, which achieves state-of-the-art results in multimedia retrieval.
- We propose a deep metric loss to obtain information provided by dissimilar samples.
- We use Context Gating to address the ...
- ArticleJanuary 2024
Exquisitor at the Video Browser Showdown 2024: Relevance Feedback Meets Conversational Search
AbstractAn important open problem in video retrieval and exploration concerns the generation and refinement of queries for complex tasks that standard methods are unable to solve, especially when the systems are used by novices. In conversational search, ...
- research-articleJune 2023
Explicit Knowledge Integration for Knowledge-Aware Visual Question Answering about Named Entities
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 29–38https://doi.org/10.1145/3591106.3592227Recent years have shown unprecedented growth of interest in Vision-Language related tasks, with the need to address the inherent challenges of integrating linguistic and visual information to solve real-world applications. Such a typical task is Visual ...
- ArticleApril 2023
Conversational Search for Multimedia Archives
AbstractThe growth of media archives (including text, speech, video and audio) has led to significant interest in developing search methods for multimedia content. An ongoing challenge of multimedia search is user interaction during the search process, ...
- ArticleMarch 2023
A Study of a Cross-modal Interactive Search Tool Using CLIP and Temporal Fusion
AbstractRecently, the CLIP model demonstrated impressive performance in text-image search and zero classification tasks. Hence, CLIP was used as a primary model in many cross-modal search tools at evaluation campaigns. In this paper, we show a study ...
- research-articleAugust 2022
ISRE-Framework: nonlinear and multimodal exploration of image search result spaces
Multimedia Tools and Applications (MTAA), Volume 81, Issue 19Pages 27275–27308https://doi.org/10.1007/s11042-022-12561-4AbstractThe extensive information delivery power and an immense volume of image objects make them frequently use multimedia content over the web. However, access to desired image objects to satisfy visual information needs by employing primitive ...
- ArticleJune 2022
A Task Category Space for User-Centric Comparative Multimedia Search Evaluations
- Jakub Lokoč,
- Werner Bailer,
- Kai Uwe Barthel,
- Cathal Gurrin,
- Silvan Heller,
- Björn þór Jónsson,
- Ladislav Peška,
- Luca Rossetto,
- Klaus Schoeffmann,
- Lucia Vadicamo,
- Stefanos Vrochidis,
- Jiaxin Wu
AbstractIn the last decade, user-centric video search competitions have facilitated the evolution of interactive video search systems. So far, these competitions focused on a small number of search task categories, with few attempts to change task ...
- research-articleOctober 2021
Composite description based on color vector quantization and visual primary features for CBIR tasks
Multimedia Tools and Applications (MTAA), Volume 80, Issue 24Pages 33409–33427https://doi.org/10.1007/s11042-021-11353-6AbstractThis paper presents a novel method for content-based color image retrieval that combines color vector quantization and visual primary features into a compact feature representation. Color vector quantization is proposed to describe the image in a ...
- research-articleSeptember 2021
On augmenting database schemas by latent visual attributes
Knowledge and Information Systems (KAIS), Volume 63, Issue 9Pages 2277–2312https://doi.org/10.1007/s10115-021-01595-zAbstractDecision-making in our everyday lives is surrounded by visually important information. Fashion, housing, dating, food or travel are just a few examples. At the same time, most commonly used tools for information retrieval operate on relational and ...
- ArticleMay 2021
City-Stories: Combining Entity Linking, Multimedia Retrieval, and Crowdsourcing to Make Historical Data Accessible
AbstractDigitized historical image collections as provided by individuals or memory institutions often suffer from limited or a complete lack of metadata In this paper, we present the City-Stories system that combines entity linking, multimedia retrieval, ...
- research-articleNovember 2020
Similarity ranking technique exploiting the structure of similarity relationships
AbstractThis paper proposes a similarity ranking technique that exploits the entire network structure of similarity relationships for multimedia, particularly image, databases. The main problem in the similarity ranking on multimedia is the meaning gap ...
- research-articleJune 2020
Robust Unsupervised Cross-modal Hashing for Multimedia Retrieval
ACM Transactions on Information Systems (TOIS), Volume 38, Issue 3Article No.: 30, Pages 1–25https://doi.org/10.1145/3389547With the quick development of social websites, there are more opportunities to have different media types (such as text, image, video, etc.) describing the same topic from large-scale heterogeneous data sources. To efficiently identify the inter-media ...
- research-articleJanuary 2020
A cross-modal multimedia retrieval method using depth correlation mining in big data environment
Multimedia Tools and Applications (MTAA), Volume 79, Issue 1-2Pages 1339–1354https://doi.org/10.1007/s11042-019-08238-0AbstractCross-media retrieval is a technology aimed at breaking through the shackles of single-mode retrieval technology, which is limited to the same multimedia form. It is also hoped to be able to search each other across the media form. Comprehensive ...
- research-articleNovember 2019
FCA-based knowledge representation and local generalized linear models to address relevance and diversity in diverse social images
Future Generation Computer Systems (FGCS), Volume 100, Issue CPages 250–265https://doi.org/10.1016/j.future.2019.05.029AbstractIn social image retrieval, the main goal is to offer a relevant but also diverse result set of images to the user. To address relevance and diversity at the same time, we propose a multi-modal procedure. This approach deals with the ...
Highlights- This paper proposes a multimedia approach to deal with relevance and diversity.
- ArticleOctober 2019
An Image Retrieval System for Video
- Paolo Bolettieri,
- Fabio Carrara,
- Franca Debole,
- Fabrizio Falchi,
- Claudio Gennaro,
- Lucia Vadicamo,
- Claudio Vairo
AbstractSince the 1970’s the Content-Based Image Indexing and Retrieval (CBIR) has been an active area. Nowadays, the rapid increase of video data has paved the way to the advancement of the technologies in many different communities for the creation of ...