MUME: Vol 30, No 3

Volume 30, Issue 3Jun 2024

Volume 30, Issue 3

Jun 2024

Publisher:

Springer-Verlag
Berlin, Heidelberg

ISSN:0942-4962

Tags:

Bibliometrics

Select All

Export Citations Save to Binder

research-article

RefinerHash: a new hashing-based re-ranking technique for image retrieval

https://doi.org/10.1007/s00530-024-01296-x

Abstract

Re-ranking is a task of refining an initially ranked list of images obtained from an image retrieval technique for a given query image, with the goal of enhancing retrieval performance in an efficient manner. However, existing re-ranking methods ...

research-article

Exploring contactless techniques in multimodal emotion recognition: insights into diverse applications, challenges, solutions, and prospects

https://doi.org/10.1007/s00530-024-01302-2

Abstract

In recent years, emotion recognition has received significant attention, presenting a plethora of opportunities for application in diverse fields such as human–computer interaction, psychology, and neuroscience, to name a few. Although unimodal ...

research-article

An efficient heuristic-aided adaptive autoencoder-based dilated DNN with attention mechanism for enhancing the performance of the MIMO system in 5G communication

https://doi.org/10.1007/s00530-024-01305-z

Abstract

On considering modern society, the wireless communication system plays a most significant role. This system has kept evolving and deployed into a wireless system of Fifth Generation (5G). One of the significant factors of the 5G system has ...

research-article

Synchronous composition and semantic line detection based on cross-attention

https://doi.org/10.1007/s00530-024-01307-x

Abstract

Composition detection and semantic line detection are important research topics in computer vision and play an important auxiliary role in the analysis of image esthetics. However, at present, few researchers have considered the internal ...

research-article

Unbinding tensor product representations for image captioning with semantic alignment and complementation

https://doi.org/10.1007/s00530-024-01309-9

Abstract

Image captioning, which describes an image with natural language, is an important but challenging multi-modal task. Many state-of-the-art methods generally adopt the encoder–decoder framework to implement information conversion from image modality ...

research-article

Semantic-wise guidance for efficient multimodal emotion recognition with missing modalities

https://doi.org/10.1007/s00530-024-01310-2

Abstract

Emotions play an important role in human–computer interaction. Multimodal emotion recognition combines feature information from different modalities to recognize emotional states. However, in real application scenarios, data from all modalities ...

research-article

Students and teachers learning together: a robust training strategy for neural network pruning

https://doi.org/10.1007/s00530-024-01315-x

Abstract

Convolutional neural networks (CNNs) serve as the backbone for extracting image features in the majority of computer vision tasks. In an attempt to make them deployable on small devices, many academics have released small neural networks that they ...

research-article

CLDE-Net: crowd localization and density estimation based on CNN and transformer network

https://doi.org/10.1007/s00530-024-01318-8

Abstract

Given a crowd image, there are two ways for human to approximate the counting number: exactly locating head points in each local region or directly estimating the total number of person based on the whole image. By imitating human visual ...

research-article

Efficient brain tumour detection system by Cascaded Fully Convolutional Improved DenseNet with Attention-based Adaptive Swin Unet-derived segmentation strategy

https://doi.org/10.1007/s00530-024-01319-7

Abstract

The reason behind brain tumors is the rapid and uncontrolled growth of human cells. From this, the developed model motivated to design of the framework for brain tumor detection. Deep learning-assisted developments help to enhance the detection of ...

research-article

Attention U-Net based on multi-scale feature extraction and WSDAN data augmentation for video anomaly detection

https://doi.org/10.1007/s00530-024-01320-0

Abstract

The widespread adoption of video surveillance systems in public security and network security domains has underscored the importance of video anomaly detection as a pivotal research area. To enhance the precision and robustness of anomaly ...

research-article

Multi-label local awareness and global co-occurrence priori learning improve chest X-ray classification

https://doi.org/10.1007/s00530-024-01321-z

Abstract

When analyzing screening chest X-ray (CXR) images, radiologists can naturally consider information about the relationships between pathologies, namely pathological co-occurrence, and interdependence. Such topologically meaningful pathological ...

research-article

CA-CLIP: category-aware adaptation of CLIP model for few-shot class-incremental learning

https://doi.org/10.1007/s00530-024-01322-y

Abstract

Few-shot class-incremental learning (FSCIL) learns from continuously arriving new categories, each with only a small number of training samples. As a challenging problem, FSCIL aims to mitigate the catastrophic forgetting of old knowledge while ...

research-article

Image retrieval based on deep Tamura feature descriptor

https://doi.org/10.1007/s00530-024-01323-x

Abstract

Various levels of visual features have different effects in image retrieval, and deep features can express higher-level features or semantic information. Tamura texture feature belongs to the handcrafted feature, and it can represent texture ...

research-article

Link prediction in social networks using hyper-motif representation on hypergraph

https://doi.org/10.1007/s00530-024-01324-w

Abstract

Link prediction, a critical pursuit in complex networks research, revolves around the predictive understanding of connections between nodes. Our novel approach introduces a hypergraph to model the network, diverging from the conventional “node–...

research-article

A review of deep learning algorithms for modeling drug interactions

https://doi.org/10.1007/s00530-024-01325-9

Abstract

The interactions between therapeutics and their targets are an important part of the drug development process. To counter the cost, time and accuracy related issues, novel and efficient DL algorithms are required. These approaches have proven ...

research-article

Modeling methods of cylindrical and axisymmetric waterbomb origami based on multi-objective optimization

https://doi.org/10.1007/s00530-024-01326-8

Abstract

The basic principle of origami is to use two-dimensional flat materials to obtain various three-dimensional target shapes by folding crease patterns. Among them, the study of waterbomb tessellations inspires the design and functionality ...

research-article

Linking unknown characters via oracle bone inscriptions retrieval

https://doi.org/10.1007/s00530-024-01327-7

Abstract

Retrieving useful information from existing collections of oracle bone rubbing images plays a pivotal role in the study of oracle bone inscription decipherment. However, current systems for processing oracle bone information rely on expert-curated ...

research-article

A novel multiagent system for cervical motor control evaluation and individualized therapy: integrating gamification and portable solutions

https://doi.org/10.1007/s00530-024-01328-6

Abstract

The study focused on designing a portable, objective device for assessing and addressing Cervical Motor Control (CMC) impairments. This device is based on a proposed architecture that employs advanced technology to evaluate and enhance patients’ ...

research-article

Joint frame-level and CTU-level rate control based on constant perceptual quality

https://doi.org/10.1007/s00530-024-01329-5

Abstract

For a given network bandwidth, optimizing the rate control to achieve the best compression performance is essential in video communication and storage. Besides the enhancement of the overall coding efficiency, the rate control algorithm for ...

research-article

PathNet: a novel multi-pathway convolutional neural network for few-shot image classification from scratch

https://doi.org/10.1007/s00530-024-01330-y

Abstract

In recent years, advanced computer vision models have trended toward deeper and larger network architectures, and model depth is often considered an important feature for achieving superior performance. While deeper networks can help solve complex ...

research-article

A visual analysis approach for data transformation via domain knowledge and intelligent models

https://doi.org/10.1007/s00530-024-01331-x

Abstract

Industry benchmarking involves comparing and analyzing a company’s performance with other top-performing enterprises. PDF documents contain valuable corporate information, but their non-editable nature makes data extraction complex. This study ...

research-article

IS-DGM: an improved steganography method based on a deep generative model and hyper logistic map encryption via social media networks

https://doi.org/10.1007/s00530-024-01332-w

Abstract

The exchange of information through social networking sites has become a major risk due to the possibility of obtaining millions of subscribers’ data at any time without the right. Multimedia security is a multifaceted field that involves various ...

research-article

MF-DAT: a stock trend prediction of the double-graph attention network based on multisource information fusion

https://doi.org/10.1007/s00530-024-01333-9

Abstract

Stock forecasting research, which aims to predict the future price movement of stocks, has been the focus of investors and scholars. This is important for practical applications related to human-centric computing and information sciences. Previous ...

research-article

A novel exponent–sine–cosine chaos map-based multiple-image encryption technique

https://doi.org/10.1007/s00530-024-01334-8

Abstract

This paper proposes a multiple-image encryption (MIE) approach that uses a novel exponent–sine–cosine (ESC) chaotic map along with the dynamic permutation and DNA-based diffusion. In the first phase of the proposed approach, the three components ...

research-article

PointCMC: cross-modal multi-scale correspondences learning for point cloud understanding

https://doi.org/10.1007/s00530-024-01335-7

Abstract

Existing cross-modal frameworks have achieved impressive performance in point cloud object representations learning, where a 2D image encoder is employed to transfer knowledge to a 3D point cloud encoder. However, the local structures between ...

research-article

Deep Learning-based forgery detection and localization for compressed images using a hybrid optimization model

https://doi.org/10.1007/s00530-024-01336-6

Abstract

Manipulation of digital images has become quite common in recent years because of the rise of various image editing tools. It has become a challenging task to identify authentic and tampered images, since tampered images are non-distinguishable by ...

research-article

Pimo: memory-efficient privacy protection in video streaming and analytics

https://doi.org/10.1007/s00530-024-01337-5

Abstract

Video streaming from cameras to backend cloud or edge servers for neural-based analytics has gained significant popularity. However, the transmission of data from cameras to a backend raises substantial privacy concerns, particularly regarding ...

research-article

Improving collaborative filtering with SNE–GCN: a second-order neighbor enhanced graph convolutional network

https://doi.org/10.1007/s00530-024-01338-4

Abstract

Graph collaborative filtering uses user-item interactions to capture user preferences for items. While this approach proves highly effective, its performance may suffer from the sparse user-item interactions. On one hand, existing methods lack ...

research-article

Visual transductive learning via iterative label correction

https://doi.org/10.1007/s00530-024-01339-3

Abstract

Unsupervised domain adaptation (UDA) aims to transfer knowledge across domains when there is no labeled data available in the target domain. In this way, UDA methods attempt to utilize pseudo-labeled target samples to align distribution across the ...

research-article

Dual-path temporal map optimization for make-up temporal video grounding

https://doi.org/10.1007/s00530-024-01340-w

Abstract

Make-up temporal video grounding (MTVG) aims to localize the target video segment, which is semantically related to a sentence describing a make-up activity in a make-up video. Compared with the general video grounding, MTVG focuses on meticulous ...

Multimedia Systems

Sections

RefinerHash: a new hashing-based re-ranking technique for image retrieval

Exploring contactless techniques in multimodal emotion recognition: insights into diverse applications, challenges, solutions, and prospects

An efficient heuristic-aided adaptive autoencoder-based dilated DNN with attention mechanism for enhancing the performance of the MIMO system in 5G communication

Synchronous composition and semantic line detection based on cross-attention

Unbinding tensor product representations for image captioning with semantic alignment and complementation

Semantic-wise guidance for efficient multimodal emotion recognition with missing modalities

Students and teachers learning together: a robust training strategy for neural network pruning

CLDE-Net: crowd localization and density estimation based on CNN and transformer network

Efficient brain tumour detection system by Cascaded Fully Convolutional Improved DenseNet with Attention-based Adaptive Swin Unet-derived segmentation strategy

Attention U-Net based on multi-scale feature extraction and WSDAN data augmentation for video anomaly detection

Multi-label local awareness and global co-occurrence priori learning improve chest X-ray classification

CA-CLIP: category-aware adaptation of CLIP model for few-shot class-incremental learning

Image retrieval based on deep Tamura feature descriptor

Link prediction in social networks using hyper-motif representation on hypergraph

A review of deep learning algorithms for modeling drug interactions

Modeling methods of cylindrical and axisymmetric waterbomb origami based on multi-objective optimization

Linking unknown characters via oracle bone inscriptions retrieval

A novel multiagent system for cervical motor control evaluation and individualized therapy: integrating gamification and portable solutions

Joint frame-level and CTU-level rate control based on constant perceptual quality

PathNet: a novel multi-pathway convolutional neural network for few-shot image classification from scratch

A visual analysis approach for data transformation via domain knowledge and intelligent models

IS-DGM: an improved steganography method based on a deep generative model and hyper logistic map encryption via social media networks

MF-DAT: a stock trend prediction of the double-graph attention network based on multisource information fusion

A novel exponent–sine–cosine chaos map-based multiple-image encryption technique

PointCMC: cross-modal multi-scale correspondences learning for point cloud understanding

Deep Learning-based forgery detection and localization for compressed images using a hybrid optimization model

Pimo: memory-efficient privacy protection in video streaming and analytics

Improving collaborative filtering with SNE–GCN: a second-order neighbor enhanced graph convolutional network

Visual transductive learning via iterative label correction

Dual-path temporal map optimization for make-up temporal video grounding

Sections

Save to Binder

Comments