Computing methodologies

Applied Filters

People

Publications

Publication Date

Searched The ACM Guide to Computing Literature (3,765,099 records)|Limit your search to The ACM Full-Text Collection (758,140 records)

Showing 1 - 20of1,918 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
September 2024
DASOD: Detail-aware salient object detection
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105154
Abstract
Salient object detection (SOD) is a challenging task in computer vision. Current SOD approaches have made significant progress, but they fail in challenging scenarios. This paper categorizes the existing challenges in SOD into four groups: images ...
Highlights

Proposing the DASOD method for salient object detection (SOD) in challenging images.
Identifying existing challenges and addressing them via camouflaged object detection.
SOD in images with complex background, low contrast, ...
0
Metrics
Total Citations0
research-article
September 2024
Mitigating human fall injuries: A novel system utilizing 3D 4-stream convolutional neural networks and image fusion
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105153
Abstract
Unintentional human falls, especially in seniors, lead to serious injuries, fatalities, and reduced standard of life. Vision-based fall detection methods have demonstrated their usefulness in timely fall response, helping to lessen such injuries. ...
Highlights

A human fall detection system using a 4-stream 3D CNN is proposed.
The human body is segmented from the video clip using ResNet18.
The incident is divided into four sequences: standing, falling, fallen, and others.
Frames of ...
0
Metrics
Total Citations0
research-article
September 2024
CrowdAlign: Shared-weight dual-level alignment fusion for RGB-T crowd counting
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105152
Abstract
The combination of visible and thermal images has been proven to be effective in improving accuracy for crowd counting in illumination-unconstrained scenes. However, the challenging problem of misalignment in RGB-T image pairs has not been ...
Graphical abstract

Display Omitted
Highlights

Explore the cross-modal misalignment issue for RGB-Thermal crowd counting task.
Design a spatial semantic joint alignment block for misaligned cross-modal counting.
Raise a low-frequency feature attention filtering block for the ...
0
Metrics
Total Citations0
research-article
September 2024
Omnidirectional image quality assessment with local–global vision transformers
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105151
Abstract
With the rising popularity of omnidirectional images (ODIs) in virtual reality applications, the need for specialized image quality assessment (IQA) methods becomes increasingly critical. Traditional IQA approaches, designed for rectilinear ...
Highlights

Introduces LGT360IQ, a dual-branch framework for 360° image quality assessment (IQA).
Mimics top-down and bottom-up visual attention for omnidirectional images (ODIs).
Employs Transformers for local and global processing to ...
0
Metrics
Total Citations0
research-article
September 2024
OFACD: An end-to-end change detection network for small UAVs remote sensing with viewpoint differences
- Yaxin Dong,
- Fei Li,
- Kai Yan,
- Shen Deng,
- Tao Wen,
- Yang Yang
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105150
Abstract
Change detection using remote sensing images captured by small unmanned aerial vehicles (small UAVs) finds wide applications across various fields. However, there is a challenge when dealing with images captured at the same location by small UAVs ...
Highlights

A novel change detection model for images with viewpoint differences is proposed.
OFACD simultaneously aligns images and detects changes toenhanceperformance.
Created two change detection datasets with viewpoint differences.
...
0
Metrics
Total Citations0
research-article
September 2024
Nighttime image semantic segmentation with retinex theory
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105149
Abstract
Nighttime image semantic segmentation is challenging due to low-light and diverse lighting conditions. A straightforward solution is to first enhance nighttime scene images to resemble daytime scene before performing segmentation. This kind of ...
Highlights

Developed a Retinex-based network RNightSeg to enhance nighttime image segmentation.
Introduced illumination-independent reflectance component for accurate segmentation.
Method improves performance across diverse nighttime ...
0
Metrics
Total Citations0
article
September 2024
Erratum to “Two-dimensional hybrid incremental learning (2DHIL) framework for semantic segmentation of skin tissues” [Image and Vision Computing. Vol148 (2024) 105098]
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105148
0
Metrics
Total Citations0
erratum
September 2024
Two-dimensional hybrid incremental learning (2DHIL) framework for semantic segmentation of skin tissues
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105147
Abstract
This study aims to enhance the robustness and generalization capability of a deep learning transformer model used for segmenting skin carcinomas and tissues through the introduction of incremental learning. Deep learning AI models demonstrate ...
Highlights

A novel 2-dimensional hybrid incremenatl learning framework for multi-class segmentation of skin tissues and non-melanoma skin cancer
Combined knowledge distillation and the replay approach to address catastrophic forgetting
This ...
0
Metrics
Total Citations0
research-article
September 2024
Predictive breast cancer diagnosis using ensemble fuzzy model
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105146
Abstract
Breast cancer continues to be a major global health challenge, necessitating reliable diagnostic methods for early detection and improved patient outcomes. This study introduces a novel ensemble fuzzy model for predictive breast cancer diagnosis, ...
Highlights

A new approach integrates deep-learning classifiers with fuzzy logic for improved decision-making.
The ensemble includes Inception-V4, Inception-ResNet, and Inception V3/V4 + BN, enhancing accuracy.
Fuzzy logic allows adaptive ...
0
Metrics
Total Citations0
research-article
September 2024
SMTCNN - A global spatio-temporal texture convolutional neural network for 3D dynamic texture recognition
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105145
Abstract
Dynamic textures (DT) are typically 3D videos of physical processes showing statistical regularity but have indeterminate spatial and temporal extent. Existing DT recognition methods usually neglect the global spatio-temporal relationships of DT ...
Highlights

A novel SMTCNN architecture considering the global spatio-temporal relationships of dynamic textures is proposed for 3D dynamic texture recognition.
We present a new contextual module transforming 2D CNN features to 1D vectors for 3D ...
0
Metrics
Total Citations0
research-article
September 2024
Indirect deformable image registration using synthetic image generated by unsupervised deep learning
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105143
Abstract Background and purpose
3D image registration is now common in many medical domains. Multimodal registration implies the use of different imaging modalities, which results in lower accuracy compared to monomodal registration. The aim of this study ...
Graphical abstract

Display Omitted
Highlights

We translated a multimodal CBCT-MR/CT registration into a sCT/CT registration.
The unsupervised synthesis method was based on a cGAN using novel perceptual loss.
The best registration accuracy was obtained via synthetic image ...
0
Metrics
Total Citations0
research-article
September 2024
Dual-consistency constraints network for noisy facial expression recognition
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105141
Abstract
Although existing facial expression recognition (FER) methods have achieved great success, their performance degrades significantly under noisy labels caused by low-quality images, ambiguous expressions, and subjective and incorrect labeling. ...
Highlights

An effective DC-Net is presented to learn robust representations to suppress noisy samples.
A novel Class Activation Mapping Attention Consistency is proposed to make model focus on partially important features.
A novel Class ...
0
Metrics
Total Citations0
research-article
September 2024
Multi-branch progressive embedding network for crowd counting
- Lifang Zhou,
- Songlin Rao,
- Weisheng Li,
- Bo Hu,
- Bo Sun
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105140
Abstract
Crowd counting is essential for video surveillance and public safety. The performance of counting models has been greatly improved with the rapid development of Convolution neural networks (CNN), while it still suffers interference from complex ...
Highlights

The insufficient scale hierarchy imposes some limitations on the performance.
The network is inclined to confuse dense regions and background regions.
Euclidean distance is strict pixel-level alignment will overfit the density map ...
0
Metrics
Total Citations0
research-article
September 2024
Localization of diffusion model-based inpainting through the inter-intra similarity of frequency features
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105138
Abstract
Recently, the enhanced abilities of diffusion models have led to more realistic inpainting results, which raises the potential for criminal activity through image forgery. In this study, we explore the detection of inpainted images generated by a ...
Graphical abstract

Display Omitted
Highlights

We demonstrate the localization of inpainting images generated by diffusion models.
We introduce a new image forensics dataset based on diffusion modelbased inpainting.
We propose an inter-patch relationship through the learnable ...
0
Metrics
Total Citations0
research-article
September 2024
A puzzle questions form training for self-supervised skeleton-based action recognition
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105137
Abstract
This paper proposed a novel pretext task to address the skeleton-based video representation for self-supervised action recognition tasks. Instead of exploiting only the whole body, various levels of the skeleton structure (e.g., upper body, lower ...
0
Metrics
Total Citations0
research-article
September 2024
Video object segmentation by multi-scale attention using bidirectional strategy
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105136
Abstract
This paper focuses on semi-supervised video object segmentation (VOS). Recently, several Space‐Time Memory based networks have effectively improved the performance of VOS. However, most methods predict the target object mask forwardly, which ...
Highlights

Propose a bidirectional strategy to alleviate the error propagation problem.
Propose a multi-scale module refine the prediction of object boundaries.
Non-local modules combined with channel attention.
Experimental results show ...
0
Metrics
Total Citations0
research-article
September 2024
Active domain adaptation for semantic segmentation via dynamically balancing domainness and uncertainty
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105132
Abstract
Active domain adaptation aims to enhance model adaptation performance by annotating a limited number of informative unlabeled target data. Traditional active learning strategies for semantic segmentation often neglect the presence of domain ...
Highlights

We propose a novel active domain adaptation framework for semantic segmentation.
The domainness score is designed to evaluate target samples' representativeness.
The prediction uncertainty is utilized to evaluate target samples' ...
0
Metrics
Total Citations0
review-article
September 2024
Enhancing small object tracking with reversible rescaling networks
- Yufei Zha,
- Xiao Guo,
- Fan Li,
- Hangfei Li
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105131
Abstract
In the rapidly evolving domain of visual tracking, the accurate identification and tracking of small objects pose significant challenges due to their minimal pixel presence and detail deficiency. In the field of small object tracking, utilizing ...
Highlights

We present a small object tracker using reversible networks to preserve detailed information.
We integrate two reversible networks: one for capturing small object details and another for enhancing distinguishability in complex ...
0
Metrics
Total Citations0
review-article
September 2024
Hierarchical slice interaction and multi-layer cooperative decoding networks for remote sensing image dehazing
- Mei Yu,
- ShouYi Xu,
- Hang Sun,
- YueLin Zheng,
- Wen Yang
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105129
Abstract
Recently, U-shaped neural networks have gained widespread application in remote sensing image dehazing and achieved promising performance. However, most of the existing U-shaped dehazing networks neglect the global and local information ...
Highlights

We propose a method (HSMD-Net) for haze removal in remote sensing images.
A new reconstruction module and a new information interaction module.
The method can produce reliable results under various concentrations of haze.
...
0
Metrics
Total Citations0
research-article
September 2024
Modality interactive attention for cross-modality person re-identification
- Zilin Zou,
- Ying Chen
Image and Vision Computing (IAVC), Volume 148, Issue Chttps://doi.org/10.1016/j.imavis.2024.105128
Abstract
The visible-infrared person re-identification (VI-ReID) task is challenging in image retrievals because of the modality gaps between visible and infrared images. Different from the most existing methods which either strive to capture modality ...
Highlights

Establish an interactive relation between modality-shared features and modality-specific features.
A modality interactive attention module is introduced to narrow down the modal gap.
Two feature extraction strategies have been ...
0
Metrics
Total Citations0

Applied Filters

People

Names

Institutions

Authors

Reviewers

Publications

All Publications

Content Type

Publisher

Publication Date

DASOD: Detail-aware salient object detection

Mitigating human fall injuries: A novel system utilizing 3D 4-stream convolutional neural networks and image fusion

CrowdAlign: Shared-weight dual-level alignment fusion for RGB-T crowd counting

Omnidirectional image quality assessment with local–global vision transformers

OFACD: An end-to-end change detection network for small UAVs remote sensing with viewpoint differences

Nighttime image semantic segmentation with retinex theory

Erratum to “Two-dimensional hybrid incremental learning (2DHIL) framework for semantic segmentation of skin tissues” [Image and Vision Computing. Vol148 (2024) 105098]

Two-dimensional hybrid incremental learning (2DHIL) framework for semantic segmentation of skin tissues

Predictive breast cancer diagnosis using ensemble fuzzy model

SMTCNN - A global spatio-temporal texture convolutional neural network for 3D dynamic texture recognition

Indirect deformable image registration using synthetic image generated by unsupervised deep learning

Dual-consistency constraints network for noisy facial expression recognition

Multi-branch progressive embedding network for crowd counting

Localization of diffusion model-based inpainting through the inter-intra similarity of frequency features

A puzzle questions form training for self-supervised skeleton-based action recognition

Video object segmentation by multi-scale attention using bidirectional strategy

Active domain adaptation for semantic segmentation via dynamically balancing domainness and uncertainty

Enhancing small object tracking with reversible rescaling networks

Hierarchical slice interaction and multi-layer cooperative decoding networks for remote sensing image dehazing

Modality interactive attention for cross-modality person re-identification