Computer vision

Applied Filters

People

Publications

Publication Date

Searched The ACM Guide to Computing Literature (3,718,616 records)|Limit your search to The ACM Full-Text Collection (746,404 records)

Showing 1 - 20of2,515 Results

Filters

Select All

Export Citations Save to Binder

per page:

Relevance

research-article
March 2024
Dual states based reinforcement learning for fast MR scan and image reconstruction
Neurocomputing (NEUROC), Volume 568, Issue CFeb 2024https://doi.org/10.1016/j.neucom.2023.127067
Abstract
Incomplete phase encoding with few phases is an effective under-sampling manner of fast Magnetic Resonance (MR) scan. The key is how to choose important slice-specific phases. Reinforcement Learning (RL) is powerful for sequential decision-making ...
0
Metrics
Total Citations0
research-article
March 2024
Progressive network based on detail scaling and texture extraction: A more general framework for image deraining
Neurocomputing (NEUROC), Volume 568, Issue CFeb 2024https://doi.org/10.1016/j.neucom.2023.127066
Abstract
Many feature extraction components have been proposed for image deraining tasks, aiming to improve feature learning. However, few models have addressed the integration of multi-scale features from derain images. The fusion of multiple features at ...
Highlights

Introduce detail scaling module to extract generalized features from rainfall image.
Improved Transform block was introduced to enhance the model’s generalized ability.
Scale-mixing strategy is proposed for capturing more multi-...
1
Metrics
Total Citations1
research-article
March 2024
Rotation-equivariant correspondence matching based on a dual-activation mixer
Neurocomputing (NEUROC), Volume 568, Issue CFeb 2024https://doi.org/10.1016/j.neucom.2023.127053
Abstract
Learning-based correspondence matching methods have become the mainstream techniques in many computer vision and robotics applications due to their robustness to large illumination and viewpoint changes. However, it is difficult for conventional ...
0
Metrics
Total Citations0
research-article
March 2024
Introducing shape priors in Siamese networks for image classification
Neurocomputing (NEUROC), Volume 568, Issue CFeb 2024https://doi.org/10.1016/j.neucom.2023.127034
Abstract
The efficiency of deep neural networks is increasing, and so is the amount of annotated data required for training them. We propose a solution improving the learning process of a classification network with less labeled data. Our approach is to ...
Highlights

We propose a solution to learn a classification network with less labeled data.
The principle is to provide the classifier a binary mask as a simple shape prior.
We resort to a Siamese architecture and feed it with images and shape ...
0
Metrics
Total Citations0
research-article
March 2024
Scarcity-GAN: Scarce data augmentation for defect detection via generative adversarial nets
Neurocomputing (NEUROC), Volume 566, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.127061
Abstract
Data augmentation is a crucial and challenging task for improving defect detection with limited data. Many generative models have been proposed and shown promising performance on this task. However, existing models are unable to capture the fine ...
0
Metrics
Total Citations0
research-article
March 2024
Knowledge distillation for object detection based on Inconsistency-based Feature Imitation and Global Relation Imitation
- Peng Ju,
- Yi Zhang
Neurocomputing (NEUROC), Volume 566, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.127060
Abstract
Knowledge distillation (KD) is a method that transfers information from a larger network (teacher) to a smaller network (student) to obtain stronger performance without extra computational load. It has made great success in image classification, ...
0
Metrics
Total Citations0
research-article
March 2024
Detection of dangerously approaching vehicles over onboard cameras by speed estimation from apparent size
Neurocomputing (NEUROC), Volume 567, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.127057
Abstract
Autonomous driving requires information such as the velocity of other vehicles to prevent potential hazards. This work proposes a real-time deep learning-based framework to estimate vehicle speeds from image captures through an onboard camera. ...
Highlights

Use object detection and linear regression to estimate moving vehicle speeds.
Comparison of the methodology with LIDAR-based speed estimation techniques.
Evaluation of the methodology on the Prevention dataset.
1
Metrics
Total Citations1
article
March 2024
Scene Graph Generation: A comprehensive survey
Neurocomputing (NEUROC), Volume 566, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.127052
Abstract
Deep learning techniques have led to remarkable breakthroughs in the field of object detection and have spawned a lot of scene-understanding tasks in recent years. Scene graph has been the focus of research because of its powerful semantic ...
Graphical abstract

Display Omitted
Highlights

A comprehensive review of 138 papers on scene graph generation is presented.
We analyze 2D scene graph generation, focusing on feature representation.
A review of typical datasets for 2D, spatio-temporal and 3D scene graph ...
0
Metrics
Total Citations0
research-article
March 2024
Anchor Ball Regression Model for large-scale 3D skull landmark detection
- Tao He,
- Guikun Xu,
- Li Cui,
- Wei Tang,
- Jie Long,
- Jixiang Guo
Neurocomputing (NEUROC), Volume 567, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.127051
Abstract
Recent deep learning models have exhibited impressive performance in the area of 3D skull landmark detection, but most of them aimed to detect a fixed number of landmarks. This paper focuses on automatically detecting an arbitrary number of ...
0
Metrics
Total Citations0
article
March 2024
A systematic review of image-level camouflaged object detection with deep learning
Neurocomputing (NEUROC), Volume 566, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.127050
Abstract
Camouflaged object detection (COD) aims to search and identify disguised objects that are hidden in their surrounding environment, thereby deceiving the human visual system. As an interesting and challenging task, COD has received increasing ...
0
Metrics
Total Citations0
research-article
March 2024
Collaborative representation based cross-domain semantic transfer for vehicle re-identification
- Yun Li,
- Fan Yang,
- Yudou Tian,
- Xuejun Wang,
- Qi Chen,
- Peiguang Jing
Neurocomputing (NEUROC), Volume 567, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.127039
Abstract
Vehicle re-identification (re-ID) has received increasing attention due to its tremendous potential in practical intelligent transportation scenarios. The main difficulties of vehicle re-ID are generally induced by insufficient annotations, ...
0
Metrics
Total Citations0
research-article
March 2024
A constrastive semi-supervised deep learning framework for land cover classification of satellite time series with limited labels
Neurocomputing (NEUROC), Volume 567, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.127031
Abstract
In this work, we present a new semi-supervised learning framework to cope with satellite image time series (SITS) classification in a data paucity scenario, considering extreme low levels of supervision. The proposed methodology, referred as S 3 ...
0
Metrics
Total Citations0
research-article
March 2024
Towards better transition modeling in recurrent neural networks: The case of sign language tokenization
Neurocomputing (NEUROC), Volume 567, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.127018
Abstract
Recurrent neural networks (RNNs) are a popular family of models widely used when facing sequential data such as videos. However, RNNs make assumptions about state transitions that could be damageable. This paper presents two theoretical ...
Highlights

Sign language tokenization is a challenging task involving complex transitions.
The limitations of RNNs highlighted in theory can be observed in practice.
Extensions have a positive impact, but there is still room for improvement.
0
Metrics
Total Citations0
research-article
March 2024
SFAMNet: A scene flow attention-based micro-expression network
Neurocomputing (NEUROC), Volume 566, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.126998
Abstract
Tremendous progress has been made in facial Micro-Expression (ME) spotting and recognition; however, most works have focused on either spotting or recognition tasks on the 2D videos. Until recently, the estimation of the 3D motion field (a.k.a ...
Highlights

First study to spot and recognize micro-expressions on the multi-modal dataset.
A scene flow attention-based end-to-end multi-stream multi-task network is proposed.
Scene flow is extracted to estimate the 3D motion using color and ...
0
Metrics
Total Citations0
research-article
March 2024
Automated detection of vertebral fractures from X-ray images: A novel machine learning model and survey of the field
Neurocomputing (NEUROC), Volume 566, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.126946
Abstract
Vertebral fractures are a common problem and the most prevalent of thoracolumbar compression and burst fractures. However, vertebral fractures are difficult to diagnose: an experienced orthopedist or radiologist is required to detect and ...
Highlights

Proposed AI model for vertebral fracture detection using YOLOv4 and ResUNet.
Achieved high precision (99%, 74%, 94%) in identifying healthy, compression, and burst fractures.
First literature mention of detecting compression and ...
0
Metrics
Total Citations0
article
March 2024
A review of coverless steganography
Neurocomputing (NEUROC), Volume 566, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.126945
Abstract
With the enhancement of people’s security awareness, transmitting secret information securely has gradually become a demand for the public. Steganography is a technology of representing secret information within another carrier, aiming at ...
0
Metrics
Total Citations0
research-article
February 2024
Attention Round for post-training quantization
Neurocomputing (NEUROC), Volume 565, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.127012
Abstract
Quantization methods for convolutional neural network models can be broadly categorized into post-training quantization (PTQ) and quantization aware training (QAT). While PTQ offers the advantage of requiring only a small portion of the data for ...
Highlights

Attention Round quantization function expands the quantization optimization space.
Mixed precision allocation method improves mixed precision quantization efficiency.
Enriched lightweight CNNs contribute to applications in resource-...
0
Metrics
Total Citations0
research-article
February 2024
Neuromorphic imaging and classification with graph learning
Neurocomputing (NEUROC), Volume 565, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.127010
Abstract
Bio-inspired neuromorphic cameras asynchronously record pixel brightness changes and generate sparse event streams. They can capture dynamic scenes with little motion blur and more details in extreme illumination conditions. Due to the ...
Highlights

Asynchronous event streams of neuromorphic cameras shape synchronous graph representations.
Graph Transformer effectively learns informative features from event-based graphs.
Temporal-wise connectivity and learning suffer less from ...
0
Metrics
Total Citations0
research-article
February 2024
Improving robustness for vision transformer with a simple dynamic scanning augmentation
- Shashank Kotyan,
- Danilo Vasconcellos Vargas
Neurocomputing (NEUROC), Volume 565, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.127000
Abstract
Vision Transformer (ViT) has demonstrated promising performance in computer vision tasks, comparable to state-of-the-art neural networks. Yet, this new type of deep neural network architecture is vulnerable to adversarial attacks limiting its ...
0
Metrics
Total Citations0
research-article
February 2024
Joint learning of motion deblurring and defocus deblurring networks with a real-world dataset
- Yu Li,
- Xinya Shu,
- Dongwei Ren,
- Qince Li,
- Wangmeng Zuo
Neurocomputing (NEUROC), Volume 565, Issue CJan 2024https://doi.org/10.1016/j.neucom.2023.126996
Abstract
When recovering the sharp image from a blurry observation, moving objects, e.g., people and vehicles, usually attract more perceptual attention. However, existing motion deblurring methods primarily focus on removing the global camera motion blur,...
0
Metrics
Total Citations0

Applied Filters

People

Names

Institutions

Authors

Publications

All Publications

Content Type

Publisher

Publication Date

Dual states based reinforcement learning for fast MR scan and image reconstruction

Progressive network based on detail scaling and texture extraction: A more general framework for image deraining

Rotation-equivariant correspondence matching based on a dual-activation mixer

Introducing shape priors in Siamese networks for image classification

Scarcity-GAN: Scarce data augmentation for defect detection via generative adversarial nets

Knowledge distillation for object detection based on Inconsistency-based Feature Imitation and Global Relation Imitation

Detection of dangerously approaching vehicles over onboard cameras by speed estimation from apparent size

Scene Graph Generation: A comprehensive survey

Anchor Ball Regression Model for large-scale 3D skull landmark detection

A systematic review of image-level camouflaged object detection with deep learning

Collaborative representation based cross-domain semantic transfer for vehicle re-identification

A constrastive semi-supervised deep learning framework for land cover classification of satellite time series with limited labels

Towards better transition modeling in recurrent neural networks: The case of sign language tokenization

SFAMNet: A scene flow attention-based micro-expression network

Automated detection of vertebral fractures from X-ray images: A novel machine learning model and survey of the field

A review of coverless steganography

Attention Round for post-training quantization

Neuromorphic imaging and classification with graph learning

Improving robustness for vision transformer with a simple dynamic scanning augmentation

Joint learning of motion deblurring and defocus deblurring networks with a real-world dataset