Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleMarch 2024
Dual states based reinforcement learning for fast MR scan and image reconstruction
AbstractIncomplete phase encoding with few phases is an effective under-sampling manner of fast Magnetic Resonance (MR) scan. The key is how to choose important slice-specific phases. Reinforcement Learning (RL) is powerful for sequential decision-making ...
- research-articleMarch 2024
Progressive network based on detail scaling and texture extraction: A more general framework for image deraining
AbstractMany feature extraction components have been proposed for image deraining tasks, aiming to improve feature learning. However, few models have addressed the integration of multi-scale features from derain images. The fusion of multiple features at ...
Highlights- Introduce detail scaling module to extract generalized features from rainfall image.
- Improved Transform block was introduced to enhance the model’s generalized ability.
- Scale-mixing strategy is proposed for capturing more multi-...
- research-articleMarch 2024
Rotation-equivariant correspondence matching based on a dual-activation mixer
AbstractLearning-based correspondence matching methods have become the mainstream techniques in many computer vision and robotics applications due to their robustness to large illumination and viewpoint changes. However, it is difficult for conventional ...
- research-articleMarch 2024
Introducing shape priors in Siamese networks for image classification
AbstractThe efficiency of deep neural networks is increasing, and so is the amount of annotated data required for training them. We propose a solution improving the learning process of a classification network with less labeled data. Our approach is to ...
Highlights- We propose a solution to learn a classification network with less labeled data.
- The principle is to provide the classifier a binary mask as a simple shape prior.
- We resort to a Siamese architecture and feed it with images and shape ...
- research-articleMarch 2024
Scarcity-GAN: Scarce data augmentation for defect detection via generative adversarial nets
AbstractData augmentation is a crucial and challenging task for improving defect detection with limited data. Many generative models have been proposed and shown promising performance on this task. However, existing models are unable to capture the fine ...
-
- research-articleMarch 2024
Knowledge distillation for object detection based on Inconsistency-based Feature Imitation and Global Relation Imitation
AbstractKnowledge distillation (KD) is a method that transfers information from a larger network (teacher) to a smaller network (student) to obtain stronger performance without extra computational load. It has made great success in image classification, ...
- research-articleMarch 2024
Detection of dangerously approaching vehicles over onboard cameras by speed estimation from apparent size
- Iván García-Aguilar,
- Jorge García-González,
- Daniel Medina,
- Rafael Marcos Luque-Baena,
- Enrique Domínguez,
- Ezequiel López-Rubio
AbstractAutonomous driving requires information such as the velocity of other vehicles to prevent potential hazards. This work proposes a real-time deep learning-based framework to estimate vehicle speeds from image captures through an onboard camera. ...
Highlights- Use object detection and linear regression to estimate moving vehicle speeds.
- Comparison of the methodology with LIDAR-based speed estimation techniques.
- Evaluation of the methodology on the Prevention dataset.
- articleMarch 2024
Scene Graph Generation: A comprehensive survey
- Hongsheng Li,
- Guangming Zhu,
- Liang Zhang,
- Youliang Jiang,
- Yixuan Dang,
- Haoran Hou,
- Peiyi Shen,
- Xia Zhao,
- Syed Afaq Ali Shah,
- Mohammed Bennamoun
AbstractDeep learning techniques have led to remarkable breakthroughs in the field of object detection and have spawned a lot of scene-understanding tasks in recent years. Scene graph has been the focus of research because of its powerful semantic ...
Graphical abstractDisplay Omitted
Highlights- A comprehensive review of 138 papers on scene graph generation is presented.
- We analyze 2D scene graph generation, focusing on feature representation.
- A review of typical datasets for 2D, spatio-temporal and 3D scene graph ...
- research-articleMarch 2024
Anchor Ball Regression Model for large-scale 3D skull landmark detection
AbstractRecent deep learning models have exhibited impressive performance in the area of 3D skull landmark detection, but most of them aimed to detect a fixed number of landmarks. This paper focuses on automatically detecting an arbitrary number of ...
- articleMarch 2024
A systematic review of image-level camouflaged object detection with deep learning
AbstractCamouflaged object detection (COD) aims to search and identify disguised objects that are hidden in their surrounding environment, thereby deceiving the human visual system. As an interesting and challenging task, COD has received increasing ...
- research-articleMarch 2024
Collaborative representation based cross-domain semantic transfer for vehicle re-identification
AbstractVehicle re-identification (re-ID) has received increasing attention due to its tremendous potential in practical intelligent transportation scenarios. The main difficulties of vehicle re-ID are generally induced by insufficient annotations, ...
- research-articleMarch 2024
A constrastive semi-supervised deep learning framework for land cover classification of satellite time series with limited labels
AbstractIn this work, we present a new semi-supervised learning framework to cope with satellite image time series (SITS) classification in a data paucity scenario, considering extreme low levels of supervision. The proposed methodology, referred as S 3 ...
- research-articleMarch 2024
Towards better transition modeling in recurrent neural networks: The case of sign language tokenization
AbstractRecurrent neural networks (RNNs) are a popular family of models widely used when facing sequential data such as videos. However, RNNs make assumptions about state transitions that could be damageable. This paper presents two theoretical ...
Highlights- Sign language tokenization is a challenging task involving complex transitions.
- The limitations of RNNs highlighted in theory can be observed in practice.
- Extensions have a positive impact, but there is still room for improvement.
- research-articleMarch 2024
SFAMNet: A scene flow attention-based micro-expression network
AbstractTremendous progress has been made in facial Micro-Expression (ME) spotting and recognition; however, most works have focused on either spotting or recognition tasks on the 2D videos. Until recently, the estimation of the 3D motion field (a.k.a ...
Highlights- First study to spot and recognize micro-expressions on the multi-modal dataset.
- A scene flow attention-based end-to-end multi-stream multi-task network is proposed.
- Scene flow is extracted to estimate the 3D motion using color and ...
- research-articleMarch 2024
Automated detection of vertebral fractures from X-ray images: A novel machine learning model and survey of the field
- Li-Wei Cheng,
- Hsin-Hung Chou,
- Yu-Xuan Cai,
- Kuo-Yuan Huang,
- Chin-Chiang Hsieh,
- Po-Lun Chu,
- I-Szu Cheng,
- Sun-Yuan Hsieh
AbstractVertebral fractures are a common problem and the most prevalent of thoracolumbar compression and burst fractures. However, vertebral fractures are difficult to diagnose: an experienced orthopedist or radiologist is required to detect and ...
Highlights- Proposed AI model for vertebral fracture detection using YOLOv4 and ResUNet.
- Achieved high precision (99%, 74%, 94%) in identifying healthy, compression, and burst fractures.
- First literature mention of detecting compression and ...
- articleMarch 2024
A review of coverless steganography
AbstractWith the enhancement of people’s security awareness, transmitting secret information securely has gradually become a demand for the public. Steganography is a technology of representing secret information within another carrier, aiming at ...
- research-articleFebruary 2024
Attention Round for post-training quantization
AbstractQuantization methods for convolutional neural network models can be broadly categorized into post-training quantization (PTQ) and quantization aware training (QAT). While PTQ offers the advantage of requiring only a small portion of the data for ...
Highlights- Attention Round quantization function expands the quantization optimization space.
- Mixed precision allocation method improves mixed precision quantization efficiency.
- Enriched lightweight CNNs contribute to applications in resource-...
- research-articleFebruary 2024
Neuromorphic imaging and classification with graph learning
AbstractBio-inspired neuromorphic cameras asynchronously record pixel brightness changes and generate sparse event streams. They can capture dynamic scenes with little motion blur and more details in extreme illumination conditions. Due to the ...
Highlights- Asynchronous event streams of neuromorphic cameras shape synchronous graph representations.
- Graph Transformer effectively learns informative features from event-based graphs.
- Temporal-wise connectivity and learning suffer less from ...
- research-articleFebruary 2024
Improving robustness for vision transformer with a simple dynamic scanning augmentation
AbstractVision Transformer (ViT) has demonstrated promising performance in computer vision tasks, comparable to state-of-the-art neural networks. Yet, this new type of deep neural network architecture is vulnerable to adversarial attacks limiting its ...
- research-articleFebruary 2024
Joint learning of motion deblurring and defocus deblurring networks with a real-world dataset
AbstractWhen recovering the sharp image from a blurry observation, moving objects, e.g., people and vehicles, usually attract more perceptual attention. However, existing motion deblurring methods primarily focus on removing the global camera motion blur,...