Author: Xia, Haiying

research-article

Dual-consistency constraints network for noisy facial expression recognition

Image and Vision Computing (IAVC), Volume 148, Issue C, https://doi.org/10.1016/j.imavis.2024.105141

Abstract

Although existing facial expression recognition (FER) methods have achieved great success, their performance degrades significantly under noisy labels caused by low-quality images, ambiguous expressions, and subjective and incorrect labeling. ...

Highlights

An effective DC-Net is presented to learn robust representations to suppress noisy samples.
A novel Class Activation Mapping Attention Consistency is proposed to make the model focus on partially important features.
A novel Class ...

research-article

Hard semantic mask strategy for automatic facial action unit recognition with teacher–student model

Multimedia Systems (MUME), Volume 30, Issue 4, https://doi.org/10.1007/s00530-024-01385-x

Abstract

Facial Action Coding System (FACS) is a widely used technique in affective computing, which defines a series of facial action units (AUs) corresponding to localized regions of the face. Fine-grained feature information of critical regions is ...

research-article

Learning from feature and label spaces’ bias for uncertainty-adaptive facial emotion recognition

Pattern Recognition Letters (PTRL), Volume 182, Issue C, Pages 97–103, https://doi.org/10.1016/j.patrec.2024.04.015

Abstract

Developing an accurate deep model for facial emotion recognition is a long-term challenge. It is because the uncertainty of emotions, stemming from the ambiguity of different emotional categories and the difference of subjective annotations, can ...

Highlights

We establish an uncertainty-adaptive framework via exploring the bias between two kinds of sample sets.
We customize two modules, namely a cross-space attention consistency learning module and a soft-label learning module.
The experimental ...

research-article

Learning informative and discriminative semantic features for robust facial expression recognition

Journal of Visual Communication and Image Representation (JVCIR), Volume 98, Issue C, https://doi.org/10.1016/j.jvcir.2024.104062

Abstract

Facial expression recognition (FER) becomes challenging in real-world scenarios, which requires learning informative and discriminative features from challenging datasets to obtain robust facial expression recognition. In this paper, we propose ...

Highlights

An effective IDSFL network is presented to learn robust representations for FER in the wild.
A novel multi-channel feature modulator incorporating Gabor features is proposed to learn informative features.
A specific emotion-aware ...

research-article

RT-Net: Region-Enhanced Attention Transformer Network for Polyp Segmentation

Neural Processing Letters (NPLE), Volume 55, Issue 9, Pages 11975–11991, https://doi.org/10.1007/s11063-023-11405-y

Abstract

Colonic polyps are highly correlated with colorectal cancer. Prevention of colorectal cancer is the detection and removal of polyps in the early stages of the disease. But the detection process relies on the physician’s experience and is prone to ...

research-article

Feature fusion of multi-granularity and multi-scale for facial expression recognition

The Visual Computer: International Journal of Computer Graphics (VISC), Volume 40, Issue 3, Pages 2035–2047, https://doi.org/10.1007/s00371-023-02900-3

Abstract

Although great progress has been made in facial expression recognition, it still faces challenges such as occlusion and pose changes in real-world scenarios. To address this issue, we propose a simple yet effective multi-granularity and multi-scale ...

research-article

ST-VQA: shrinkage transformer with accurate alignment for visual question answering

Applied Intelligence (KLU-APIN), Volume 53, Issue 18, Pages 20967–20978, https://doi.org/10.1007/s10489-023-04564-x

Abstract

While transformer-based models have been remarkably successful in the field of visual question answering (VQA), their approaches to achieve vision and language feature alignment are simple and coarse. In recent years, this shortcoming has been ...

research-article

Three-dimensional quantum wavelet transforms

Frontiers of Computer Science: Selected Publications from Chinese Universities (FCS), Volume 17, Issue 5, https://doi.org/10.1007/s11704-022-1639-y

Abstract

Wavelet transform is being widely used in the field of information processing. One-dimension and two-dimension quantum wavelet transforms have been investigated as important tool algorithms. However, three-dimensional quantum wavelet transforms ...

research-article

Collaborative learning network for head pose estimation

Image and Vision Computing (IAVC), Volume 127, Issue C, https://doi.org/10.1016/j.imavis.2022.104555

Highlights

Propose a collaborative learning framework for head pose estimation.
Learn ...

Abstract

Head pose estimation is an important task in many real-world applications, such as human–computer interaction, driver monitoring, face localization and gaze estimation. In this paper, we present a novel collaborative learning framework ...

research-article

HRNet:A hierarchical recurrent convolution neural network for retinal vessel segmentation

Multimedia Tools and Applications (MTAA), Volume 81, Issue 28, Pages 39829–39851, https://doi.org/10.1007/s11042-022-12696-4

Abstract

The extraction of retinal vessel is of great importance in the diagnosis of fundus disease. Many approaches have been proposed for vessel segmentation. However, these models have some drawbacks. First, the encoder-decoder structures, U-Net i.e., ...

research-article

MFC-Net: Multi-scale fusion coding network for Image Deblurring

Applied Intelligence (KLU-APIN), Volume 52, Issue 11, Pages 13232–13249, https://doi.org/10.1007/s10489-021-02993-0

Abstract

The existing image blind deblurring methods mostly adopt the “coarse-to-fine” scheme, which always requires a mass of parameters and cannot mine the blur information effectively. To tackle the above problems, we design a lightweight multi-scale ...

Article

Cooperative Positioning Enhancement for HDVs and CAVs Coexisting Environment Using Deep Neural Networks

Advances in Swarm Intelligence, Pages 118–131, https://doi.org/10.1007/978-3-031-09726-3_11

Abstract

Accurate vehicle positioning is a key technology affecting traffic safety and travel efficiency. High precision positioning technology combined with the internet of vehicles (IoV) can improve the positioning accuracy of human-driving vehicles (...

research-article

HT-Net: hierarchical context-attention transformer network for medical ct image segmentation

Applied Intelligence (KLU-APIN), Volume 52, Issue 9, Pages 10692–10705, https://doi.org/10.1007/s10489-021-03010-0

Abstract

Convolutional neural networks (CNNs) have been a prevailing technique in the field of medical CT image processing. Although encoder-decoder CNNs exploit locality for efficiency, they cannot adequately model remote pixel relationships. Recent works ...

research-article

A multi-scale gated network for retinal hemorrhage detection

Applied Intelligence (KLU-APIN), Volume 53, Issue 5, Pages 5259–5273, https://doi.org/10.1007/s10489-022-03476-6

Abstract

Retinal hemorrhage detection is of great significance for clinical diagnosis and disease control. However, most of the traditional methods need to obtain candidate lesions firstly, and then determine the true lesions. To address this problem, we ...

research-article

ECA-CBAM: Classification of Diabetic Retinopathy: Classification of diabetic retinopathy by cross-combined attention mechanism

ICIAI '22: Proceedings of the 2022 6th International Conference on Innovation in Artificial Intelligence, Pages 78–82, https://doi.org/10.1145/3529466.3529468

Diabetic retinopathy is an ophthalmological disease that causes bleeding in the fundus and loss of vision due to damage to blood vessels in the retina. It is one of the main causes of vision ...

research-article

Safety and energy-saving driving behaviour evaluation with driving feature constraint TOPSIS method

International Journal of Computing Science and Mathematics (IJCSM), Volume 16, Issue 1, Pages 59–70, https://doi.org/10.1504/ijcsm.2022.126769

There are many factors including driving behaviours, roads, weather to affect the safety and energy-saving of the vehicle and these driving behaviours have different features which impact the safety and energy-saving. To improve the performance of safety ...

research-article

MC-Net: multi-scale context-attention network for medical CT image segmentation

Applied Intelligence (KLU-APIN), Volume 52, Issue 2, Pages 1508–1519, https://doi.org/10.1007/s10489-021-02506-z

Abstract

The encoder-decoder CNN architecture has greatly improved CT medical image segmentation, but it encounters a bottleneck due to the loss of details in the encoding process, which limits the accuracy improvement. To address this problem, we propose ...

research-article

Style transfer for QR code

Multimedia Tools and Applications (MTAA), Volume 79, Issue 45-46, Pages 33839–33852, https://doi.org/10.1007/s11042-019-08555-4

Abstract

Due to fast scanning response and strong damage resistance, Quick Response (QR) code has been used widely in product tracking, item identification, time tracking, document management, and general marketing. The standard QR code consisting of black ...

research-article

Md-Net: Multi-scale Dilated Convolution Network for CT Images Segmentation

Neural Processing Letters (NPLE), Volume 51, Issue 3, Pages 2915–2927, https://doi.org/10.1007/s11063-020-10230-x

Abstract

Accurate CT image segmentation is of great importance to the clinical diagnosis. Due to the high similarity of gray values in CT image, the segmented areas are easily affected by their surroundings, which leads to the loss of semantic information. ...

research-article

Quantum circuit design of approximate median filtering with noise tolerance threshold

Quantum Information Processing (JQIP), Volume 19, Issue 6, https://doi.org/10.1007/s11128-020-02678-6

Abstract

Quantum median filtering is an important step for many quantum signal processing algorithms. Current quantum median filtering designs show limitations in either computational complexity or incomplete noise detection. We propose a design of quantum ...
