Learning task-specific discriminative embeddings for few-shot image classification
Recently, few-shot learning has attracted more and more attention. Generally, the fine-tuning-based few-shot learning framework contains two stages: i) In the pre-training stage, using base data to train the feature extractor; ii) In ...
Adaptive neural network decentralized fault-tolerant control for nonlinear interconnected fractional-order systems
This paper studies the neural network (NN) decentralized observer-based fault-tolerant control (FTC) design problem on the nonlinear interconnected fractional-order systems. The considered fractional-order systems have the non strict-...
Domain-aware multi-modality fusion network for generalized zero-shot learning
Generalized zero-shot learning (GZSL) is a challenging problem which aims to recognize images from both seen and unseen classes. Existing research suffers from the bias problem, which means that the model tends to misclassify an unseen ...
Knowledge interaction enhanced sequential modeling for interpretable learner knowledge diagnosis in intelligent tutoring systems
One of the fundamental tasks when providing personalized tutoring services to learners in online learning systems, such as intelligent tutoring systems and massive open online courses, is the learner knowledge diagnosis (LKD). LKD ...
You should know more: Learning external knowledge for visual dialog
Visual dialog is a task that two agents complete a multi-round conversation based on an image, a caption, and dialog histories. Despite the recent progress, existing methods still undergo degradation on the condition of complex ...
Multi-task joint training model for machine reading comprehension
Machine reading comprehension is an important topic in natural language processing. However, the redundancy of text and the diversity of answer types are neglected in existing machine reading comprehension. To tackle these issues, a ...
Point-to-point consensus tracking control for unknown nonlinear multi-agent systems using data-driven iterative learning
This paper considers the point-to-point consensus tracking control for a class of nonlinear multi-agent systems with completely unknown dynamics, where the consensus is concerned with some given desired points instead of the entire ...
Multimodal graph neural network for video procedural captioning
Video procedural captioning aims to generate detailed descriptive captions for all steps in a long instructional video. The peculiarity of this problem is the procedural dependency between the events to generate consistent captions ...
Self-supervised 3D human pose estimation from video
To accurately estimate 3D human pose from monocular camera images, a large amount of 3D annotated data is required. However, obtaining 3D annotated data outside the laboratory is not easy. In the absence of such data, weakly-supervised ...
FG-CF: Friends-aware graph collaborative filtering for POI recommendation
Collaborative filtering approach greatly promotes the development and application of personalized recommendation. In location-based social networks (LBSNs), the sparsity of check-in data is one of the main obstacles for traditional ...
Gain-scheduled state estimation for discrete-time complex networks under bit-rate constraints
In this paper, the gain-scheduled state estimation issue is investigated for a kind of complex networks subject to randomly occurring nonlinearities under bit-rate constraints. An array of random variables is introduced to govern the ...
Bipartite consensus for a class of nonlinear multi-agent systems under switching topologies: A disturbance observer-based approach
This paper considers the leader-following bipartite consensus for a class of nonlinear multi-agent systems (MASs) subject to exogenous disturbances under directed fixed and switching topologies, respectively. Firstly, two new output ...
Partial-neurons-based state estimation for artificial neural networks under constrained bit rate: The finite-time case
This paper is concerned with the partial-neuron-based finite-time state estimation problem for a class of artificial neural networks with time-varying delays. Measurements information from only a small fractional of the artificial ...
RefFaceNet: Reference-based Face Image Generation from Line Art Drawings
The flexible artistic creation of painting using deep neural networks has attracted lots of attention recently. Existing image-to-image translation approaches show powerful capabilities in producing photos between various domains. ...
Intelligent fault diagnosis based on sample weighted joint adversarial network
- Aiming at the cross-domain fault diagnosis of rotating components, a sample-weighted joint adversarial network is proposed, which explores category ...
In recent years, adversarial-based deep domain adaptation (DDA) has attracted increasing attention in transferable fault diagnosis. However, most of the existing methods mainly focus on the elimination of global distribution ...
Automatic sleep staging method of EEG signal based on transfer learning and fusion network
Automatic sleep staging technology is a current research hotspot in brain-computer interfaces, freeing sleep specialists from the time-consuming task of manually diagnosing sleep. It performs complex sleep staging tasks by ...
A new training approach for deep learning in EEG biometrics using triplet loss and EMG-driven additive data augmentation
One of the major challenges facing Electroencephalogram (EEG) in biometric systems is the high time-variability of the brainwaves, especially across different sessions. Electrical muscle activity, or Electromyogram (EMG), is considered ...
One-shot Video Graph Generation for Explainable Action Reasoning
Human action analysis is a critical yet challenging task for understanding diverse video content. Recently, to enable explainable reasoning of video actions, a spatio-temporal video graph structure was proposed to represent the video ...
A comprehensive survey on robust image watermarking
Display Omitted
Highlights
- A comprehensive survey on image watermarking schemes in different domains.
- ...
With the rapid development and popularity of the Internet, multimedia security has become a general essential concern. Especially, as manipulation of digital images gets much easier, the challenges it brings to authentication ...
Semi-supervised learning based on intra-view heterogeneity and inter-view compatibility for image classification
To achieve high value of classification results by using a limited number of training samples, designing a multi-view semi-supervised classification model is extremely urgent. Moreover, considering that graph learning can fully capture ...
Analysis of the electronic integrate and fire neuron model
Nano-scale devices are thought to intervene in natural life for a variety of responsibilities. For understanding the intrinsic communication of such nano-scale devices, software and hardware modalities have been introduced. Some of ...
EvoDCNN: An evolutionary deep convolutional neural network for image classification
- Neuroevolution to develop Deep Convolutional Neural Network (DCNN) for image classification.
Developing Deep Convolutional Neural Networks (DCNNs) for image classification is a complicated task that needs considerable effort and knowledge. By employing an evolutionary computation approach, one can automatically generate the ...
Action-dependent bidirectional contrastive predictive coding for neural belief representations
- Contrastive learning improves the data utilization of belief representations in POMDPs.
The key to solving the complex, partially observable Markov decision process (POMDP) with high-dimensional observations lies in explicit belief representations. However, the existing methods generally adopted the black-box model for ...
Feature adaptation-based multipeak-redetection spatial-aware correlation filter for object tracking
Visual tracking has always been an important research topic in the field of computer vision. During tracking, cluttered backgrounds, deformations, and occasional occlusion inevitably cause unpredictable appearance changes in the target,...
A reliable solder joint inspection method based on a light-weight point cloud network and modulated loss
High-speed and high-reliability automated quality inspection is widely demanded in the electronics industry. Recently, deep learning theory combined with many non-destructive technologies shows superior performance for inspecting ...
Ridge regression with adaptive additive rectangles and other piecewise functional templates
We propose a penalization algorithm for functional linear regression models, where the coefficient function β is shrunk towards a data-driven shape template γ. To the best of our knowledge, we employ the nonzero centered L 2 penalty in ...
Contour-guided saliency detection with long-range interactions
The guided search theory suggests that global sources of scenes are important for robust visual processing. In this study, we use an eye-tracking experiment to verify that contours play a crucial role in guiding visual attention. ...
SFNet: A slow feature extraction network for parallel linear and nonlinear dynamic process monitoring
- A Slow feature network (SFNet) method is proposed for dynamics extraction and parallel linear and nonlinear monitoring.
In a typical industrial process, there may exist both linear and nonlinear relationships among process variables. Besides, the existence of process dynamics poses challenges to process monitoring. Some monitoring methods have been ...
Learning soft threshold for sparse reparameterization using gradual projection operators
Deep neural networks (DNNs) have achieved great success in the field of computer vision in recent years. While being high-precision, the characteristic of over-parameterization impedes DNNs from being applied to lightweight devices. To ...