Multi-start team orienteering problem for UAS mission re-planning with data-efficient deep reinforcement learning
In this paper, we study the Multi-Start Team Orienteering Problem (MSTOP), a mission re-planning problem where vehicles are initially located away from the depot and have different amounts of fuel. We consider/assume the goal of multiple vehicles ...
DS-MSFF-Net: Dual-path self-attention multi-scale feature fusion network for CT image segmentation
Computed tomography (CT) is an important technique that is widely used in disease screening and diagnosis. In order to assist doctors in diagnosis and treatment plans, an efficient and accurate automatic image segmentation technology is urgently ...
Audio-visual speech synthesis using vision transformer–enhanced autoencoders with ensemble of loss functions
Audio-visual speech synthesis (AVSS) has garnered attention in recent years for its utility in the realm of audio-visual learning. AVSS transforms one speaker’s speech into another’s audio-visual stream while retaining linguistic content. This ...
Semi-supervised diagnosis of wind-turbine gearbox misalignment and imbalance faults
Both wear-induced bearing failure and misalignment of the powertrain between the rotor and the electrical generator are common failure modes in wind-turbine motors. In this study, Semi-Supervised Learning (SSL) is applied to a fault detection and ...
SelfPAB: large-scale pre-training on accelerometer data for human activity recognition
Annotating accelerometer-based physical activity data remains a challenging task, limiting the creation of robust supervised machine learning models due to the scarcity of large, labeled, free-living human activity recognition (HAR) datasets. ...
Video-based beat-by-beat blood pressure monitoring via transfer deep-learning
Currently, learning physiological vital signs such as blood pressure (BP), hemoglobin levels, and oxygen saturation, from Photoplethysmography (PPG) signal, is receiving more attention. Despite successive progress that has been made so far, ...
Evolutionary dynamic grouping based cooperative co-evolution algorithm for large-scale optimization
To effectively address large-scale optimization problems, this paper proposes an evolutionary dynamic grouping (EDG) based cooperative co-evolution (CC) algorithm. In the proposed algorithm, a novel decomposition method is designed to generate the ...
A two-stage approach solo_GAN for overlapping cervical cell segmentation based on single-cell identification and boundary generation
Accurate cell segmentation is a pivotal step throughout the cervical cancer treatment continuum, encompassing early screening, guiding treatment decisions, and assessing long-term prognosis. Currently, in clinical practice, pathologists rely on ...
Multi-state delayed echo state network with empirical wavelet transform for time series prediction
In this paper, considering the effect of multiple delayed states on the reservoir itself, based on the advantage of the empirical wavelet transform, an improved ESN with multiple delayed states is proposed, called multi-state delayed echo state ...
Conditional probability table limit-based quantization for Bayesian networks: model quality, data fidelity and structure score
Bayesian Networks (BN) are robust probabilistic graphical models mainly used with discrete random variables requiring discretization and quantization of continuous data. Quantization is known to affect model accuracy, speed and interpretability, ...
Physics-informed deep learning to quantify anomalies for real-time fault mitigation in 3D printing
In 3D printing processes, there are many thermal stress related defects that can have a significant negative impact on the shape and size of the structure. Such anomalies in the heat transfer of the printing process need to be detected at an early ...
A novel lightweight CNN for chest X-ray-based lung disease identification on heterogeneous embedded system
The global spread of epidemic lung diseases, including COVID-19, underscores the need for efficient diagnostic methods. Addressing this, we developed and tested a computer-aided, lightweight Convolutional Neural Network (CNN) for rapid and ...
Resolution-sensitive self-supervised monocular absolute depth estimation
Depth estimation is an essential component of computer vision applications for environment perception, 3D reconstruction and scene understanding. Among the available methods, self-supervised monocular depth estimation is noteworthy for its cost-...
Visual contextual relationship augmented transformer for image captioning
The image captioning task is among the most important tasks in computer vision. Most existing methods mine more useful contextual information from image features. Similarly, to mine more contextual information, this paper proposes a visual ...
A hybrid information-based two-phase expansion algorithm for community detection with imbalanced scales
The scale of communities in real-world networks is often imbalanced, which has a significant impact on community detection performance. Existing approaches exhibit a trade-off between accuracy and computational cost, with global methods offering ...
Network traffic grant classification based on 1DCNN-TCN-GRU hybrid model
Accurate grant classification of network traffic not only assists service providers in making acceptable allocations based on actual business demands, but also ensures service quality. To further improve the accuracy of traffic classification, we ...
FeSTGCN: A frequency-enhanced spatio-temporal graph convolutional network for traffic flow prediction under adaptive signal timing
Traffic flow prediction is the fundamental cornerstone of intelligent urban transportation systems. However, existing research has predominantly focused on exploring spatiotemporal dependencies within the spatial and temporal domains, often ...
Resource allocation in heterogeneous network with node and edge enhanced graph attention network
In wireless networks, the effectiveness of beamforming and power allocation strategies is crucial in meeting the increasing data demands of users and ensuring rapid data transmission. Graph learning approaches have been developed to tackle complex ...
Domain generalization based on domain-specific adversarial learning
Deep learning models often suffer from degraded performance when the distributions of the training and testing data differ (i.e., domain shift). Domain generalization (DG) techniques can help improve the generalization performance for unseen ...
Incremental feature selection approach to multi-dimensional variation based on matrix dominance conditional entropy for ordered data set
Rough set theory is a mathematical tool widely employed in various fields to handle uncertainty. Feature selection, as an essential and independent research area within rough set theory, aims to identify a small subset of important features by ...
Causal inference in the medical domain: a survey
Causal inference is considered a crucial topic in the medical field, as it enables the determination of causal effects for medical treatments through data analysis. However, the vast volume and complexity of medical data present significant ...
Multiple reference points-based multi-objective feature selection for multi-label learning
In the real world, data often exhibits high-dimensional and complex characteristics. In addition, an object may correspond to multiple class labels. Therefore, filtering and processing such data has become a hot research topic. Multi-label feature ...
Modeling essay grading with pre-trained BERT features
Writing essays is an important skill which enables one to clearly write the ideas and understanding of certain topic with the help of language articulation and examples. Writing essay is a skill so is the grading of those essays. It requires a lot ...
Self-knowledge distillation enhanced binary neural networks derived from underutilized information
Binarization efficiently compresses full-precision convolutional neural networks (CNNs) to achieve accelerated inference but with substantial performance degradations. Self-knowledge distillation (SKD) can significantly improve the performance of ...
Polyphonic sound event localization and detection using channel-wise FusionNet
Sound Event Localization and Detection (SELD) is the task of spatial and temporal localization of various sound events and their classification. Commonly, multitask models are used to perform SELD. In this work, a deep learning network model named ...
SS-MVMETRO: Semi-supervised multi-view human mesh recovery transformer
Parametric methods are widely utilized in RGB-based human mesh recovery, relying on precise statistical human body model parameters that are challenging to obtain. In contrast, non-parametric transformer-based approaches excel but are applied only ...