Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (4,226)

Search Parameters:
Keywords = multi-scale features

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
18 pages, 5207 KiB  
Article
MAPPNet: A Multi-Scale Attention Pyramid Pooling Network for Dental Calculus Segmentation
by Tianyu Nie, Shihong Yao, Di Wang, Conger Wang and Yishi Zhao
Appl. Sci. 2024, 14(16), 7273; https://doi.org/10.3390/app14167273 (registering DOI) - 19 Aug 2024
Viewed by 61
Abstract
Dental diseases are among the most prevalent diseases globally, and accurate segmentation of dental calculus images plays a crucial role in periodontal disease diagnosis and treatment planning. However, the current methods are not stable and reliable enough due to the variable morphology of [...] Read more.
Dental diseases are among the most prevalent diseases globally, and accurate segmentation of dental calculus images plays a crucial role in periodontal disease diagnosis and treatment planning. However, the current methods are not stable and reliable enough due to the variable morphology of dental calculus and the blurring of the boundaries between the dental edges and the surrounding tissues; therefore, our hope is to propose an accurate and reliable calculus segmentation algorithm to improve the efficiency of clinical detection. We propose a multi-scale attention pyramid pooling network (MAPPNet) to enhance the performance of dental calculus segmentation. The network incorporates a multi-scale fusion strategy in both the encoder and decoder, forming a model with a dual-ended multi-scale structure. This design, in contrast to employing a multi-scale fusion scheme at a single end, enables more effective capturing of features from diverse scales. Furthermore, the attention pyramid pooling module (APPM) reconstructs the features on this map by leveraging a spatial-first and channel-second attention mechanism. APPM enables the network to adaptively adjust the weights of different locations and channels in the feature map, thereby enhancing the perception of important regions and key features. Experimental evaluation of our collected dental calculus segmentation dataset demonstrates the superior performance of MAPPNet, which achieves an intersection-over-union of 81.46% and an accuracy rate of 98.35%. Additionally, on two publicly available datasets, ISIC2018 (skin lesion dataset) and Kvasir-SEG (gastrointestinal polyp segmentation dataset), MAPPNet achieved an intersection-over-union of 76.48% and 91.38%, respectively. These results validate the effectiveness of our proposed network in accurately segmenting lesion regions and achieving high accuracy rates, surpassing many existing segmentation methods. Full article
Show Figures

Figure 1

15 pages, 4507 KiB  
Article
A UAV Aerial Image Target Detection Algorithm Based on YOLOv7 Improved Model
by Jie Qin, Weihua Yu, Xiaoxi Feng, Zuqiang Meng and Chaohong Tan
Electronics 2024, 13(16), 3277; https://doi.org/10.3390/electronics13163277 (registering DOI) - 19 Aug 2024
Viewed by 103
Abstract
To address the challenges of multi-scale objects, dense distributions, occlusions, and numerous small targets in UAV image detection, we present CMS-YOLOv7, a real-time target detection method based on an enhanced YOLOv7 model. Firstly, the detection layer P2 for small targets was added to [...] Read more.
To address the challenges of multi-scale objects, dense distributions, occlusions, and numerous small targets in UAV image detection, we present CMS-YOLOv7, a real-time target detection method based on an enhanced YOLOv7 model. Firstly, the detection layer P2 for small targets was added to YOLOv7 to enhance the detection ability of small and medium-sized targets, and the deep detection head P5 was taken out to mitigate the influence of excessive downsampling on small target images. The anchor frame was calculated by the K-means++ method. Using the concept of Inner-IoU, the Inner-MPDIoU loss function was constructed to control the range of the auxiliary border and improve detection performance. Furthermore, the CARAFE module was introduced to replace traditional upsampling methods, offering improved integration of semantic information during the image upsampling process and enhancing feature mapping accuracy. Simultaneously, during the feature extraction stage, a non-strided convolutional SPD-Conv module was constructed using space-to-depth techniques. This module replaced certain convolutional operations to minimize the loss of fine-grained information and improve the model’s ability to extract features from small targets. Experiments on the UAV aerial photo dataset VisDrone2019 demonstrated that compared with the baseline YOLOv7 object detection algorithm, CMS-YOLOv7 achieved an improvement of 3.5% [email protected], 3.0% [email protected]:0.95, and the number of parameters decreased by 18.54 M. The ability of small target detection was significantly enhanced. Full article
(This article belongs to the Special Issue Applications of Computer Vision, 2nd Edition)
Show Figures

Figure 1

23 pages, 2501 KiB  
Article
MsFNet: Multi-Scale Fusion Network Based on Dynamic Spectral Features for Multi-Temporal Hyperspectral Image Change Detection
by Yining Feng, Weihan Ni, Liyang Song and Xianghai Wang
Remote Sens. 2024, 16(16), 3037; https://doi.org/10.3390/rs16163037 (registering DOI) - 18 Aug 2024
Viewed by 332
Abstract
With the development of satellite technology, the importance of multi-temporal remote sensing (RS) image change detection (CD) in urban planning, environmental monitoring, and other fields is increasingly prominent. Deep learning techniques enable a profound exploration of the intrinsic features within hyperspectral (HS) data, [...] Read more.
With the development of satellite technology, the importance of multi-temporal remote sensing (RS) image change detection (CD) in urban planning, environmental monitoring, and other fields is increasingly prominent. Deep learning techniques enable a profound exploration of the intrinsic features within hyperspectral (HS) data, leading to substantial enhancements in CD accuracy while addressing several challenges posed by traditional methodologies. However, existing convolutional neural network (CNN)-based CD approaches frequently encounter issues during the feature extraction process, such as the loss of detailed information due to downsampling, which hampers a model’s ability to accurately capture complex spectral features. Additionally, these methods often neglect the integration of multi-scale information, resulting in suboptimal local feature extraction and, consequently, diminished model performance. To address these limitations, we propose a multi-scale fusion network (MsFNet) which leverages dynamic spectral features for effective multi-temporal HS-CD. Our approach incorporates a dynamic convolution module with spectral attention, which adaptively modulates the receptive field size according to the spectral characteristics of different bands. This flexibility enhances the model’s capacity to focus on critical bands, thereby improving its ability to identify and differentiate changes across spectral dimensions. Furthermore, we develop a multi-scale feature fusion module which extracts and integrates features from deep feature maps, enriching local information and augmenting the model’s sensitivity to local variations. Experimental evaluations conducted on three real-world HS-CD datasets demonstrate that the proposed MsFNet significantly outperforms contemporary advanced CD methods in terms of both efficacy and performance. Full article
(This article belongs to the Special Issue Recent Advances in the Processing of Hyperspectral Images)
Show Figures

Figure 1

27 pages, 3359 KiB  
Article
A Multi-Scale Mask Convolution-Based Blind-Spot Network for Hyperspectral Anomaly Detection
by Zhiwei Yang, Rui Zhao, Xiangchao Meng, Gang Yang, Weiwei Sun, Shenfu Zhang and Jinghui Li
Remote Sens. 2024, 16(16), 3036; https://doi.org/10.3390/rs16163036 (registering DOI) - 18 Aug 2024
Viewed by 324
Abstract
Existing methods of hyperspectral anomaly detection still face several challenges: (1) Due to the limitations of self-supervision, avoiding the identity mapping of anomalies remains difficult; (2) the ineffective interaction between spatial and spectral features leads to the insufficient utilization of spatial information; and [...] Read more.
Existing methods of hyperspectral anomaly detection still face several challenges: (1) Due to the limitations of self-supervision, avoiding the identity mapping of anomalies remains difficult; (2) the ineffective interaction between spatial and spectral features leads to the insufficient utilization of spatial information; and (3) current methods are not adaptable to the detection of multi-scale anomaly targets. To address the aforementioned challenges, we proposed a blind-spot network based on multi-scale blind-spot convolution for HAD. The multi-scale mask convolution module is employed to adapt to diverse scales of anomaly targets, while the dynamic fusion module is introduced to integrate the advantages of mask convolutions at different scales. The proposed approach includes a spatial–spectral joint module and a background feature attention mechanism to enhance the interaction between spatial–spectral features, with a specific emphasis on highlighting the significance of background features within the network. Furthermore, we propose a preprocessing technique that combines pixel shuffle down-sampling (PD) with spatial spectral joint screening. This approach addresses anomalous identity mapping and enables finite-scale mask convolution for better detection of targets at various scales. The proposed approach was assessed on four real hyperspectral datasets comprising anomaly targets of different scales. The experimental results demonstrate the effectiveness and superior performance of the proposed methodology compared with nine state-of-the-art methods. Full article
19 pages, 14105 KiB  
Article
Identification of Pine Wilt-Diseased Trees Using UAV Remote Sensing Imagery and Improved PWD-YOLOv8n Algorithm
by Jianyi Su, Bingxi Qin, Fenggang Sun, Peng Lan and Guolin Liu
Drones 2024, 8(8), 404; https://doi.org/10.3390/drones8080404 (registering DOI) - 18 Aug 2024
Viewed by 344
Abstract
Pine wilt disease (PWD) is one of the most destructive diseases for pine trees, causing a significant effect on ecological resources. The identification of PWD-infected trees is an effective approach for disease control. However, the effects of complex environments and the multi-scale features [...] Read more.
Pine wilt disease (PWD) is one of the most destructive diseases for pine trees, causing a significant effect on ecological resources. The identification of PWD-infected trees is an effective approach for disease control. However, the effects of complex environments and the multi-scale features of PWD trees hinder detection performance. To address these issues, this study proposes a detection model based on PWD-YOLOv8 by utilizing aerial images. In particular, the coordinate attention (CA) and convolutional block attention module (CBAM) mechanisms are combined with YOLOv8 to enhance feature extraction. The bidirectional feature pyramid network (BiFPN) structure is used to strengthen feature fusion and recognition capability for small-scale diseased trees. Meanwhile, the lightweight FasterBlock structure and efficient multi-scale attention (EMA) mechanism are employed to optimize the C2f module. In addition, the Inner-SIoU loss function is introduced to seamlessly improve model accuracy and reduce missing rates. The experiment showed that the proposed PWD-YOLOv8n algorithm outperformed conventional target-detection models on the validation set ([email protected] = 94.3%, precision = 87.9%, recall = 87.0%, missing rate = 6.6%; model size = 4.8 MB). Therefore, the proposed PWD-YOLOv8n model demonstrates significant superiority in diseased-tree detection. It not only enhances detection efficiency and accuracy but also provides important technical support for forest disease control and prevention. Full article
Show Figures

Figure 1

23 pages, 6463 KiB  
Article
Assessing the Role of Environmental Covariates and Pixel Size in Soil Property Prediction: A Comparative Study of Various Areas in Southwest Iran
by Pegah Khosravani, Majid Baghernejad, Ruhollah Taghizadeh-Mehrjardi, Seyed Roohollah Mousavi, Ali Akbar Moosavi, Seyed Rashid Fallah Shamsi, Hadi Shokati, Ndiye M. Kebonye and Thomas Scholten
Land 2024, 13(8), 1309; https://doi.org/10.3390/land13081309 (registering DOI) - 18 Aug 2024
Viewed by 298
Abstract
(1) Background: The use of multiscale prediction or the optimal scaling of predictors can enhance soil maps by applying pixel size in digital soil mapping (DSM). (2) Methods: A total of 200, 50, and 129 surface soil samples (0–30 cm) were collected by [...] Read more.
(1) Background: The use of multiscale prediction or the optimal scaling of predictors can enhance soil maps by applying pixel size in digital soil mapping (DSM). (2) Methods: A total of 200, 50, and 129 surface soil samples (0–30 cm) were collected by the CLHS method in three different areas, namely, the Marvdasht, Bandamir, and Lapuee plains in southwest Iran. Then, four soil properties—soil organic matter (SOM), bulk density (BD), soil shear strength (SS), and mean weighted diameter (MWD)—were measured at each sampling point as representative attributes of soil physical and chemical quality. This study examined different-scale scenarios ranging from resampling the original 30 m digital elevation model and remote sensing indices to various pixel sizes, including 60 × 60, 90 × 90, 120 × 120, and up to 2100 × 2100 m. (3) Results: After evaluating 22 environmental covariates, 11 of them were identified as the most suitable candidates for predicting soil properties based on recursive feature elimination (RFE) and expert opinion methods. Furthermore, among different pixel size scenarios for SOM, BD, SS, and MWD, the highest accuracy was achieved at 1200 × 1200 m (R2 = 0.35), 180 × 180 m (R2 = 0.67), 1200 × 1200 m (R2 = 0.42), and 2100 × 2100 m (R2 = 0.34), respectively, in Marvdasht plain. (4) Conclusions: Adjusting the pixel size improves the capture of soil property variability, enhancing mapping precision and supporting effective decision making for crop management, irrigation, and land use planning. Full article
Show Figures

Figure 1

21 pages, 1668 KiB  
Article
DCG-Net: Enhanced Hyperspectral Image Classification with Dual-Branch Convolutional Neural Network and Graph Convolutional Neural Network Integration
by Wenkai Zhu, Xueying Sun and Qiang Zhang
Electronics 2024, 13(16), 3271; https://doi.org/10.3390/electronics13163271 (registering DOI) - 18 Aug 2024
Viewed by 224
Abstract
In recent years, graph convolutional neural networks (GCNs) and convolutional neural networks (CNNs) have made significant strides in hyperspectral image (HSI) classification. However, existing models often encounter information redundancy and feature mismatch during feature fusion, and they struggle with small-scale refined features. To [...] Read more.
In recent years, graph convolutional neural networks (GCNs) and convolutional neural networks (CNNs) have made significant strides in hyperspectral image (HSI) classification. However, existing models often encounter information redundancy and feature mismatch during feature fusion, and they struggle with small-scale refined features. To address these issues, we propose DCG-Net, an innovative classification network integrating CNN and GCN architectures. Our approach includes the development of a double-branch expanding network (E-Net) to enhance spectral features and efficiently extract high-level features. Additionally, we incorporate a GCN with an attention mechanism to facilitate the integration of multi-space scale superpixel-level and pixel-level features. To further improve feature fusion, we introduce a feature aggregation module (FAM) that adaptively learns channel features, enhancing classification robustness and accuracy. Comprehensive experiments on three widely used datasets show that DCG-Net achieves superior classification results compared to other state-of-the-art methods. Full article
(This article belongs to the Topic Hyperspectral Imaging and Signal Processing)
Show Figures

Figure 1

22 pages, 15192 KiB  
Article
Joint Luminance-Saliency Prior and Attention for Underwater Image Quality Assessment
by Zhiqiang Lin, Zhouyan He, Chongchong Jin, Ting Luo and Yeyao Chen
Remote Sens. 2024, 16(16), 3021; https://doi.org/10.3390/rs16163021 (registering DOI) - 17 Aug 2024
Viewed by 308
Abstract
Underwater images, as a crucial medium for storing ocean information in underwater sensors, play a vital role in various underwater tasks. However, they are prone to distortion due to the imaging environment, which leads to a decline in visual quality, which is an [...] Read more.
Underwater images, as a crucial medium for storing ocean information in underwater sensors, play a vital role in various underwater tasks. However, they are prone to distortion due to the imaging environment, which leads to a decline in visual quality, which is an urgent issue for various marine vision systems to address. Therefore, it is necessary to develop underwater image enhancement (UIE) and corresponding quality assessment methods. At present, most underwater image quality assessment (UIQA) methods primarily rely on extracting handcrafted features that characterize degradation attributes, which struggle to measure complex mixed distortions and often exhibit discrepancies with human visual perception in practical applications. Furthermore, current UIQA methods lack the consideration of the perception perspective of enhanced effects. To this end, this paper employs luminance and saliency priors as critical visual information for the first time to measure the enhancement effect of global and local quality achieved by the UIE algorithms, named JLSAU. The proposed JLSAU is built upon an overall pyramid-structured backbone, supplemented by the Luminance Feature Extraction Module (LFEM) and Saliency Weight Learning Module (SWLM), which aim at obtaining perception features with luminance and saliency priors at multiple scales. The supplement of luminance priors aims to perceive visually sensitive global distortion of luminance, including histogram statistical features and grayscale features with positional information. The supplement of saliency priors aims to perceive visual information that reflects local quality variation both in spatial and channel domains. Finally, to effectively model the relationship among different levels of visual information contained in the multi-scale features, the Attention Feature Fusion Module (AFFM) is proposed. Experimental results on the public UIQE and UWIQA datasets demonstrate that the proposed JLSAU outperforms existing state-of-the-art UIQA methods. Full article
(This article belongs to the Special Issue Ocean Remote Sensing Based on Radar, Sonar and Optical Techniques)
Show Figures

Figure 1

19 pages, 4688 KiB  
Article
Enhancing the Resolution of Satellite Ocean Data Using Discretized Satellite Gridding Neural Networks
by Shirong Liu, Wentao Jia, Qianyun Wang, Weimin Zhang and Huizan Wang
Remote Sens. 2024, 16(16), 3020; https://doi.org/10.3390/rs16163020 (registering DOI) - 17 Aug 2024
Viewed by 278
Abstract
Ocean satellite data are often impeded by intrinsic limitations in resolution and accuracy. However, conventional data reconstruction approaches encounter substantial challenges when facing the nonlinear oceanic system and high-resolution fusion of variables. This research presents a Discrete Satellite Gridding Neural Network (DSGNN), a [...] Read more.
Ocean satellite data are often impeded by intrinsic limitations in resolution and accuracy. However, conventional data reconstruction approaches encounter substantial challenges when facing the nonlinear oceanic system and high-resolution fusion of variables. This research presents a Discrete Satellite Gridding Neural Network (DSGNN), a new machine learning method that processes satellite data within a discrete grid framework. By transforming the positional information of grid elements into a standardized vector format, the DSGNN significantly elevates the accuracy and resolution of data fusion through a neural network model. This method’s innovative aspect lies in its discretization and fusion technique, which not only enhances the spatial resolution of oceanic data but also, through the integration of multi-element datasets, better reflects the true physical state of the ocean. A comprehensive analysis of the reconstructed datasets indicates the DSGNN’s consistency and reliability across different seasons and oceanic regions, especially in its adept handling of complex nonlinear interactions and small-scale oceanic features. The DSGNN method has demonstrated exceptional competence in reconstructing global ocean datasets, maintaining small error variance, and achieving high congruence with in situ observations, which is almost equivalent to 1/12° hybrid coordinate ocean model (HYCOM) data. This study offers a novel and potent strategy for the high-resolution reconstruction and fusion of ocean satellite datasets. Full article
(This article belongs to the Special Issue Artificial Intelligence and Big Data for Oceanography)
Show Figures

Figure 1

17 pages, 3807 KiB  
Review
A Survey on IoT Application Architectures
by Abdulkadir Dauda, Olivier Flauzac and Florent Nolot
Sensors 2024, 24(16), 5320; https://doi.org/10.3390/s24165320 (registering DOI) - 17 Aug 2024
Viewed by 349
Abstract
The proliferation of the IoT has led to the development of diverse application architectures to optimize IoT systems’ deployment, operation, and maintenance. This survey provides a comprehensive overview of the existing IoT application architectures, highlighting their key features, strengths, and limitations. The architectures [...] Read more.
The proliferation of the IoT has led to the development of diverse application architectures to optimize IoT systems’ deployment, operation, and maintenance. This survey provides a comprehensive overview of the existing IoT application architectures, highlighting their key features, strengths, and limitations. The architectures are categorized based on their deployment models, such as cloud, edge, and fog computing approaches, each offering distinct advantages regarding scalability, latency, and resource efficiency. Cloud architectures leverage centralized data processing and storage capabilities to support large-scale IoT applications but often suffer from high latency and bandwidth constraints. Edge architectures mitigate these issues by bringing computation closer to the data source, enhancing real-time processing, and reducing network congestion. Fog architectures combine the strengths of both cloud and edge paradigms, offering a balanced solution for complex IoT environments. This survey also examines emerging trends and technologies in IoT application management, such as the solutions provided by the major IoT service providers like Intel, AWS, Microsoft Azure, and GCP. Through this study, the survey identifies latency, privacy, and deployment difficulties as key areas for future research. It highlights the need to advance IoT Edge architectures to reduce network traffic, improve data privacy, and enhance interoperability by developing multi-application and multi-protocol edge gateways for efficient IoT application management. Full article
(This article belongs to the Special Issue Cloud and Edge Computing for IoT Applications)
Show Figures

Figure 1

24 pages, 22182 KiB  
Article
Multi-Channel Multi-Scale Convolution Attention Variational Autoencoder (MCA-VAE): An Interpretable Anomaly Detection Algorithm Based on Variational Autoencoder
by Jingwen Liu, Yuchen Huang, Dizhi Wu, Yuchen Yang, Yanru Chen, Liangyin Chen and Yuanyuan Zhang
Sensors 2024, 24(16), 5316; https://doi.org/10.3390/s24165316 - 16 Aug 2024
Viewed by 252
Abstract
With the rapid development of industry, the risks factories face are increasing. Therefore, the anomaly detection algorithms deployed in factories need to have high accuracy, and they need to be able to promptly discover and locate the specific equipment causing the anomaly to [...] Read more.
With the rapid development of industry, the risks factories face are increasing. Therefore, the anomaly detection algorithms deployed in factories need to have high accuracy, and they need to be able to promptly discover and locate the specific equipment causing the anomaly to restore the regular operation of the abnormal equipment. However, the neural network models currently deployed in factories cannot effectively capture both temporal features within dimensions and relationship features between dimensions; some algorithms that consider both types of features lack interpretability. Therefore, we propose a high-precision, interpretable anomaly detection algorithm based on variational autoencoder (VAE). We use a multi-scale local weight-sharing convolutional neural network structure to fully extract the temporal features within each dimension of the multi-dimensional time series. Then, we model the features from various aspects through multiple attention heads, extracting the relationship features between dimensions. We map the attention output results to the latent space distribution of the VAE and propose an optimization method to improve the reconstruction performance of the VAE, detecting anomalies through reconstruction errors. Regarding anomaly interpretability, we utilize the VAE probability distribution characteristics, decompose the obtained joint probability density into conditional probabilities on each dimension, and calculate the anomaly score, which provides helpful value for technicians. Experimental results show that our algorithm performed best in terms of F1 score and AUC value. The AUC value for anomaly detection is 0.982, and the F1 score is 0.905, which is 4% higher than the best-performing baseline algorithm, Transformer with a Discriminator for Anomaly Detection (TDAD). It also provides accurate anomaly interpretation capability. Full article
Show Figures

Figure 1

16 pages, 2860 KiB  
Article
Attention-Enhanced Bi-LSTM with Gated CNN for Ship Heave Multi-Step Forecasting
by Wenzhuo Shi, Zimeng Guo, Zixiang Dai, Shizhen Li and Meng Chen
J. Mar. Sci. Eng. 2024, 12(8), 1413; https://doi.org/10.3390/jmse12081413 - 16 Aug 2024
Viewed by 196
Abstract
This study addresses the challenges of predicting ship heave motion in real time, which is essential for mitigating sensor–actuator delays in high-performance active compensation control. Traditional methods often fall short due to training on specific sea conditions, and they lack real-time prediction capabilities. [...] Read more.
This study addresses the challenges of predicting ship heave motion in real time, which is essential for mitigating sensor–actuator delays in high-performance active compensation control. Traditional methods often fall short due to training on specific sea conditions, and they lack real-time prediction capabilities. To overcome these limitations, this study introduces a multi-step prediction model based on a Seq2Seq framework, training with heave data taken from various sea conditions. The model features a long-term encoder with attention-enhanced Bi-LSTM, a short-term encoder with Gated CNN, and a decoder composed of multiple fully connected layers. The long-term encoder and short-term encoder are designed to maximize the extraction of global characteristics and multi-scale short-term features of heave data, respectively. An optimized Huber loss function is used to improve the fitting performance in peak and valley regions. The experimental results demonstrate that this model outperforms baseline methods across all metrics, providing precise predictions for high-sampling-rate real-time applications. Trained on simulated sea conditions and fine-tuned through transfer learning on actual ship data, the proposed model shows strong generalization with prediction errors smaller than 0.02 m. Based on both results from the regular test and the generalization test, the model’s predictive performance is shown to meet the necessary criteria for active heave compensation control. Full article
(This article belongs to the Section Ocean Engineering)
Show Figures

Figure 1

14 pages, 7566 KiB  
Article
Ship Segmentation via Combined Attention Mechanism and Efficient Channel Attention High-Resolution Representation Network
by Xiaoyi Li
J. Mar. Sci. Eng. 2024, 12(8), 1411; https://doi.org/10.3390/jmse12081411 - 16 Aug 2024
Viewed by 235
Abstract
Ship segmentation with small imaging size, which challenges ship detection and visual navigation model performance due to imaging noise interference, has attracted significant attention in the field. To address the issues, this study proposed a novel combined attention mechanism and efficient channel attention [...] Read more.
Ship segmentation with small imaging size, which challenges ship detection and visual navigation model performance due to imaging noise interference, has attracted significant attention in the field. To address the issues, this study proposed a novel combined attention mechanism and efficient channel attention high-resolution representation network (CA2HRNET). More specially, the proposed model fulfills accurate ship segmentation by introducing a channel attention mechanism, a multi-scale spatial attention mechanism, and a weight self-adjusted attention mechanism. Overall, the proposed CA2HRNET model enhances attention mechanism performance by focusing on the trivial yet important features and pixels of a ship against background-interference pixels. The proposed ship segmentation model can accurately focus on ship features by implementing both channel and spatial fusion attention mechanisms at each scale feature layer. Moreover, the channel attention mechanism helps the proposed framework allocate higher weights to ship-feature-related pixels. The experimental results show that the proposed CA2HRNET model outperforms its counterparts in terms of accuracy (Accs), precision (Pc), F1-score (F1s), intersection over union (IoU), and frequency-weighted IoU (FIoU). The average Accs, Pc, F1s, IoU, and FIoU for the proposed CA2HRNET model were 99.77%, 97.55%, 97%, 96.97%, and 99.55%, respectively. The research findings can promote intelligent ship visual navigation and maritime traffic management in the smart shipping era. Full article
Show Figures

Figure 1

26 pages, 10106 KiB  
Article
DFLM-YOLO: A Lightweight YOLO Model with Multiscale Feature Fusion Capabilities for Open Water Aerial Imagery
by Chen Sun, Yihong Zhang and Shuai Ma
Drones 2024, 8(8), 400; https://doi.org/10.3390/drones8080400 - 16 Aug 2024
Viewed by 309
Abstract
Object detection algorithms for open water aerial images present challenges such as small object size, unsatisfactory detection accuracy, numerous network parameters, and enormous computational demands. Current detection algorithms struggle to meet the accuracy and speed requirements while being deployable on small mobile devices. [...] Read more.
Object detection algorithms for open water aerial images present challenges such as small object size, unsatisfactory detection accuracy, numerous network parameters, and enormous computational demands. Current detection algorithms struggle to meet the accuracy and speed requirements while being deployable on small mobile devices. This paper proposes DFLM-YOLO, a lightweight small-object detection network based on the YOLOv8 algorithm with multiscale feature fusion. Firstly, to solve the class imbalance problem of the SeaDroneSee dataset, we propose a data augmentation algorithm called Small Object Multiplication (SOM). SOM enhances dataset balance by increasing the number of objects in specific categories, thereby improving model accuracy and generalization capabilities. Secondly, we optimize the backbone network structure by implementing Depthwise Separable Convolution (DSConv) and the newly designed FasterBlock-CGLU-C2f (FC-C2f), which reduces the model’s parameters and inference time. Finally, we design the Lightweight Multiscale Feature Fusion Network (LMFN) to address the challenges of multiscale variations by gradually fusing the four feature layers extracted from the backbone network in three stages. In addition, LMFN incorporates the Dilated Re-param Block structure to increase the effective receptive field and improve the model’s classification ability and detection accuracy. The experimental results on the SeaDroneSee dataset indicate that DFLM-YOLO improves the mean average precision (mAP) by 12.4% compared to the original YOLOv8s, while reducing parameters by 67.2%. This achievement provides a new solution for Unmanned Aerial Vehicles (UAVs) to conduct object detection missions in open water efficiently. Full article
Show Figures

Figure 1

29 pages, 17576 KiB  
Article
Semantic Segmentation Network Based on Adaptive Attention and Deep Fusion Utilizing a Multi-Scale Dilated Convolutional Pyramid
by Shan Zhao, Zihao Wang, Zhanqiang Huo and Fukai Zhang
Sensors 2024, 24(16), 5305; https://doi.org/10.3390/s24165305 - 16 Aug 2024
Viewed by 289
Abstract
Deep learning has recently made significant progress in semantic segmentation. However, the current methods face critical challenges. The segmentation process often lacks sufficient contextual information and attention mechanisms, low-level features lack semantic richness, and high-level features suffer from poor resolution. These limitations reduce [...] Read more.
Deep learning has recently made significant progress in semantic segmentation. However, the current methods face critical challenges. The segmentation process often lacks sufficient contextual information and attention mechanisms, low-level features lack semantic richness, and high-level features suffer from poor resolution. These limitations reduce the model’s ability to accurately understand and process scene details, particularly in complex scenarios, leading to segmentation outputs that may have inaccuracies in boundary delineation, misclassification of regions, and poor handling of small or overlapping objects. To address these challenges, this paper proposes a Semantic Segmentation Network Based on Adaptive Attention and Deep Fusion with the Multi-Scale Dilated Convolutional Pyramid (SDAMNet). Specifically, the Dilated Convolutional Atrous Spatial Pyramid Pooling (DCASPP) module is developed to enhance contextual information in semantic segmentation. Additionally, a Semantic Channel Space Details Module (SCSDM) is devised to improve the extraction of significant features through multi-scale feature fusion and adaptive feature selection, enhancing the model’s perceptual capability for key regions and optimizing semantic understanding and segmentation performance. Furthermore, a Semantic Features Fusion Module (SFFM) is constructed to address the semantic deficiency in low-level features and the low resolution in high-level features. The effectiveness of SDAMNet is demonstrated on two datasets, revealing significant improvements in Mean Intersection over Union (MIOU) by 2.89% and 2.13%, respectively, compared to the Deeplabv3+ network. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

Back to TopTop