Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (135)

Search Parameters:
Keywords = multilevel structure extraction

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
27 pages, 24858 KiB  
Article
Mobile Mapping System for Urban Infrastructure Monitoring: Digital Twin Implementation in Road Asset Management
by Vittorio Scolamiero, Piero Boccardo and Luigi La Riccia
Land 2025, 14(3), 597; https://doi.org/10.3390/land14030597 - 12 Mar 2025
Abstract
In the age of digital twins, the digitalization of the urban environment is one of the key aspects in the optimization of urban management. The goal of urban digitalization is to provide a digital representation of physical infrastructure, data, information, and procedures for [...] Read more.
In the age of digital twins, the digitalization of the urban environment is one of the key aspects in the optimization of urban management. The goal of urban digitalization is to provide a digital representation of physical infrastructure, data, information, and procedures for the management of complex anthropogenic systems. To meet this new goal, one must be able to understand the urban system through the integrated use of different methods in a multi-level approach. In this context, mobile surveying is a consolidated method for data collection in urban environments. A recent innovation, the mobile mapping system (MMS), is a versatile tool used to collect geospatial data efficiently, accurately, and quickly, with reduced time and costs compared to traditional survey methods. This system combines various technologies such as GNSS (global navigation satellite system), IMU (inertial measurement unit), LiDAR (light detection and ranging), and high-resolution cameras to map and create three-dimensional models of the surrounding environment. The aim of this study was to analyze the limitations, possible implementations, and the state of the art of MMSs for road infrastructure monitoring in order to create a DT (digital twin) for road infrastructure management, with a specific focus on extracting value-added information from a survey dataset. The case study presented here was part of the Turin Digital Twin project. In this context, an MMS was tested in a specific area to evaluate its potential and integration with other data sources, adhering to the multi-level and multi-sensor approach of the DT project. A key outcome of this work was the integration of the extracted information into a comprehensive geodatabase, transforming raw geospatial data into a structured tool that supports predictive maintenance and strategic road asset management toward DT implementation. Full article
(This article belongs to the Special Issue Urban Morphology: A Perspective from Space (Second Edition))
Show Figures

Figure 1

34 pages, 2580 KiB  
Article
Bayesian Estimation of Generalized Log-Linear Poisson Item Response Models for Fluency Scores Using brms and Stan
by Nils Myszkowski and Martin Storme
J. Intell. 2025, 13(3), 26; https://doi.org/10.3390/jintelligence13030026 - 23 Feb 2025
Viewed by 322
Abstract
Divergent thinking tests are popular instruments to measure a person’s creativity. They often involve scoring fluency, which refers to the count of ideas generated in response to a prompt. The two-parameter Poisson counts model (2PPCM), a generalization of the Rasch Poisson counts model [...] Read more.
Divergent thinking tests are popular instruments to measure a person’s creativity. They often involve scoring fluency, which refers to the count of ideas generated in response to a prompt. The two-parameter Poisson counts model (2PPCM), a generalization of the Rasch Poisson counts model (RPCM) that includes discrimination parameters, has been proposed as a useful approach to analyze fluency scores in creativity tasks, but its estimation was presented in the context of generalized structural equation modeling (GSEM) commercial software (e.g., Mplus). Here, we show how the 2PPCM (and RPCM) can be estimated in a Bayesian multilevel regression framework and interpreted using the R package brms, which provides an interface for the Stan programming language. We illustrate this using an example dataset, which contains fluency scores for three tasks and 202 participants. We discuss model specification, estimation, convergence, fit and comparisons. Furthermore, we provide instructions on plotting item response functions, comparing models, calculating overdispersion and reliability, as well as extracting factor scores. Full article
(This article belongs to the Special Issue Analysis of a Divergent Thinking Dataset)
Show Figures

Figure 1

16 pages, 2188 KiB  
Article
MCP: A Named Entity Recognition Method for Shearer Maintenance Based on Multi-Level Clue-Guided Prompt Learning
by Xiangang Cao, Luyang Shi, Xulong Wang, Yong Duan, Xin Yang and Xinyuan Zhang
Appl. Sci. 2025, 15(4), 2106; https://doi.org/10.3390/app15042106 - 17 Feb 2025
Viewed by 263
Abstract
The coal mining industry has accumulated a vast amount of knowledge on shearer accident analysis and handling during its development. Accurately identifying and extracting entity information related to shearer maintenance is crucial for advancing downstream tasks in intelligent shearer operations and maintenance. Currently, [...] Read more.
The coal mining industry has accumulated a vast amount of knowledge on shearer accident analysis and handling during its development. Accurately identifying and extracting entity information related to shearer maintenance is crucial for advancing downstream tasks in intelligent shearer operations and maintenance. Currently, named entity recognition in the field of shearer maintenance primarily relies on fine-tuning-based methods; however, a gap exists between pretraining and downstream tasks. In this paper, we introduce prompt learning and large language models (LLMs), proposing a named entity recognition method for shearer maintenance based on multi-level clue-guided prompt learning (MCP). This method consists of three key components: (1) the prompt learning layer, which encapsulates the information to be identified and forms multi-level sub-clues into structured prompts based on a predefined format; (2) the LLM layer, which employs a decoder-only architecture-based large language model to deeply process the connection between the structured prompts and the information to be identified through multiple stacked decoder layers; and (3) the answer layer, which maps the output of the LLM layer to a structured label space via a parser to obtain the recognition results of structured named entities in the shearer maintenance domain. By designing multi-level sub-clues, MCP enables the model to extract and learn trigger words related to entity recognition from the prompts, acquiring context-aware prompt tokens. This allows the model to make accurate predictions, bridging the gap between fine-tuning and pretraining while eliminating the reliance on labeled data for fine-tuning. Validation was conducted on a self-constructed knowledge corpus in the shearer maintenance domain. Experimental results demonstrate that the proposed method outperforms mainstream baseline models in the field of shearer maintenance. Full article
Show Figures

Figure 1

20 pages, 2363 KiB  
Article
Graph and Multi-Level Sequence Fusion Learning for Predicting the Molecular Activity of BACE-1 Inhibitors
by Shaohua Zheng, Changwang Zhang, Youjia Chen and Meimei Chen
Int. J. Mol. Sci. 2025, 26(4), 1681; https://doi.org/10.3390/ijms26041681 - 16 Feb 2025
Viewed by 349
Abstract
The development of BACE-1 (β-site amyloid precursor protein cleaving enzyme 1) inhibitors is a crucial focus in exploring early treatments for Alzheimer’s disease (AD). Recently, graph neural networks (GNNs) have demonstrated significant advantages in predicting molecular activity. However, their reliance on graph structures [...] Read more.
The development of BACE-1 (β-site amyloid precursor protein cleaving enzyme 1) inhibitors is a crucial focus in exploring early treatments for Alzheimer’s disease (AD). Recently, graph neural networks (GNNs) have demonstrated significant advantages in predicting molecular activity. However, their reliance on graph structures alone often neglects explicit sequence-level semantic information. To address this limitation, we proposed a Graph and multi-level Sequence Fusion Learning (GSFL) model for predicting the molecular activity of BACE-1 inhibitors. Firstly, molecular graph structures generated from SMILES strings were encoded using GNNs with an atomic-level characteristic attention mechanism. Next, substrings at functional group, ion level, and atomic level substrings were extracted from SMILES strings and encoded using a BiLSTM–Transformer framework equipped with a hierarchical attention mechanism. Finally, these features were fused to predict the activity of BACE-1 inhibitors. A dataset of 1548 compounds with BACE-1 activity measurements was curated from the ChEMBL database. In the classification experiment, the model achieved an accuracy of 0.941 on the training set and 0.877 on the test set. For the test set, it delivered a sensitivity of 0.852, a specificity of 0.894, a MCC of 0.744, an F1-score of 0.872, a PRC of 0.869, and an AUC of 0.915. Compared to traditional computer-aided drug design methods and other machine learning algorithms, the proposed model can effectively improve the accuracy of the molecular activity prediction of BACE-1 inhibitors and has a potential application value. Full article
(This article belongs to the Special Issue Cheminformatics in Drug Discovery and Material Design)
Show Figures

Figure 1

22 pages, 2839 KiB  
Article
Narrowband Radar Micromotion Targets Recognition Strategy Based on Graph Fusion Network Constructed by Cross-Modal Attention Mechanism
by Yuanjie Zhang, Ting Gao, Hongtu Xie, Haozong Liu, Mengfan Ge, Bin Xu, Nannan Zhu and Zheng Lu
Remote Sens. 2025, 17(4), 641; https://doi.org/10.3390/rs17040641 - 13 Feb 2025
Viewed by 379
Abstract
In the domain of micromotion target recognition, target characteristics can be extracted through either broadband or narrowband radar echoes. However, due to technical limitations and cost constraints in acquiring broadband radar waveform data, recognition can often only be performed through narrowband radar waveforms. [...] Read more.
In the domain of micromotion target recognition, target characteristics can be extracted through either broadband or narrowband radar echoes. However, due to technical limitations and cost constraints in acquiring broadband radar waveform data, recognition can often only be performed through narrowband radar waveforms. To fully utilize the information embedded within narrowband radar waveforms, it is necessary to conduct in-depth research on multi-dimensional features of micromotion targets, including radar cross-sections (RCSs), time frequency (TF) images, and cadence velocity diagrams (CVDs). To address the limitations of existing identification methodologies in achieving accurate recognition with narrowband echoes, this paper proposes a graph fusion network based on a cross-modal attention mechanism, termed GF-AM Net. The network first adopts convolutional neural networks (CNNs) to extract unimodal features from RCSs, TF images, and CVDs independently. Subsequently, a cross-modal attention mechanism integrates these extracted features into a graph structure, achieving multi-level interactions among unimodal, bimodal, and trimodal features. Finally, the fused features are input into a classification module to accomplish narrowband radar micromotion target identification. Experimental results demonstrate that the proposed methodology successfully captures potential correlations between modal features by incorporating cross-modal multi-level information interactions across different processing stages, exhibiting exceptional accuracy and robustness in narrowband radar micromotion target identification tasks. Full article
(This article belongs to the Special Issue Ocean Remote Sensing Based on Radar, Sonar and Optical Techniques)
Show Figures

Figure 1

17 pages, 2674 KiB  
Article
Research on Predictive Maintenance Methods for Current Transformers with Iron Core Structures
by Huan Hu, Kang Xu, Xianya Zhang, Fangjing Li, Lingling Zhu, Rui Xu and Deng Li
Electronics 2025, 14(3), 625; https://doi.org/10.3390/electronics14030625 - 5 Feb 2025
Viewed by 406
Abstract
The reliable operation of power systems is heavily dependent on effective maintenance strategies for critical equipment. Current maintenance methods are typically categorized into corrective, preventive, and predictive approaches. While corrective maintenance often results in significant downtime and preventive maintenance can be inefficient, predictive [...] Read more.
The reliable operation of power systems is heavily dependent on effective maintenance strategies for critical equipment. Current maintenance methods are typically categorized into corrective, preventive, and predictive approaches. While corrective maintenance often results in significant downtime and preventive maintenance can be inefficient, predictive maintenance emerges as a promising technique for accurately forecasting faults. In this study, we investigated the diagnosis and prediction of fault states, specifically single-phase short circuit (1HCF) and double-phase short circuit (2HCF) faults, using monitoring data from current transformers in 110 kV substations. We proposed a predictive maintenance method for current transformers based on core-type structures, which integrates wavelet transform to extract multi-level frequency domain features, employs feature selection techniques (including the Spearman correlation coefficient and mutual information) to identify key predictive features, and utilizes Random Forest classifiers for fault state prediction. Experimental results demonstrate an overall prediction accuracy of 94%. Full article
Show Figures

Figure 1

20 pages, 8886 KiB  
Article
Multi-Scale Hierarchical Feature Fusion for Infrared Small-Target Detection
by Yue Wang, Xinhong Wang, Shi Qiu, Xianghui Chen, Zhaoyan Liu, Chuncheng Zhou, Weiyuan Yao, Hongjia Cheng, Yu Zhang, Feihong Wang and Zhan Shu
Remote Sens. 2025, 17(3), 428; https://doi.org/10.3390/rs17030428 - 27 Jan 2025
Viewed by 569
Abstract
Detecting small targets in infrared images presents significant challenges due to their tiny size and complex backgrounds, making this task a hotspot for research. Traditional methods rely on assumption-based modeling and manual design, struggling to handle the variability of real-world scenarios. Although convolutional [...] Read more.
Detecting small targets in infrared images presents significant challenges due to their tiny size and complex backgrounds, making this task a hotspot for research. Traditional methods rely on assumption-based modeling and manual design, struggling to handle the variability of real-world scenarios. Although convolutional neural networks (CNNs) increase robustness to diverse scenes with a data-driven paradigm, many CNN-based methods are insufficient in capturing fine-grained details necessary for small targets and are less effective during multi-scale feature fusion. To overcome these challenges, we propose the novel Wide-scale Gated Fully Fusion Network (WGFFNet) in this article, which contributes to infrared small-target detection (IRSTD). WGFFNet uses a classic encoder–decoder structure, where the designed stepped fusion block (SFB) embedded in the feature extraction stage captures finer local context across multiple scales during encoding, and along the decoding path, the multi-level features are progressively integrated by a Fully Gated Interaction (FGI) Module to enhance feature representation. The inclusion of a boundary difference loss further optimizes the edge details of targets. We conducted comprehensive experiments on two public infrared small-target datasets: SIRST-V2 and IRSTD-1k. Quantitative and qualitative results demonstrate that our WGFFNet outperforms representative methods when considering various evaluation metrics together, achieving an improved detection performance and computational efficiency for detecting small targets in infrared images. Full article
Show Figures

Figure 1

18 pages, 2027 KiB  
Article
A Multi-Level SAR-Guided Contextual Attention Network for Satellite Images Cloud Removal
by Ganchao Liu, Jiawei Qiu and Yuan Yuan
Remote Sens. 2024, 16(24), 4767; https://doi.org/10.3390/rs16244767 - 20 Dec 2024
Viewed by 561
Abstract
In the field of remote sensing, cloud cover severely reduces the quality of satellite observations of the earth. Due to the complete absence of information in cloud-covered regions, cloud removal with a single optical image is an ill-posed problem. Since the synthetic aperture [...] Read more.
In the field of remote sensing, cloud cover severely reduces the quality of satellite observations of the earth. Due to the complete absence of information in cloud-covered regions, cloud removal with a single optical image is an ill-posed problem. Since the synthetic aperture radar (SAR) can effectively penetrate clouds, fusing SAR and optical remote sensing images will effectively alleviate this problem. However, existing SAR-based optical cloud removal methods fail to effectively leverage the global information provided by the SAR image, resulting in limited performance gains. In this paper, we introduce a novel cloud removal method named the Multi-Level SAR-Guided Contextual Attention Network (MSGCA-Net). MSGCA-Net is designed with a multi-level architecture that integrates a SAR-Guided Contextual Attention (SGCA) module to fuse the dependable global contextual information from SAR images with the local features of optical images effectively. In the module of SGCA, the SAR image provides reliable global contextual information and genuine structure of cloud-covered regions, while the optical image provides the local feature information. The proposed model can efficiently extract and fuse global and local contextual information in SAR and optical images. We trained and evaluated the performance of the model on both simulated and real-world datasets. Both qualitative and quantitative experimental evaluation demonstrated that the proposed method can yield high quality cloud-free images and outperform state-of-the-art cloud removal methods. Full article
(This article belongs to the Special Issue Recent Advances in Remote Sensing Image Processing Technology)
Show Figures

Figure 1

17 pages, 9200 KiB  
Article
Multi-Condition Intelligent Fault Diagnosis Based on Tree-Structured Labels and Hierarchical Multi-Granularity Diagnostic Network
by Hehua Yan, Jinbiao Tan, Yixiong Luo, Shiyong Wang and Jiafu Wan
Machines 2024, 12(12), 891; https://doi.org/10.3390/machines12120891 - 6 Dec 2024
Viewed by 640
Abstract
The aim of this study is to improve the cross-condition domain adaptability of bearing fault diagnosis models and their diagnostic performance under previously unknown conditions. Thus, this paper proposes a multi-condition adaptive bearing fault diagnosis method based on multi-granularity data annotation. A tree-structured [...] Read more.
The aim of this study is to improve the cross-condition domain adaptability of bearing fault diagnosis models and their diagnostic performance under previously unknown conditions. Thus, this paper proposes a multi-condition adaptive bearing fault diagnosis method based on multi-granularity data annotation. A tree-structured labeling scheme is introduced to allow for multi-granularity fault annotation. A hierarchical multi-granularity diagnostic network is designed to automatically learn multi-level fault information from condition data using feature extractors of varying granularity, allowing for the extraction of shared fault information across conditions. Additionally, a multi-granularity fault loss function is developed to help the deep network learn tree-structured labels, improving intra-class compactness and reducing hierarchical similarity between classes. Two experimental cases demonstrate that the proposed method exhibits robust cross-condition domain adaptability and performs better in unseen conditions than state-of-the-art methods. Full article
(This article belongs to the Special Issue AI-Driven Reliability Analysis and Predictive Maintenance)
Show Figures

Figure 1

15 pages, 10661 KiB  
Article
A Cross-Scale Electrothermal Co-Simulation Approach for Power MOSFETs at Device–Package–Heatsink–Board Levels
by Yuxuan Dai, Jiafei Yao, Jing Chen, Qingyou Qian, Maolin Zhang, Jun Zhang, Qing Yao, Chenyang Huang, Mingshun Sun and Yufeng Guo
Micromachines 2024, 15(11), 1336; https://doi.org/10.3390/mi15111336 - 31 Oct 2024
Cited by 1 | Viewed by 2481
Abstract
This paper proposes a cross-scale simulation approach for evaluating the steady-state electrothermal performance of power MOSFETs at the device–package–heatsink–board (DPHB) level. A co-simulation framework is designed by employing the iterative process of power loss and chip temperature to bridge the device and package–heatsink–board [...] Read more.
This paper proposes a cross-scale simulation approach for evaluating the steady-state electrothermal performance of power MOSFETs at the device–package–heatsink–board (DPHB) level. A co-simulation framework is designed by employing the iterative process of power loss and chip temperature to bridge the device and package–heatsink–board (PHB) level simulators. As a result, the cross-scale electrothermal coupling effect within multilevel settings is considered. Correspondingly, variation values in chip temperature and temperature-dependent drain current can be obtained at various voltage biases, level settings, and DPHB structural parameters, incorporating cross-level physical insights. The simulation results are compared with existing methods, and their features and limitations are discussed. Additionally, this paper also derives an empirical equation from the co-simulations to characterize the relationship between the drain current and the chip temperature under different operations exactly. A commercial MOSFET with TO-220F packaging is implemented in experiments to extract the chip temperature and drain current in electrothermal equilibrium. The method comparisons and fair agreement among simulations, equations, and measurements presents the proposed approach as generalized and powerful for describing variations in chip temperature and drain current considering from micrometer devices to millimeter packages–heatsinks–PCB boards, thus providing effective support for DPHB-level co-design. Full article
Show Figures

Figure 1

24 pages, 12250 KiB  
Article
DMCCT: Dual-Branch Multi-Granularity Convolutional Cross-Substitution Transformer for Hyperspectral Image Classification
by Laiying Fu, Xiaoyong Chen, Yanan Xu and Xiao Li
Appl. Sci. 2024, 14(20), 9499; https://doi.org/10.3390/app14209499 - 17 Oct 2024
Viewed by 796
Abstract
In the field of hyperspectral image classification, deep learning technology, especially convolutional neural networks, has achieved remarkable progress. However, convolutional neural network models encounter challenges in hyperspectral image classification due to limitations in their receptive fields. Conversely, the global modeling capability of Transformers [...] Read more.
In the field of hyperspectral image classification, deep learning technology, especially convolutional neural networks, has achieved remarkable progress. However, convolutional neural network models encounter challenges in hyperspectral image classification due to limitations in their receptive fields. Conversely, the global modeling capability of Transformers has garnered attention in hyperspectral image classification. Nevertheless, the high computational cost and inadequate local feature extraction hinder its widespread application. In this study, we propose a novel fusion model of convolutional neural networks and Transformers to enhance performance in hyperspectral image classification, namely the dual-branch multi-granularity convolutional cross-substitution Transformer (DMCCT). The proposed model adopts a dual-branch structure to separately extract spatial and spectral features, thereby mitigating mutual interference and information loss between spectral and spatial data during feature extraction. Moreover, a multi-granularity embedding module is introduced to facilitate multi-scale and multi-level local feature extraction for spatial and spectral information. In particular, the improved convolutional cross-substitution Transformer module effectively integrates convolution and Transformer, reducing the complexity of attention operations and enhancing the accuracy of hyperspectral image classification tasks. Subsequently, the proposed method is evaluated against existing approaches using three classical datasets, namely Pavia University, Kennedy Space Center, and Indian Pines. Experimental results demonstrate the efficacy of the proposed method, achieving significant classification results on these datasets with overall classification accuracies of 98.57%, 97.96%, and 96.59%, respectively. These results establish the superiority of the proposed method in the context of hyperspectral image classification under similar experimental conditions. Full article
Show Figures

Figure 1

13 pages, 2228 KiB  
Article
MSMP-Net: A Multi-Scale Neural Network for End-to-End Monkeypox Virus Skin Lesion Classification
by Eryang Huan and Hui Dun
Appl. Sci. 2024, 14(20), 9390; https://doi.org/10.3390/app14209390 - 15 Oct 2024
Cited by 1 | Viewed by 1097
Abstract
Monkeypox is a zoonotic disease caused by monkeypox virus infection. It is easily transmitted among people and poses a major threat to human health, making it of great significance in public health. Therefore, this paper proposes MSMP-Net, a multi-scale neural network for end-to-end [...] Read more.
Monkeypox is a zoonotic disease caused by monkeypox virus infection. It is easily transmitted among people and poses a major threat to human health, making it of great significance in public health. Therefore, this paper proposes MSMP-Net, a multi-scale neural network for end-to-end monkeypox virus skin lesion classification ConvNeXt is used as the backbone network, and designs such as inverse bottleneck layers and large convolution kernels are used to enhance the network’s feature extraction capabilities. In order to effectively utilize the multi-level feature maps generated by the backbone network, a multi-scale feature fusion structure was designed. By fusing the deepest feature maps of multi-scale features, the model’s ability to represent monkeypox image features is enhanced. Experimental results show that the accuracy, precision, recall, and F1-score of this method on the MSLD v2.0 dataset are 87.03 ± 3.43%, 87.59 ± 3.37%, 87.03 ± 3.43%, and 86.58 ± 3.66%, respectively. Full article
Show Figures

Figure 1

19 pages, 3429 KiB  
Article
An Insulator Fault Diagnosis Method Based on Multi-Mechanism Optimization YOLOv8
by Chuang Gong, Wei Jiang, Dehua Zou, Weiwei Weng and Hongjun Li
Appl. Sci. 2024, 14(19), 8770; https://doi.org/10.3390/app14198770 - 28 Sep 2024
Viewed by 1028
Abstract
Aiming at the problem that insulator image backgrounds are complex and fault types are diverse, which makes it difficult for existing deep learning algorithms to achieve accurate insulator fault diagnosis, an insulator fault diagnosis method based on multi-mechanism optimization YOLOv8-DCP is proposed. Firstly, [...] Read more.
Aiming at the problem that insulator image backgrounds are complex and fault types are diverse, which makes it difficult for existing deep learning algorithms to achieve accurate insulator fault diagnosis, an insulator fault diagnosis method based on multi-mechanism optimization YOLOv8-DCP is proposed. Firstly, a feature extraction and fusion module, named CW-DRB, was designed. This module enhances the C2f structure of YOLOv8 by incorporating the dilation-wise residual module and the dilated re-param module. The introduction of this module improves YOLOv8’s capability for multi-scale feature extraction and multi-level feature fusion. Secondly, the CARAFE module, which is feature content-aware, was introduced to replace the up-sampling layer in YOLOv8n, thereby enhancing the model’s feature map reconstruction ability. Finally, an additional small-object detection layer was added to improve the detection accuracy of small defects. Simulation results indicate that YOLOv8-DCP achieves an accuracy of 97.7% and an [email protected] of 93.9%. Compared to YOLOv5, YOLOv7, and YOLOv8n, the accuracy improved by 1.5%, 4.3%, and 4.8%, while the [email protected] increased by 3.0%, 4.3%, and 3.1%. This results in a significant enhancement in the accuracy of insulator fault diagnosis. Full article
(This article belongs to the Special Issue Deep Learning for Object Detection)
Show Figures

Figure 1

22 pages, 6759 KiB  
Article
Automatic Method for Extracting Tree Branching Structures from a Single RGB Image
by Yinhui Yang, Huang Lai, Bin Chen, Yuchi Huo, Kai Xia and Jianqin Huang
Forests 2024, 15(9), 1659; https://doi.org/10.3390/f15091659 - 20 Sep 2024
Viewed by 1077
Abstract
Creating automated methods for detecting branches in images is crucial for applications like harvesting robots and forest monitoring. However, the tree images encountered in real-world scenarios present significant challenges for branch detection techniques due to issues such as background interference, occlusion, and varying [...] Read more.
Creating automated methods for detecting branches in images is crucial for applications like harvesting robots and forest monitoring. However, the tree images encountered in real-world scenarios present significant challenges for branch detection techniques due to issues such as background interference, occlusion, and varying environmental lighting. While there has been notable progress in extracting tree trunks for specific species, research on identifying lateral branches remains limited. The primary challenges include establishing a unified mathematical representation for multi-level branch structures, conducting quantitative analyses, and the absence of suitable datasets to facilitate the development of effective models. This study addresses these challenges by creating a dataset encompassing various tree species, developing annotation tools for multi-level branch structure labeling, designing branch vector representations and quantitative metrics. Building on this foundation, the study introduces an automatic extraction model for multi-level branch structures that utilizes ResNet and a self-attention mechanism, along with a tailored loss function for branch extraction tasks. The study evaluated several model variants through both qualitative and quantitative experiments. Results from different tree images demonstrate that the final model can accurately identify the trunk structure and effectively extract detailed lateral branch structures, offering a valuable tool for applications in this area. Full article
(This article belongs to the Section Forest Inventory, Modeling and Remote Sensing)
Show Figures

Figure 1

24 pages, 60637 KiB  
Article
SAR-NTV-YOLOv8: A Neural Network Aircraft Detection Method in SAR Images Based on Despeckling Preprocessing
by Xiaomeng Guo and Baoyi Xu
Remote Sens. 2024, 16(18), 3420; https://doi.org/10.3390/rs16183420 - 14 Sep 2024
Cited by 1 | Viewed by 1337
Abstract
Monitoring aircraft using synthetic aperture radar (SAR) images is a very important task. Given its coherent imaging characteristics, there is a large amount of speckle interference in the image. This phenomenon leads to the scattering information of aircraft targets being masked in SAR [...] Read more.
Monitoring aircraft using synthetic aperture radar (SAR) images is a very important task. Given its coherent imaging characteristics, there is a large amount of speckle interference in the image. This phenomenon leads to the scattering information of aircraft targets being masked in SAR images, which is easily confused with background scattering points. Therefore, automatic detection of aircraft targets in SAR images remains a challenging task. For this task, this paper proposes a framework for speckle reduction preprocessing of SAR images, followed by the use of an improved deep learning method to detect aircraft in SAR images. Firstly, to improve the problem of introducing artifacts or excessive smoothing in speckle reduction using total variation (TV) methods, this paper proposes a new nonconvex total variation (NTV) method. This method aims to ensure the effectiveness of speckle reduction while preserving the original scattering information as much as possible. Next, we present a framework for aircraft detection based on You Only Look Once v8 (YOLOv8) for SAR images. Therefore, the complete framework is called SAR-NTV-YOLOv8. Meanwhile, a high-resolution small target feature head is proposed to mitigate the impact of scale changes and loss of depth feature details on detection accuracy. Then, an efficient multi-scale attention module was proposed, aimed at effectively establishing short-term and long-term dependencies between feature grouping and multi-scale structures. In addition, the progressive feature pyramid network was chosen to avoid information loss or degradation in multi-level transmission during the bottom-up feature extraction process in Backbone. Sufficient comparative experiments, speckle reduction experiments, and ablation experiments are conducted on the SAR-Aircraft-1.0 and SADD datasets. The results have demonstrated the effectiveness of SAR-NTV-YOLOv8, which has the most advanced performance compared to other mainstream algorithms. Full article
Show Figures

Figure 1

Back to TopTop