Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (21)

Search Parameters:
Keywords = Flipped Feature Extraction

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
15 pages, 1401 KiB  
Article
Entropy Analysis of FPGA Interconnect and Switch Matrices for Physical Unclonable Functions
by Jenilee Jao, Ian Wilcox, Jim Plusquellic, Biliana Paskaleva and Pavel Bochev
Cryptography 2024, 8(3), 32; https://doi.org/10.3390/cryptography8030032 - 15 Jul 2024
Viewed by 257
Abstract
Random variations in microelectronic circuit structures represent the source of entropy for physical unclonable functions (PUFs). In this paper, we investigate delay variations that occur through the routing network and switch matrices of a field-programmable gate array (FPGA). The delay variations are isolated [...] Read more.
Random variations in microelectronic circuit structures represent the source of entropy for physical unclonable functions (PUFs). In this paper, we investigate delay variations that occur through the routing network and switch matrices of a field-programmable gate array (FPGA). The delay variations are isolated from other components of the programmable logic, e.g., look-up tables (LUTs), flip-flops (FFs), etc., using a feature of Xilinx FPGAs called dynamic partial reconfiguration (DPR). A set of partial designs is created to fix the placement of a time-to-digital converter (TDC) and supporting infrastructure to enable the path delays through the target interconnect and switch matrices to be extracted by subtracting out common-mode delay components. Delay variations are analyzed in the different levels of routing resources available within FPGAs, i.e., local routing and across-chip routing. Data are collected from a set of Xilinx Zynq 7010 devices, and a statistical analysis of within-die variations in delay through a set of the randomly-generated and hand-crafted interconnects is presented. Full article
(This article belongs to the Special Issue Emerging Topics in Hardware Security)
Show Figures

Figure 1

15 pages, 6519 KiB  
Article
FF-HPINet: A Flipped Feature and Hierarchical Position Information Extraction Network for Lane Detection
by Xiaofeng Zhou and Peng Zhang
Sensors 2024, 24(11), 3502; https://doi.org/10.3390/s24113502 - 29 May 2024
Viewed by 368
Abstract
Effective lane detection technology plays an important role in the current autonomous driving system. Although deep learning models, with their intricate network designs, have proven highly capable of detecting lanes, there persist key areas requiring attention. Firstly, the symmetry inherent in visuals captured [...] Read more.
Effective lane detection technology plays an important role in the current autonomous driving system. Although deep learning models, with their intricate network designs, have proven highly capable of detecting lanes, there persist key areas requiring attention. Firstly, the symmetry inherent in visuals captured by forward-facing automotive cameras is an underexploited resource. Secondly, the vast potential of position information remains untapped, which can undermine detection precision. In response to these challenges, we propose FF-HPINet, a novel approach for lane detection. We introduce the Flipped Feature Extraction module, which models pixel pairwise relationships between the flipped feature and the original feature. This module allows us to capture symmetrical features and obtain high-level semantic feature maps from different receptive fields. Additionally, we design the Hierarchical Position Information Extraction module to meticulously mine the position information of the lanes, vastly improving target identification accuracy. Furthermore, the Deformable Context Extraction module is proposed to distill vital foreground elements and contextual nuances from the surrounding environment, yielding focused and contextually apt feature representations. Our approach achieves excellent performance with the F1 score of 97.00% on the TuSimple dataset and 76.84% on the CULane dataset. Full article
Show Figures

Figure 1

9 pages, 6017 KiB  
Article
Manchester Return-to-Zero On–Off Keying Modulation for Free-Space Optical Communication
by Wenhao Zong, Qianwen Jing, Minfang Liu, Yan Gao and Yanqing Hong
Photonics 2024, 11(6), 496; https://doi.org/10.3390/photonics11060496 - 24 May 2024
Viewed by 477
Abstract
This paper proposes a Manchester return-to-zero on–off keying (M-RZ-OOK) modulation for free-space optical (FSO) communication. M-RZ-OOK modulation is achieved by introducing Manchester coding into the RZ-OOK format. M-RZ-OOK has the features of phase-flipped impulse series in the spectrum. Therefore, normal and inversed channel [...] Read more.
This paper proposes a Manchester return-to-zero on–off keying (M-RZ-OOK) modulation for free-space optical (FSO) communication. M-RZ-OOK modulation is achieved by introducing Manchester coding into the RZ-OOK format. M-RZ-OOK has the features of phase-flipped impulse series in the spectrum. Therefore, normal and inversed channel state information (CSI) can be extracted by applying a local oscillator (LO) with the frequencies of impulses, respectively. These extracted CSIs can be applied to realize adaptive threshold decision (ATD) and adaptive power transmission (APT) in the forward and backward links simultaneously. The proposed M-RZ-OOK modulation was verified in simulations using various turbulence channels. The simulation results demonstrated that ATD and APT were effectively accomplished in the forward and backward links with the estimated normal and inversed CSIs. Full article
(This article belongs to the Special Issue Free-Space Optical Communication and Networking Technology)
Show Figures

Figure 1

24 pages, 5947 KiB  
Article
FR-PatchCore: An Industrial Anomaly Detection Method for Improving Generalization
by Zhiqian Jiang, Yu Zhang, Yong Wang, Jinlong Li and Xiaorong Gao
Sensors 2024, 24(5), 1368; https://doi.org/10.3390/s24051368 - 20 Feb 2024
Viewed by 1818
Abstract
In recent years, a multitude of self-supervised anomaly detection algorithms have been proposed. Among them, PatchCore has emerged as one of the state-of-the-art methods on the widely used MVTec AD benchmark due to its efficient detection capabilities and cost-saving advantages in terms of [...] Read more.
In recent years, a multitude of self-supervised anomaly detection algorithms have been proposed. Among them, PatchCore has emerged as one of the state-of-the-art methods on the widely used MVTec AD benchmark due to its efficient detection capabilities and cost-saving advantages in terms of labeled data. However, we have identified that the PatchCore similarity principal approach faces significant limitations in accurately locating anomalies when there are positional relationships between similar samples, such as rotation, flipping, or misaligned pixels. In real-world industrial scenarios, it is common for samples of the same class to be found in different positions. To address this challenge comprehensively, we introduce Feature-Level Registration PatchCore (FR-PatchCore), which serves as an extension of the PatchCore method. FR-PatchCore constructs a feature matrix that is extracted into the memory bank and continually updated using the optimal negative cosine similarity loss. Extensive evaluations conducted on the MVTec AD benchmark demonstrate that FR-PatchCore achieves an impressive image-level anomaly detection AUROC score of up to 98.81%. Additionally, we propose a novel method for computing the mask threshold that enables the model to scientifically determine the optimal threshold and accurately partition anomalous masks. Our results highlight not only the high generalizability but also substantial potential for industrial anomaly detection offered by FR-PatchCore. Full article
(This article belongs to the Special Issue Sensing and Imaging for Defect Detection)
Show Figures

Figure 1

30 pages, 22382 KiB  
Article
IRv2-Net: A Deep Learning Framework for Enhanced Polyp Segmentation Performance Integrating InceptionResNetV2 and UNet Architecture with Test Time Augmentation Techniques
by Md. Faysal Ahamed, Md. Khalid Syfullah, Ovi Sarkar, Md. Tohidul Islam, Md. Nahiduzzaman, Md. Rabiul Islam, Amith Khandakar, Mohamed Arselene Ayari and Muhammad E. H. Chowdhury
Sensors 2023, 23(18), 7724; https://doi.org/10.3390/s23187724 - 7 Sep 2023
Cited by 4 | Viewed by 2186
Abstract
Colorectal polyps in the colon or rectum are precancerous growths that can lead to a more severe disease called colorectal cancer. Accurate segmentation of polyps using medical imaging data is essential for effective diagnosis. However, manual segmentation by endoscopists can be time-consuming, error-prone, [...] Read more.
Colorectal polyps in the colon or rectum are precancerous growths that can lead to a more severe disease called colorectal cancer. Accurate segmentation of polyps using medical imaging data is essential for effective diagnosis. However, manual segmentation by endoscopists can be time-consuming, error-prone, and expensive, leading to a high rate of missed anomalies. To solve this problem, an automated diagnostic system based on deep learning algorithms is proposed to find polyps. The proposed IRv2-Net model is developed using the UNet architecture with a pre-trained InceptionResNetV2 encoder to extract most features from the input samples. The Test Time Augmentation (TTA) technique, which utilizes the characteristics of the original, horizontal, and vertical flips, is used to gain precise boundary information and multi-scale image features. The performance of numerous state-of-the-art (SOTA) models is compared using several metrics such as accuracy, Dice Similarity Coefficients (DSC), Intersection Over Union (IoU), precision, and recall. The proposed model is tested on the Kvasir-SEG and CVC-ClinicDB datasets, demonstrating superior performance in handling unseen real-time data. It achieves the highest area coverage in the area under the Receiver Operating Characteristic (ROC-AUC) and area under Precision-Recall (AUC-PR) curves. The model exhibits excellent qualitative testing outcomes across different types of polyps, including more oversized, smaller, over-saturated, sessile, or flat polyps, within the same dataset and across different datasets. Our approach can significantly minimize the number of missed rating difficulties. Lastly, a graphical interface is developed for producing the mask in real-time. The findings of this study have potential applications in clinical colonoscopy procedures and can serve based on further research and development. Full article
(This article belongs to the Special Issue AI on Biomedical Signal Sensing and Processing for Health Monitoring)
Show Figures

Figure 1

21 pages, 3703 KiB  
Article
Robust PDF Watermarking against Print–Scan Attack
by Lei Li, Hong-Jun Zhang, Jia-Le Meng and Zhe-Ming Lu
Sensors 2023, 23(17), 7365; https://doi.org/10.3390/s23177365 - 23 Aug 2023
Viewed by 1332
Abstract
Portable document format (PDF) files are widely used in file transmission, exchange, and circulation because of their platform independence, small size, good browsing quality, and the ability to place hyperlinks. However, their security issues are also more thorny. It is common to distribute [...] Read more.
Portable document format (PDF) files are widely used in file transmission, exchange, and circulation because of their platform independence, small size, good browsing quality, and the ability to place hyperlinks. However, their security issues are also more thorny. It is common to distribute printed PDF files to different groups and individuals after printing. However, most PDF watermarking algorithms currently cannot resist print–scan attacks, making it difficult to apply them in leak tracing of both paper and scanned versions of PDF documents. To tackle this issue, we propose an invisible digital watermarking technology based on modifying the edge pixels of text strokes to hide information in PDFs, which achieves high robustness to print–scan attacks. Moreover, it cannot be detected by human perception systems. This method focuses on the representation of text by embedding watermarks by changing the features of the text to ensure that changes in these features can be reflected in the scanned PDF after printing. We first segment each text line into two sub-blocks, then select the row of pixels with the most black pixels, and flip the edge pixels closest to this row. This method requires the participation of original PDF documents in detection. The experimental results show that all peak signal-to-noise ratio (PSNR) values of our proposed method exceed 32 dB, which indicates satisfactory invisibility. Meanwhile, this method can extract the hidden information with 100% accuracy under the JPEG compression attack, and has high robustness against noise attacks and print–scan attacks. In the case of no attacks, the watermark can be recovered without any loss. In terms of practical applications, our method can be applied in the practical leak tracing of official paper documents after distribution. Full article
(This article belongs to the Special Issue Feature Papers in "Sensing and Imaging" Section 2023)
Show Figures

Figure 1

20 pages, 3605 KiB  
Article
A Depression Recognition Method Based on the Alteration of Video Temporal Angle Features
by Zhiqiang Ding, Yahong Hu, Runhui Jing, Weiguo Sheng and Jiafa Mao
Appl. Sci. 2023, 13(16), 9230; https://doi.org/10.3390/app13169230 - 14 Aug 2023
Viewed by 1538
Abstract
In recent years, significant progress has been made in the auxiliary diagnosis system for depression. However, most of the research has focused on combining features from multiple modes to enhance classification accuracy. This approach results in increased space-time overhead and feature synchronization problems. [...] Read more.
In recent years, significant progress has been made in the auxiliary diagnosis system for depression. However, most of the research has focused on combining features from multiple modes to enhance classification accuracy. This approach results in increased space-time overhead and feature synchronization problems. To address this issue, this paper presents a single-modal framework for detecting depression based on changes in facial expressions. Firstly, we propose a robust method for extracting angle features from facial landmarks. Theoretical evidence is provided to demonstrate the translation and rotation invariance of these features. Additionally, we introduce a flip correction method to mitigate angle deviations caused by head flips. The proposed method not only preserves the spatial topological relationship of facial landmarks, but also maintains the temporal correlation between frames preceding and following the facial landmarks. Finally, the GhostNet network is employed for depression detection, and the effectiveness of various modal data is compared. In the depression binary classification task using the DAIC-WOZ dataset, our proposed framework significantly improves the classification performance, achieving an F1 value of 0.80 for depression detection. Experimental results demonstrate that our method outperforms other existing depression detection models based on a single modality. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

18 pages, 7502 KiB  
Article
Aircraft Detection and Fine-Grained Recognition Based on High-Resolution Remote Sensing Images
by Qinghe Guan, Ying Liu, Lei Chen, Shuang Zhao and Guandian Li
Electronics 2023, 12(14), 3146; https://doi.org/10.3390/electronics12143146 - 20 Jul 2023
Viewed by 1095
Abstract
In order to realize the detection and recognition of specific types of an aircraft in remote sensing images, this paper proposes an algorithm called Fine-grained S2ANet (FS2ANet) based on the improved Single-shot Alignment Network (S2ANet) for remote [...] Read more.
In order to realize the detection and recognition of specific types of an aircraft in remote sensing images, this paper proposes an algorithm called Fine-grained S2ANet (FS2ANet) based on the improved Single-shot Alignment Network (S2ANet) for remote sensing aircraft object detection and fine-grained recognition. Firstly, to address the imbalanced number of instances of various aircrafts in the dataset, we perform data augmentation on some remote sensing images using flip and color space transformation methods. Secondly, this paper selects ResNet101 as the backbone, combines space-to-depth (SPD) to improve the FPN structure, constructs the FPN-SPD module, and builds the aircraft fine feature focusing module (AF3M) in the detection head of the network, which reduces the loss of fine-grained information in the process of feature extraction, enhances the extraction capability of the network for fine aircraft features, and improves the detection accuracy of remote sensing micro aircraft objects. Finally, we use the SkewIoU based on Kalman filtering (KFIoU) as the algorithm’s regression loss function, improving the algorithm’s convergence speed and the object boxes’ regression accuracy. The experimental results of the detection and fine-grained recognition of 11 types of remote sensing aircraft objects such as Boeing 737, A321, and C919 using the FS2ANet algorithm show that the mAP0.5 of FS2ANet is 46.82%, which is 3.87% higher than S2ANet, and it can apply to the field of remote sensing aircraft object detection and fine-grained recognition. Full article
Show Figures

Figure 1

21 pages, 6348 KiB  
Article
Automatic Defect Recognition and Localization for Aeroengine Turbine Blades Based on Deep Learning
by Donghuan Wang, Hong Xiao and Shengqin Huang
Aerospace 2023, 10(2), 178; https://doi.org/10.3390/aerospace10020178 - 14 Feb 2023
Cited by 4 | Viewed by 1967
Abstract
Radiographic testing is generally used in the quality management of aeroengine turbine blades. Traditional radiographic testing is critically dependent on artificially detecting professional inspectors. Thus, it sometimes tends to be error-prone and time-consuming. In this study, we gave an automatic defect detection method [...] Read more.
Radiographic testing is generally used in the quality management of aeroengine turbine blades. Traditional radiographic testing is critically dependent on artificially detecting professional inspectors. Thus, it sometimes tends to be error-prone and time-consuming. In this study, we gave an automatic defect detection method by combining radiographic testing with computer vision. A defect detection algorithm named DBFF-YOLOv4 was introduced for X-ray images of aeroengine turbine blades by employing two backbones to extract hierarchical defect features. In addition, a new concatenation form containing all feature maps was developed which play an important role in the present defect detection framework. Finally, a defect detection and recognition system was established for testing and output of complete turbine blade X-ray images. Meanwhile, nine cropping cycles for one defect, flipping, brightness increasing and decreasing were applied for expansion of training samples and data augmentation. The results found that this defect detection system can obtain a recall rate of 91.87%, a precision rate of 96.7%, and a false detection rate of 7% within the score threshold of 0.5. It was proven that cropping nine times and data augmentation are extremely helpful in improving detection accuracy. This study provides a new way of automatic radiographic testing for turbine blades. Full article
(This article belongs to the Section Aeronautics)
Show Figures

Figure 1

25 pages, 30454 KiB  
Article
RecepNet: Network with Large Receptive Field for Real-Time Semantic Segmentation and Application for Blue-Green Algae
by Kaiyuan Yang, Zhonghao Wang, Zheng Yang, Peiyang Zheng, Shanliang Yao, Xiaohui Zhu, Yong Yue, Wei Wang, Jie Zhang and Jieming Ma
Remote Sens. 2022, 14(21), 5315; https://doi.org/10.3390/rs14215315 - 24 Oct 2022
Cited by 3 | Viewed by 2540
Abstract
Most high-performance semantic segmentation networks are based on complicated deep convolutional neural networks, leading to severe latency in real-time detection. However, the state-of-the-art semantic segmentation networks with low complexity are still far from detecting objects accurately. In this paper, we propose a real-time [...] Read more.
Most high-performance semantic segmentation networks are based on complicated deep convolutional neural networks, leading to severe latency in real-time detection. However, the state-of-the-art semantic segmentation networks with low complexity are still far from detecting objects accurately. In this paper, we propose a real-time semantic segmentation network, RecepNet, which balances accuracy and inference speed well. Our network adopts a bilateral architecture (including a detail path, a semantic path and a bilateral aggregation module). We devise a lightweight baseline network for the semantic path to gather rich semantic and spatial information. We also propose a detail stage pattern to store optimized high-resolution information after removing redundancy. Meanwhile, the effective feature-extraction structures are designed to reduce computational complexity. RecepNet achieves an accuracy of 78.65% mIoU (mean intersection over union) on the Cityscapes dataset in the multi-scale crop and flip evaluation. Its algorithm complexity is 52.12 GMACs (giga multiply–accumulate operations) and its inference speed on an RTX 3090 GPU is 50.12 fps. Moreover, we successfully applied RecepNet for blue-green algae real-time detection. We made and published a dataset consisting of aerial images of water surface with blue-green algae, on which RecepNet achieved 82.12% mIoU. To the best of our knowledge, our dataset is the world’s first public dataset of blue-green algae for semantic segmentation. Full article
(This article belongs to the Special Issue Remote Sensing for Water Environment Monitoring)
Show Figures

Figure 1

19 pages, 12076 KiB  
Article
Evaluation of Geometric Attractor Structure and Recurrence Analysis in Professional Dancers
by Michalina Błażkiewicz
Entropy 2022, 24(9), 1310; https://doi.org/10.3390/e24091310 - 16 Sep 2022
Cited by 4 | Viewed by 1810
Abstract
Background: Human motor systems contain nonlinear features. The purpose of this study was to evaluate the geometric structure of attractors and analyze recurrence in two different pirouettes (jazz and classic) performed by 15 professional dancers. Methods: The kinematics of the body’s center of [...] Read more.
Background: Human motor systems contain nonlinear features. The purpose of this study was to evaluate the geometric structure of attractors and analyze recurrence in two different pirouettes (jazz and classic) performed by 15 professional dancers. Methods: The kinematics of the body’s center of mass (CoM) and knee of the supporting leg (LKNE) during the pirouette were measured using the Vicon system. A time series of selected points were resampled, normalized, and randomly reordered. Then, every second time series was flipped to be combined with other time series and make a long time series out of the repetitions of a single task. The attractors were reconstructed, and the convex hull volumes (CHV) were counted for the CoM and LKNE for each pirouette in each direction. Recurrence quantification analysis (RQA) was used to extract additional information. Results: The CHVs calculated for the LKNE were significantly lower for the jazz pirouette. All RQA measures had the highest values for LKNE along the mediolateral axis for the jazz pirouette. This result underscores the high determinism, high motion recurrence, and complexity of this maneuver. Conclusions: The findings offer new insight into the evaluation of the approximation of homogeneity in motion control. A high determinism indicates a highly stable and predictive motion trajectory. Full article
(This article belongs to the Topic Recent Trends in Nonlinear, Chaotic and Complex Systems)
Show Figures

Figure 1

16 pages, 2768 KiB  
Article
Estimation of Fusarium Head Blight Severity Based on Transfer Learning
by Chunfeng Gao, Zheng Gong, Xingjie Ji, Mengjia Dang, Qiang He, Heguang Sun and Wei Guo
Agronomy 2022, 12(8), 1876; https://doi.org/10.3390/agronomy12081876 - 10 Aug 2022
Cited by 11 | Viewed by 1844
Abstract
The recognition accuracy of traditional image recognition methods is heavily dependent on the design of complicated and tedious hand-crafted features. In view of the problems of poor accuracy and complicated feature extraction, this study presents a methodology for the estimation of the severity [...] Read more.
The recognition accuracy of traditional image recognition methods is heavily dependent on the design of complicated and tedious hand-crafted features. In view of the problems of poor accuracy and complicated feature extraction, this study presents a methodology for the estimation of the severity of wheat Fusarium head blight (FHB) with a small sample dataset based on transfer learning technology and convolutional neural networks (CNNs). Firstly, we utilized the potent feature learning and feature expression capabilities of CNNs to realize the automatic learning of FHB characteristics. Using transfer learning technology, VGG16, ResNet50, and MobileNetV1 models were pre-trained on the ImageNet. The knowledge was transferred to the estimation of FHB severity, and the fully connected (FC) layer of the models was modified. Secondly, acquiring the wheat images at the peak of the outbreak of FHB as the research object, after preprocessing for size filling on the wheat images, the image dataset was expanded with operations such as mirror flip, rotation transformation, and superimposed noise to improve the performance of the model and reduce the overfitting of models. Finally, under the Tensorflow deep learning framework, the VGG16, ResNet50, and MobileNetV1 models were subjected to transfer learning. The results showed that in the case of transfer learning and data augmentation, the ResNet50 model in Accuracy, Precision, Recall, and F1 score was better than the other two models, giving the highest accuracy of 98.42% and F1 score of 97.86%. The ResNet50 model had the highest recognition accuracy, providing technical support and reference for the accurate recognition of FHB. Full article
(This article belongs to the Section Precision and Digital Agriculture)
Show Figures

Figure 1

20 pages, 4817 KiB  
Review
Circuit Techniques for High Efficiency Piezoelectric Energy Harvesting
by Yi Yang, Zhiyuan Chen, Qin Kuai, Junrui Liang, Jingjing Liu and Xiaoyang Zeng
Micromachines 2022, 13(7), 1044; https://doi.org/10.3390/mi13071044 - 30 Jun 2022
Cited by 5 | Viewed by 3479
Abstract
This brief presents a tutorial on multifaceted techniques for high efficiency piezoelectric energy harvesting. For the purpose of helping design piezoelectric energy harvesting system according to different application scenarios, we summarize and discuss the recent design trends and challenges. We divide the design [...] Read more.
This brief presents a tutorial on multifaceted techniques for high efficiency piezoelectric energy harvesting. For the purpose of helping design piezoelectric energy harvesting system according to different application scenarios, we summarize and discuss the recent design trends and challenges. We divide the design focus into the following three categories, namely, (1) AC-DC rectifiers, (2) CP compensation circuits, (3) maximum power point tracking (MPPT) circuits. The features, problems encountered, and suitable systems of various AC-DC rectifier topologies are introduced and compared. The important role of non-linear methods for piezoelectric energy harvesting is illustrated from the perspective of impedance matching. Energy extraction techniques and voltage flipping techniques based on inductors, capacitors, and hybrid structures are analyzed. MPPT techniques with different features and targets are discussed. Full article
(This article belongs to the Special Issue Piezoelectric Energy Harvesting: Analysis, Design and Fabrication)
Show Figures

Figure 1

13 pages, 1962 KiB  
Article
An Efficient and Effective Deep Learning-Based Model for Real-Time Face Mask Detection
by Shabana Habib, Majed Alsanea, Mohammed Aloraini, Hazim Saleh Al-Rawashdeh, Muhammad Islam and Sheroz Khan
Sensors 2022, 22(7), 2602; https://doi.org/10.3390/s22072602 - 29 Mar 2022
Cited by 28 | Viewed by 3769
Abstract
Since December 2019, the COVID-19 pandemic has led to a dramatic loss of human lives and caused severe economic crises worldwide. COVID-19 virus transmission generally occurs through a small respiratory droplet ejected from the mouth or nose of an infected person to another [...] Read more.
Since December 2019, the COVID-19 pandemic has led to a dramatic loss of human lives and caused severe economic crises worldwide. COVID-19 virus transmission generally occurs through a small respiratory droplet ejected from the mouth or nose of an infected person to another person. To reduce and prevent the spread of COVID-19 transmission, the World Health Organization (WHO) advises the public to wear face masks as one of the most practical and effective prevention methods. Early face mask detection is very important to prevent the spread of COVID-19. For this purpose, we investigate several deep learning-based architectures such as VGG16, VGG19, InceptionV3, ResNet-101, ResNet-50, EfficientNet, MobileNetV1, and MobileNetV2. After these experiments, we propose an efficient and effective model for face mask detection with the potential to be deployable over edge devices. Our proposed model is based on MobileNetV2 architecture that extracts salient features from the input data that are then passed to an autoencoder to form more abstract representations prior to the classification layer. The proposed model also adopts extensive data augmentation techniques (e.g., rotation, flip, Gaussian blur, sharping, emboss, skew, and shear) to increase the number of samples for effective training. The performance of our proposed model is evaluated on three publicly available datasets and achieved the highest performance as compared to other state-of-the-art models. Full article
(This article belongs to the Special Issue IoT Enabling Technologies for Smart Cities: Challenges and Approaches)
Show Figures

Figure 1

19 pages, 8460 KiB  
Article
DBA_SSD: A Novel End-to-End Object Detection Algorithm Applied to Plant Disease Detection
by Jun Wang, Liya Yu, Jing Yang and Hao Dong
Information 2021, 12(11), 474; https://doi.org/10.3390/info12110474 - 16 Nov 2021
Cited by 29 | Viewed by 3421
Abstract
In response to the difficulty of plant leaf disease detection and classification, this study proposes a novel plant leaf disease detection method called deep block attention SSD (DBA_SSD) for disease identification and disease degree classification of plant leaves. We propose three plant leaf [...] Read more.
In response to the difficulty of plant leaf disease detection and classification, this study proposes a novel plant leaf disease detection method called deep block attention SSD (DBA_SSD) for disease identification and disease degree classification of plant leaves. We propose three plant leaf detection methods, namely, squeeze-and-excitation SSD (Se_SSD), deep block SSD (DB_SSD), and DBA_SSD. Se_SSD fuses SSD feature extraction network and attention mechanism channel, DB_SSD improves VGG feature extraction network, and DBA_SSD fuses the improved VGG network and channel attention mechanism. To reduce the training time and accelerate the training process, the convolutional layers trained in the Image Net image dataset by the VGG model are migrated to this model, whereas the collected plant leaves disease image dataset is randomly divided into training set, validation set, and test set in the ratio of 8:1:1. We chose the PlantVillage dataset after careful consideration because it contains images related to the domain of interest. This dataset consists of images of 14 plants, including images of apples, tomatoes, strawberries, peppers, and potatoes, as well as the leaves of other plants. In addition, data enhancement methods, such as histogram equalization and horizontal flip were used to expand the image data. The performance of the three improved algorithms is compared and analyzed in the same environment and with the classical target detection algorithms YOLOv4, YOLOv3, Faster RCNN, and YOLOv4 tiny. Experiments show that DBA_SSD outperforms the two other improved algorithms, and its performance in comparative analysis is superior to other target detection algorithms. Full article
Show Figures

Figure 1

Back to TopTop