Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Scarcity-GAN: : Scarce data augmentation for defect detection via generative adversarial nets

Published: 04 March 2024 Publication History
  • Get Citation Alerts
  • Abstract

    Data augmentation is a crucial and challenging task for improving defect detection with limited data. Many generative models have been proposed and shown promising performance on this task. However, existing models are unable to capture the fine features of defects when training data is scarce, resulting in the inability to synthesize defects and a lack of diversity in the synthesized defects. Additionally, most models do not consider the location of synthesized defects in the image, thus limiting the ability for augmenting defect data through data generation. In this paper, we propose a new augmentation model named Scarce Data Augmentation Generative Adversarial Nets (Scarcity-GAN) to address the scarce data augmentation problem. Firstly, we design a new clustering module which selects data containing similar features to the target defect from extra datasets, in order to help the GAN learn the features of the target defect. Secondly, we modify the vanilla generator with an Encoder–Decoder model. The generator takes two inputs: one is the defect-free images, which are encoded by the Encoder to obtain defect-free features, and the other is the extra defect feature maps in the target defect set after clustering. Next, we design a Fusion Patch-Embedding module to merge the two different features, ensuring that the synthesized defects are located on the object accurately. We also design a new loss function for the generator, and then prove that it makes our model get converged. Last, we conduct extensive experiments to demonstrate the significant performance improvement and generalizability of Scarcity-GAN on two scarce datasets: industrial O-ring and Metal Iron Sheet datasets; and one general dataset: the public CODEBRIM dataset. The experimental results show that our Scarcity-GAN outperforms the SOTA augmentation models on different scarce datasets.

    References

    [1]
    Lin W., Gao J., Wang Q., Li X., Learning to detect anomaly events in crowd scenes from synthetic data, Neurocomputing 436 (2021) 248–259.
    [2]
    Zhang H., Pan D., Liu J., Jiang Z., A novel MAS-GAN-based data synthesis method for object surface defect detection, Neurocomputing 499 (2022) 106–114.
    [3]
    Ge Z., Liu S., Wang F., Li Z., Sun J., Yolox: Exceeding yolo series in 2021, 2021, arXiv preprint arXiv:2107.08430.
    [4]
    Ren S., He K., Girshick R., Sun J., Faster R-CNN: Towards real-time object detection with region proposal networks, Adv. Neural Inf. Process. Syst. 28 (2015).
    [5]
    Lee H., Hwang S.J., Shin J., Rethinking data augmentation: Self-supervision and self-distillation, 2019.
    [6]
    Li W., Chen J., Cao J., Ma C., Wang J., Cui X., Chen P., EID-GAN: Generative adversarial nets for extremely imbalanced data augmentation, IEEE Trans. Ind. Inform. (2022).
    [7]
    Chawla N.V., Bowyer K.W., Hall L.O., Kegelmeyer W.P., SMOTE: Synthetic minority over-sampling technique, J. Artificial Intelligence Res. 16 (2002) 321–357.
    [8]
    C.-L. Li, K. Sohn, J. Yoon, T. Pfister, Cutpaste: Self-supervised learning for anomaly detection and localization, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 9664–9674.
    [9]
    Zhou S., Zhang J., Jiang H., Lundh T., Ng A.Y., Data augmentation with mobius transformations, Mach. Learn.: Sci. Technol. 2 (2) (2021).
    [10]
    Lu B., Zhang M., Huang B., Deep adversarial data augmentation for fabric defect classification with scarce defect data, IEEE Trans. Instrum. Meas. 71 (2022) 1–13.
    [11]
    T. Karras, S. Laine, T. Aila, A style-based generator architecture for generative adversarial networks, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 4401–4410.
    [12]
    T. Karras, S. Laine, M. Aittala, J. Hellsten, J. Lehtinen, T. Aila, Analyzing and improving the image quality of stylegan, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 8110–8119.
    [13]
    Karras T., Aittala M., Hellsten J., Laine S., Lehtinen J., Aila T., Training generative adversarial networks with limited data, Adv. Neural Inf. Process. Syst. 33 (2020) 12104–12114.
    [14]
    Jiang L., Dai B., Wu W., Loy C.C., Deceive D: Adaptive pseudo augmentation for GAN training with limited data, Adv. Neural Inf. Process. Syst. 34 (2021) 21655–21667.
    [15]
    Yang M., Wang Z., Chi Z., Zhang Y., FreGAN: Exploiting frequency components for training GANs under limited data, 2022, arXiv preprint arXiv:2210.05461.
    [16]
    Karras T., Aittala M., Laine S., Härkönen E., Hellsten J., Lehtinen J., Aila T., Alias-free generative adversarial networks, Adv. Neural Inf. Process. Syst. 34 (2021) 852–863.
    [17]
    Wu Z., Nitzan Y., Shechtman E., Lischinski D., Stylealign: Analysis and applications of aligned stylegan models, 2021, arXiv preprint arXiv:2110.11323.
    [18]
    Liu J., Wang C., Su H., Du B., Tao D., Multistage GAN for fabric defect detection, IEEE Trans. Image Process. 29 (2019) 3388–3400.
    [19]
    Niu S., Li B., Wang X., Lin H., Defect image sample generation with GAN for improving defect recognition, IEEE Trans. Autom. Sci. Eng. 17 (3) (2020) 1611–1622.
    [20]
    Hartigan J.A., Wong M.A., Algorithm AS 136: A k-means clustering algorithm, J. R. Stat. Soc. Ser. C. Appl. Stat. 28 (1) (1979) 100–108.
    [21]
    Anand S., Mittal S., Tuzel O., Meer P., Semi-supervised kernel mean shift clustering, IEEE Trans. Pattern Anal. Mach. Intell. 36 (6) (2013) 1201–1215.
    [22]
    Li W., Liang Z., Ma P., Wang R., Cui X., Chen P., Hausdorff gan: improving gan generation quality with hausdorff metric, IEEE Transactions on Cybernetics (2021).
    [23]
    Goodfellow I., Pouget-Abadie J., Mirza M., Xu B., Warde-Farley D., Ozair S., Courville A., Bengio Y., Generative adversarial nets, Adv. Neural Inf. Process. Syst. 27 (2014).
    [24]
    Radford A., Metz L., Chintala S., Unsupervised representation learning with deep convolutional generative adversarial networks, 2015, arXiv preprint arXiv:1511.06434.
    [25]
    S. Gurumurthy, R. Kiran Sarvadevabhatla, R. Venkatesh Babu, DeliGAN: Generative adversarial networks for diverse and limited data, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 166–174.
    [26]
    Li W., Xu L., Liang Z., Wang S., Cao J., Lam T.C., Cui X., JDGAN: Enhancing generator on extremely limited data via joint distribution, Neurocomputing 431 (2021) 148–162.
    [27]
    Li W., Chen J., Wang Z., Shen Z., Ma C., Cui X., Ifl-gan: improved federated learning generative adversarial network with maximum mean discrepancy model aggregation, IEEE Transactions on Neural Networks and Learning Systems (2022).
    [28]
    L.A. Gatys, A.S. Ecker, M. Bethge, Image style transfer using convolutional neural networks, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 2414–2423.
    [29]
    Krizhevsky A., Sutskever I., Hinton G.E., Imagenet classification with deep convolutional neural networks, Commun. ACM 60 (6) (2017) 84–90.
    [30]
    Simonyan K., Zisserman A., Very deep convolutional networks for large-scale image recognition, 2014, arXiv preprint arXiv:1409.1556.
    [31]
    Li W., Gu C., Chen J., Ma C., Zhang X., Chen B., Chen P., Dw-gan: toward high-fidelity color-tones of gan-generated images with dynamic weights, IEEE Transactions on Neural Networks and Learning Systems (2023).
    [32]
    Li W., Gu C., Chen J., Ma C., Zhang X., Chen B., Wan S., Dls-gan: generative adversarial nets for defect location sensitive data augmentation, IEEE Transactions on Automation Science and Engineering (2023).
    [33]
    Mukherjee S., Asnani H., Lin E., Kannan S., ClusterGAN: Latent space clustering in generative adversarial networks, Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, 2019, pp. 4610–4617.
    [34]
    Kingma D.P., Welling M., Auto-encoding variational bayes, 2013, arXiv preprint arXiv:1312.6114.
    [35]
    Sion M., et al., On general minimax theorems, Pacific J. Math. 8 (1) (1958) 171–176.
    [36]
    Božič J., Tabernik D., Skočaj D., End-to-end training of a two-stage neural network for defect detection, in: 2020 25th International Conference on Pattern Recognition, ICPR, IEEE, 2021, pp. 5619–5626.
    [37]
    Huang Y., Qiu C., Yuan K., Surface defect saliency of magnetic tile, Vis. Comput. 36 (2020) 85–96.
    [38]
    Shi Y., Cui L., Qi Z., Meng F., Chen Z., Automatic road crack detection using random structured forests, IEEE Trans. Intell. Transp. Syst. 17 (12) (2016) 3434–3445.
    [39]
    Li W., Fan L., Wang Z., Ma C., Cui X., Tackling mode collapse in multi-generator gans with orthogonal vectors, Pattern Recognition 110 (2021) 107646.
    [40]
    Gulrajani I., Ahmed F., Arjovsky M., Dumoulin V., Courville A.C., Improved training of wasserstein gans, Adv. Neural Inf. Process. Syst. 30 (2017).
    [41]
    Frid-Adar M., Diamant I., Klang E., Amitai M., Goldberger J., Greenspan H., GAN-based synthetic medical image augmentation for increased CNN performance in liver lesion classification, Neurocomputing 321 (2018) 321–331.
    [42]
    Yang L., Lu Y., Yang S.X., Guo T., Liang Z., A secure clustering protocol with fuzzy trust evaluation and outlier detection for industrial wireless sensor networks, IEEE Trans. Ind. Inform. 17 (7) (2020) 4837–4847.
    [43]
    Kong F., Li J., Jiang B., Wang H., Song H., Integrated generative model for industrial anomaly detection via bidirectional LSTM and attention mechanism, IEEE Trans. Ind. Inform. 19 (1) (2021) 541–550.

    Index Terms

    1. Scarcity-GAN: Scarce data augmentation for defect detection via generative adversarial nets
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image Neurocomputing
          Neurocomputing  Volume 566, Issue C
          Jan 2024
          240 pages

          Publisher

          Elsevier Science Publishers B. V.

          Netherlands

          Publication History

          Published: 04 March 2024

          Author Tags

          1. Scarce data augmentation
          2. GAN
          3. Defect detection

          Qualifiers

          • Research-article

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • 0
            Total Citations
          • 0
            Total Downloads
          • Downloads (Last 12 months)0
          • Downloads (Last 6 weeks)0

          Other Metrics

          Citations

          View Options

          View options

          Get Access

          Login options

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media