research-article

Scarcity-GAN: : Scarce data augmentation for defect detection via generative adversarial nets

Authors:

Fengling Zheng,

Xiaowu Zhang, and

Bin ChenAuthors Info & Claims

Volume 566, Issue C

https://doi.org/10.1016/j.neucom.2023.127061

Published: 04 March 2024 Publication History

Abstract

Data augmentation is a crucial and challenging task for improving defect detection with limited data. Many generative models have been proposed and shown promising performance on this task. However, existing models are unable to capture the fine features of defects when training data is scarce, resulting in the inability to synthesize defects and a lack of diversity in the synthesized defects. Additionally, most models do not consider the location of synthesized defects in the image, thus limiting the ability for augmenting defect data through data generation. In this paper, we propose a new augmentation model named Scarce Data Augmentation Generative Adversarial Nets (Scarcity-GAN) to address the scarce data augmentation problem. Firstly, we design a new clustering module which selects data containing similar features to the target defect from extra datasets, in order to help the GAN learn the features of the target defect. Secondly, we modify the vanilla generator with an Encoder–Decoder model. The generator takes two inputs: one is the defect-free images, which are encoded by the Encoder to obtain defect-free features, and the other is the extra defect feature maps in the target defect set after clustering. Next, we design a Fusion Patch-Embedding module to merge the two different features, ensuring that the synthesized defects are located on the object accurately. We also design a new loss function for the generator, and then prove that it makes our model get converged. Last, we conduct extensive experiments to demonstrate the significant performance improvement and generalizability of Scarcity-GAN on two scarce datasets: industrial O-ring and Metal Iron Sheet datasets; and one general dataset: the public CODEBRIM dataset. The experimental results show that our Scarcity-GAN outperforms the SOTA augmentation models on different scarce datasets.

References

[1]

Lin W., Gao J., Wang Q., Li X., Learning to detect anomaly events in crowd scenes from synthetic data, Neurocomputing 436 (2021) 248–259.

[2]

Zhang H., Pan D., Liu J., Jiang Z., A novel MAS-GAN-based data synthesis method for object surface defect detection, Neurocomputing 499 (2022) 106–114.

[3]

Ge Z., Liu S., Wang F., Li Z., Sun J., Yolox: Exceeding yolo series in 2021, 2021, arXiv preprint arXiv:2107.08430.

[4]

Ren S., He K., Girshick R., Sun J., Faster R-CNN: Towards real-time object detection with region proposal networks, Adv. Neural Inf. Process. Syst. 28 (2015).

[5]

Lee H., Hwang S.J., Shin J., Rethinking data augmentation: Self-supervision and self-distillation, 2019.

[6]

Li W., Chen J., Cao J., Ma C., Wang J., Cui X., Chen P., EID-GAN: Generative adversarial nets for extremely imbalanced data augmentation, IEEE Trans. Ind. Inform. (2022).

[7]

Chawla N.V., Bowyer K.W., Hall L.O., Kegelmeyer W.P., SMOTE: Synthetic minority over-sampling technique, J. Artificial Intelligence Res. 16 (2002) 321–357.

[8]

C.-L. Li, K. Sohn, J. Yoon, T. Pfister, Cutpaste: Self-supervised learning for anomaly detection and localization, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 9664–9674.

[9]

Zhou S., Zhang J., Jiang H., Lundh T., Ng A.Y., Data augmentation with mobius transformations, Mach. Learn.: Sci. Technol. 2 (2) (2021).

[10]

Lu B., Zhang M., Huang B., Deep adversarial data augmentation for fabric defect classification with scarce defect data, IEEE Trans. Instrum. Meas. 71 (2022) 1–13.

[11]

T. Karras, S. Laine, T. Aila, A style-based generator architecture for generative adversarial networks, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 4401–4410.

[12]

T. Karras, S. Laine, M. Aittala, J. Hellsten, J. Lehtinen, T. Aila, Analyzing and improving the image quality of stylegan, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 8110–8119.

[13]

Karras T., Aittala M., Hellsten J., Laine S., Lehtinen J., Aila T., Training generative adversarial networks with limited data, Adv. Neural Inf. Process. Syst. 33 (2020) 12104–12114.

[14]

Jiang L., Dai B., Wu W., Loy C.C., Deceive D: Adaptive pseudo augmentation for GAN training with limited data, Adv. Neural Inf. Process. Syst. 34 (2021) 21655–21667.

[15]

Yang M., Wang Z., Chi Z., Zhang Y., FreGAN: Exploiting frequency components for training GANs under limited data, 2022, arXiv preprint arXiv:2210.05461.

[16]

Karras T., Aittala M., Laine S., Härkönen E., Hellsten J., Lehtinen J., Aila T., Alias-free generative adversarial networks, Adv. Neural Inf. Process. Syst. 34 (2021) 852–863.

[17]

Wu Z., Nitzan Y., Shechtman E., Lischinski D., Stylealign: Analysis and applications of aligned stylegan models, 2021, arXiv preprint arXiv:2110.11323.

[18]

Liu J., Wang C., Su H., Du B., Tao D., Multistage GAN for fabric defect detection, IEEE Trans. Image Process. 29 (2019) 3388–3400.

Digital Library

[19]

Niu S., Li B., Wang X., Lin H., Defect image sample generation with GAN for improving defect recognition, IEEE Trans. Autom. Sci. Eng. 17 (3) (2020) 1611–1622.

[20]

Hartigan J.A., Wong M.A., Algorithm AS 136: A k-means clustering algorithm, J. R. Stat. Soc. Ser. C. Appl. Stat. 28 (1) (1979) 100–108.

[21]

Anand S., Mittal S., Tuzel O., Meer P., Semi-supervised kernel mean shift clustering, IEEE Trans. Pattern Anal. Mach. Intell. 36 (6) (2013) 1201–1215.

[22]

Li W., Liang Z., Ma P., Wang R., Cui X., Chen P., Hausdorff gan: improving gan generation quality with hausdorff metric, IEEE Transactions on Cybernetics (2021).

[23]

Goodfellow I., Pouget-Abadie J., Mirza M., Xu B., Warde-Farley D., Ozair S., Courville A., Bengio Y., Generative adversarial nets, Adv. Neural Inf. Process. Syst. 27 (2014).

[24]

Radford A., Metz L., Chintala S., Unsupervised representation learning with deep convolutional generative adversarial networks, 2015, arXiv preprint arXiv:1511.06434.

[25]

S. Gurumurthy, R. Kiran Sarvadevabhatla, R. Venkatesh Babu, DeliGAN: Generative adversarial networks for diverse and limited data, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 166–174.

[26]

Li W., Xu L., Liang Z., Wang S., Cao J., Lam T.C., Cui X., JDGAN: Enhancing generator on extremely limited data via joint distribution, Neurocomputing 431 (2021) 148–162.

[27]

Li W., Chen J., Wang Z., Shen Z., Ma C., Cui X., Ifl-gan: improved federated learning generative adversarial network with maximum mean discrepancy model aggregation, IEEE Transactions on Neural Networks and Learning Systems (2022).

[28]

L.A. Gatys, A.S. Ecker, M. Bethge, Image style transfer using convolutional neural networks, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 2414–2423.

[29]

Krizhevsky A., Sutskever I., Hinton G.E., Imagenet classification with deep convolutional neural networks, Commun. ACM 60 (6) (2017) 84–90.

Digital Library

[30]

Simonyan K., Zisserman A., Very deep convolutional networks for large-scale image recognition, 2014, arXiv preprint arXiv:1409.1556.

[31]

Li W., Gu C., Chen J., Ma C., Zhang X., Chen B., Chen P., Dw-gan: toward high-fidelity color-tones of gan-generated images with dynamic weights, IEEE Transactions on Neural Networks and Learning Systems (2023).

[32]

Li W., Gu C., Chen J., Ma C., Zhang X., Chen B., Wan S., Dls-gan: generative adversarial nets for defect location sensitive data augmentation, IEEE Transactions on Automation Science and Engineering (2023).

[33]

Mukherjee S., Asnani H., Lin E., Kannan S., ClusterGAN: Latent space clustering in generative adversarial networks, Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, 2019, pp. 4610–4617.

[34]

Kingma D.P., Welling M., Auto-encoding variational bayes, 2013, arXiv preprint arXiv:1312.6114.

[35]

Sion M., et al., On general minimax theorems, Pacific J. Math. 8 (1) (1958) 171–176.

[36]

Božič J., Tabernik D., Skočaj D., End-to-end training of a two-stage neural network for defect detection, in: 2020 25th International Conference on Pattern Recognition, ICPR, IEEE, 2021, pp. 5619–5626.

[37]

Huang Y., Qiu C., Yuan K., Surface defect saliency of magnetic tile, Vis. Comput. 36 (2020) 85–96.

Digital Library

[38]

Shi Y., Cui L., Qi Z., Meng F., Chen Z., Automatic road crack detection using random structured forests, IEEE Trans. Intell. Transp. Syst. 17 (12) (2016) 3434–3445.

Digital Library

[39]

Li W., Fan L., Wang Z., Ma C., Cui X., Tackling mode collapse in multi-generator gans with orthogonal vectors, Pattern Recognition 110 (2021) 107646.

[40]

Gulrajani I., Ahmed F., Arjovsky M., Dumoulin V., Courville A.C., Improved training of wasserstein gans, Adv. Neural Inf. Process. Syst. 30 (2017).

[41]

Frid-Adar M., Diamant I., Klang E., Amitai M., Goldberger J., Greenspan H., GAN-based synthetic medical image augmentation for increased CNN performance in liver lesion classification, Neurocomputing 321 (2018) 321–331.

[42]

Yang L., Lu Y., Yang S.X., Guo T., Liang Z., A secure clustering protocol with fuzzy trust evaluation and outlier detection for industrial wireless sensor networks, IEEE Trans. Ind. Inform. 17 (7) (2020) 4837–4847.

[43]

Kong F., Li J., Jiang B., Wang H., Song H., Integrated generative model for industrial anomaly detection via bidirectional LSTM and attention mechanism, IEEE Trans. Ind. Inform. 19 (1) (2021) 541–550.

Index Terms

Scarcity-GAN: Scarce data augmentation for defect detection via generative adversarial nets
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
  2. Machine learning
    1. Learning paradigms
      1. Supervised learning
    2. Machine learning approaches
      1. Neural networks

Index terms have been assigned to the content through auto-classification.

Recommendations

Missing Data Imputation Using Data Generated By GAN
ICCBD '20: 2020 the 3rd International Conference on Computing and Big Data

Missing data is a common and challenging problem that arises in many research domains and led to the complication of data analysis. Therefore, handling missing data is a necessity as proposed in many previous studies. In this paper, we proposed two ...
Read More
GAN-Based Anomaly Detection In Imbalance Problems
Computer Vision – ECCV 2020 Workshops
Abstract
Imbalance pre one of the key issues that affect the performance greatly. Our focus in this work is to address an imbalance problem arising from defect detection in industrial inspections, including the different number of defect and non-defect ...
Read More
A Semi-supervised Defect Detection Method Based on Image Inpainting
PRICAI 2021: Trends in Artificial Intelligence
Abstract
Defect detection plays an important role in the industrial field. Because the defective images are often insufficient and defects can be various, defective image synthesis is commonly used and models always tend to learn the distribution of ...
Read More

Comments

Information & Contributors

Information

Published In

cover image Neurocomputing

Neurocomputing Volume 566, Issue C

Jan 2024

240 pages

ISSN:0925-2312

Issue’s Table of Contents

Elsevier B.V.

Publisher

Elsevier Science Publishers B. V.

Netherlands

Publication History

Published: 04 March 2024

Author Tags

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Other Metrics

View Author Metrics

Citations

View Options

View options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Issue’s Table of Contents