Abstract
For large-scale medical data, annotation requires certain medical knowledge and experience, and manual annotation takes a lot of time and human resources. The application of deep learning technology in medical images can help doctors make diagnoses more quickly and accurately, which promotes technological progress in the medical field. However, images may contain sensitive information such as disease details and individual body structures. It has been shown that an attacker can determine whether a patient participates in training by launching a membership inference attack. To prevent the leakage of patient training samples, this paper proposes a privacy-preserving scheme named PATE-Medical for training high-performance medical image classifiers using a small number of labeled samples. It is based on the idea of employing a faculty-student architecture to preserve the privacy of the training data. To overcome the low accuracy and privacy challenges posed by limited medical images, firstly, a Siamese neural network is utilized to train a single faculty model to obtain accurate prediction results with insufficient training samples. Then, the student model is trained as the output classifier model based on the semi-supervised Mixmatch method, which aims to ensure the performance of the student model while reducing the loss of privacy spent on student training. Experiments show that the accuracy of faculty ensemble voting in PATE-Medical reaches 98.25%; the student model achieves a maximum accuracy of 97%. The minimum privacy cost is only 1.154.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Abadi, M., et al.: Deep learning with differential privacy. In: Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, pp. 308–318 (2016)
Adnan, M., Kalra, S., Cresswell, J.C., Taylor, G.W., Tizhoosh, H.R.: Federated learning and differential privacy for medical image analysis. Sci. Rep. 12(1), 1953 (2022)
Berthelot, D., Carlini, N., Goodfellow, I., Papernot, N., Oliver, A., Raffel, C.A.: Mixmatch: a holistic approach to semi-supervised learning. In: Advances in Neural Information Processing Systems, vol. 32 (2019)
Chowdhury, M.E., et al.: Can AI help in screening viral and COVID-19 pneumonia? IEEE Access 8, 132665–132676 (2020)
Dwork, C., Kenthapadi, K., McSherry, F., Mironov, I., Naor, M.: Our data, ourselves: privacy via distributed noise generation. In: Vaudenay, S. (ed.) EUROCRYPT 2006. LNCS, vol. 4004, pp. 486–503. Springer, Heidelberg (2006). https://doi.org/10.1007/11761679_29
Dwork, C., Roth, A., et al.: The algorithmic foundations of differential privacy. Found. Trends® Theoret. Comput. Sci. 9(3–4), 211–407 (2014)
Faisal, F.: Privacy-preserving synthetic image data generation and classification (2023)
Gwon, H., et al.: LDP-GAN: generative adversarial networks with local differential privacy for patient medical records synthesis. Comput. Biol. Med. 168, 107738 (2024)
Hadsell, R., Chopra, S., LeCun, Y.: Dimensionality reduction by learning an invariant mapping. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), vol. 2, pp. 1735–1742. IEEE (2006)
Luan, H., et al.: Multi-class cancer classification of whole slide images through transformer and multiple instance learning. In: Guo, X., Mangul, S., Patterson, M., Zelikovsky, A. (eds.) ISBRA 2023. LNCS, vol. 14248, pp. 150–164. Springer, Singapore (2023). https://doi.org/10.1007/978-981-99-7074-2_12
Narayan, V., Mall, P.K., Awasthi, S., Srivastava, S., Gupta, A.: FuzzyNet: medical image classification based on GLCM texture feature. In: 2023 International Conference on Artificial Intelligence and Smart Communication (AISC), pp. 769–773. IEEE (2023)
Papernot, N., Abadi, M., Erlingsson, U., Goodfellow, I., Talwar, K.: Semi-supervised knowledge transfer for deep learning from private training data. arXiv preprint arXiv:1610.05755 (2016)
Rahman, T., et al.: Exploring the effect of image enhancement techniques on COVID-19 detection using chest X-Ray images. Comput. Biol. Med. 132, 104319 (2021)
Rehman, A., Abbas, S., Khan, M., Ghazal, T.M., Adnan, K.M., Mosavi, A.: A secure healthcare 5.0 system based on blockchain technology entangled with federated learning technique. Comput. Biol. Med. 150, 106019 (2022)
Shorfuzzaman, M., Hossain, M.S.: MetaCOVID: a Siamese neural network framework with contrastive loss for n-shot diagnosis of COVID-19 patients. Pattern Recogn. 113, 107700 (2021)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Tayebi Arasteh, S., et al.: Preserving fairness and diagnostic accuracy in private large-scale AI models for medical imaging. Commun. Med. 4(1), 46 (2024)
Zhang, H., Cisse, M., Dauphin, Y.N., Lopez-Paz, D.: mixup: beyond empirical risk minimization. arXiv preprint arXiv:1710.09412 (2017)
Ziller, A., Usynin, D., Braren, R., Makowski, M., Rueckert, D., Kaissis, G.: Medical imaging deep learning with differential privacy. Sci. Rep. 11(1), 13524 (2021)
Acknowledgments
This work was supported by the Science and Technology Development Plan Project of Henan Province (No. 242102211062; No. 222102210238; No. 212102210091); the National Natural Science Foundation of China under Grant number (No. 61972134).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Yan, C., Yin, M., Liang, W., Yan, H., Luo, H., Luo, J. (2024). Enhancing Privacy and Preserving Accuracy in Medical Image Classification with Limited Labeled Samples. In: Peng, W., Cai, Z., Skums, P. (eds) Bioinformatics Research and Applications. ISBRA 2024. Lecture Notes in Computer Science(), vol 14954. Springer, Singapore. https://doi.org/10.1007/978-981-97-5128-0_31
Download citation
DOI: https://doi.org/10.1007/978-981-97-5128-0_31
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-5127-3
Online ISBN: 978-981-97-5128-0
eBook Packages: Computer ScienceComputer Science (R0)