Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Imbalanced Breast Cancer Classification Using Transfer Learning

Published: 03 February 2021 Publication History

Abstract

Accurate breast cancer detection using automated algorithms remains a problem within the literature. Although a plethora of work has tried to address this issue, an exact solution is yet to be found. This problem is further exacerbated by the fact that most of the existing datasets are imbalanced, i.e., the number of instances of a particular class far exceeds that of the others. In this paper, we propose a framework based on the notion of transfer learning to address this issue and focus our efforts on histopathological and imbalanced image classification. We use the popular VGG-19 as the base model and complement it with several state-of-the-art techniques to improve the overall performance of the system. With the ImageNet dataset taken as the source domain, we apply the learned knowledge in the target domain consisting of histopathological images. With experimentation performed on a large-scale dataset consisting of 277,524 images, we show that the framework proposed in this paper gives superior performance than those available in the existing literature. Through numerical simulations conducted on a supercomputer, we also present guidelines for work in transfer learning and imbalanced image classification.

References

[1]
I. A. of Cancer Research (2018) Latest Global Cancer Data:, “Cancer burden rises to 18.1 million new cases and 9.6 million cancer deaths in 2018,” World Health Organization, Geneva, Nov. 11, 2019. [Online]. Available: https://www.who.int/cancer/PRGlobocanFinal.pdf
[2]
G. K. Malhotra, X. Zhao, H. Band, and V. Band, “Histological, molecular and functional subtypes of breast cancers,” Cancer Biol. Ther., vol. 10, no. 10, pp. 955–960, 2010.
[3]
F. A. Tavassoli, “Pathology and genetics of tumours of the breast and female genital organs,” World Hhealth Org. Classification Tumours, vol. 46, pp. 229–229, 2003.
[4]
M. F. Lerwill, “Current practical applications of diagnostic immunohistochemistry in breast pathology,” Amer. J. Surgical Pathol., vol. 28, no. 8, pp. 1076–1091, 2004.
[5]
B.-N. Zhang, et al., “Guidelines on the diagnosis and treatment of breast cancer (2011 edition),” Gland Surgery, vol. 1, no. 1, 2012, Art. no.
[6]
C. Pagani, D. Coscia, C. Dellabianca, M. Bonardi, S. Alessi, and F. Calliada, “Ultrasound guided fine-needle aspiration cytology of breast lesions,” J. Ultrasound, vol. 14, no. 4, pp. 182–187, 2011.
[7]
S. Hussain, et al., “Multiplexed autoantibody signature for serological detection of canine mammary tumours,” Sci. Rep., vol. 8, no. 1, 2018, Art. no.
[8]
S. Mittal, H. Kaur, N. Gautam, and A. K. Mantha, “Biosensors for breast cancer diagnosis: A review of bioreceptors, biotransducers and signal amplification strategies,” Biosensors Bioelectronics, vol. 88, pp. 217–231, 2017.
[9]
S. C. Jena, et al., “Surface plasmon resonance immunosensor for label-free detection of BIRC5 biomarker in spontaneously occurring canine mammary tumours,” Sci. Rep., vol. 9, no. 1, pp. 1–12, 2019.
[10]
A. F. Taktak and A. C. Fisher, Outcome Prediction in Cancer. Amsterdam, The Netherlands: Elsevier, 2006.
[11]
W. H. Wolberg, W. N. Street, and O. Mangasarian, “Machine learning techniques to diagnose breast cancer from image-processed nuclear features of fine needle aspirates,” Cancer Lett., vol. 77, no. 2/3, pp. 163–171, 1994.
[12]
H. R. Roth, et al., “Anatomy-specific classification of medical images using deep convolutional nets,” in Proc. IEEE 12th Int. Symp. Biomed. Imag., 2015, pp. 101–104.
[13]
G. Wang, et al., “Interactive medical image segmentation using deep learning with image-specific fine tuning,” IEEE Trans. Med. Imag., vol. 37, no. 7, pp. 1562–1573, Jul. 2018.
[14]
A. Cruz-Roa, et al., “Automatic detection of invasive ductal carcinoma in whole slide images with convolutional neural networks,” in Proc. SPIE Med. Imag.: Digit. Pathol., 2014, Art. no.
[15]
A. Janowczyk and A. Madabhushi, “Deep learning for digital pathology image analysis: A comprehensive tutorial with selected use cases,” J. Pathol. Informat., vol. 7, 2016, Art. no.
[16]
M. S. Reza and J. Ma, “Imbalanced histopathological breast cancer image classification with convolutional neural network,” in Proc. 14th IEEE Int. Conf. Signal Process., 2018, pp. 619–624.
[17]
S. Li, Z. Wang, G. Zhou, and S. Y. M. Lee, “Semi-supervised learning for imbalanced sentiment classification,” in Proc. 22nd Int. Joint Conf. Artif. Intell., 2011, pp. 1826–1831.
[18]
B. Krawczyk, M. Woźniak, and G. Schaefer, “Cost-sensitive decision tree ensembles for effective imbalanced classification,” Appl. Soft Comput., vol. 14, pp. 554–562, 2014.
[19]
S. González, S. García, S.-T. Li, and F. Herrera, “Chain based sampling for monotonic imbalanced classification,” Inf. Sci., vol. 474, pp. 187–204, 2019.
[20]
M. Lázaro, F. Herrera, and A. R. Figueiras-Vidal, “Ensembles of cost-diverse Bayesian neural learners for imbalanced binary classification,” Inf. Sci., vol. 520, pp. 31–45, 2020.
[21]
K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” 2014,.
[22]
D. Shen, G. Wu, and H.-I. Suk, “Deep learning in medical image analysis,” Annu. Rev. Biomed. Eng., vol. 19, pp. 221–248, 2017.
[23]
X. Li, T. Pang, B. Xiong, W. Liu, P. Liang, and T. Wang, “Convolutional neural networks based transfer learning for diabetic retinopathy fundus image classification,” in Proc. 10th Int. Congr. Image Signal Process. BioMed. Eng. Informat., 2017, pp. 1–11.
[24]
M. Gao, et al., “Holistic classification of CT attenuation patterns for interstitial lung diseases via deep convolutional neural networks,” Comput. Methods Biomechanics Biomed. Eng.: Imag. Vis., vol. 6, no. 1, pp. 1–6, 2018.
[25]
B. Q. Huynh, H. Li, and M. L. Giger, “Digital mammographic tumor classification using transfer learning from deep convolutional neural networks,” J. Med. Imag., vol. 3, no. 3, 2016, Art. no.
[26]
B. Demir, F. Bovolo, and L. Bruzzone, “Updating land-cover maps by classification of image time series: A novel change-detection-driven transfer learning approach,” IEEE Trans. Geosci. Remote Sens., vol. 51, no. 1, pp. 300–312, Jan. 2013.
[27]
L. Ge, J. Gao, H. Ngo, K. Li, and A. Zhang, “On handling negative transfer and imbalanced distributions in multiple source transfer learning,” Statist. Anal. Data Mining: ASA Data Sci. J., vol. 7, no. 4, pp. 254–271, 2014.
[28]
S. Al-Stouhi and C. K. Reddy, “Transfer learning for class imbalance problems with inadequate data,” Knowl. Inf. Syst., vol. 48, no. 1, pp. 201–228, 2016.
[29]
A. M. Romano and A. A. Hernandez, “Enhanced deep learning approach for predicting invasive ductal carcinoma from histopathology images,” in Proc. 2nd Int. Conf. Artif. Intell. Big Data, 2019, pp. 142–148.
[30]
J. W. Johnson, “Dectecting invasive ductal carcinoma with semi-supervised conditional GANs,” 2019,.
[31]
S. Li, S. Ju, G. Zhou, and X. Li, “Active learning for imbalanced sentiment classification,” in Proc. Joint Conf. Empir. Methods Natural Lang. Process. Comput. Natural Lang. Learn., 2012, pp. 139–148.
[32]
A. Sun, E.-P. Lim, and Y. Liu, “On strategies for imbalanced text classification using SVM: A comparative study,” Decis. Support Syst., vol. 48, no. 1, pp. 191–201, 2009.
[33]
T. Imam, K. M. Ting, and J. Kamruzzaman, “z-SVM: An SVM for improved classification of imbalanced data,” in Proc. Australas. Joint Conf. Artif. Intell., 2006, pp. 264–273.
[34]
Y.-H. Shao, W.-J. Chen, J.-J. Zhang, Z. Wang, and N.-Y. Deng, “An efficient weighted lagrangian twin support vector machine for imbalanced data classification,” Pattern Recognit., vol. 47, no. 9, pp. 3158–3167, 2014.
[35]
J. A. Sáez, J. Luengo, J. Stefanowski, and F. Herrera, “SMOTE–IPF: Addressing the noisy and borderline examples problem in imbalanced classification by a re-sampling method with filtering,” Inf. Sci., vol. 291, pp. 184–203, 2015.
[36]
Z.-Q. Zhao, “A novel modular neural network for imbalanced classification problems,” Pattern Recognit. Lett., vol. 30, no. 9, pp. 783–788, 2009.
[37]
Q. Zou, S. Xie, Z. Lin, M. Wu, and Y. Ju, “Finding the best classification threshold in imbalanced classification,” Big Data Res., vol. 5, pp. 2–8, 2016.
[38]
L. A. Jeni, J. F. Cohn, and F. De La Torre, “Facing imbalanced data–recommendations for the use of performance metrics,” in Proc. Humaine Assoc. Conf. Affect. Comput. Intell. Interact., 2013, pp. 245–251.
[39]
C. C. Chatterjee and G. Krishna, “A novel method for IDC prediction in breast cancer histopathology images using deep residual neural networks,” 2019,.
[40]
A. Sriram, S. Kalra, and H. R. Tizhoosh, “Projectron–A shallow and interpretable network for classifying medical images,” in Proc. Int. Joint Conf. Neural Netw., 2019, pp. 1–9.
[41]
M. J.-U. Rahman, R. I. Sultan, F. Mahmud, S. Al Ahsan, and A. Matin, “Automatic system for detecting invasive ductal carcinoma using convolutional neural networks,” in Proc. TENCON IEEE Region 10 Conf., 2018, pp. 673–678.
[42]
J. L. Wang, A. K. Ibrahim, H. Zhuang, A. M. Ali, A. Y. Li, and A. Wu, “A study on automatic detection of IDC breast cancer with convolutional neural networks,” in Proc. Int. Conf. Comput. Sci. Comput. Intell., 2018, pp. 703–708.
[43]
K. J. Cios, H. Mamitsuka, T. Nagashima, and R. Tadeusiewicz, “Computational intelligence in solving bioinformatics problems,” Artif. Intell. Medicine, vol. 35, no. 1, pp. 1–8, 2005. [Online]. Available: https://doi.org/10.1016/j.artmed.2005.07.001
[44]
A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ImageNet classification with deep convolutional neural networks,” in Proc. Int. Conf. Neural Inf. Process. Syst., 2012, pp. 1097–1105.
[45]
R. Yamashita, M. Nishio, R. K. G. Do, and K. Togashi, “Convolutional neural networks: An overview and application in radiology,” Insights Imag., vol. 9, no. 4, pp. 611–629, 2018.
[46]
G. Litjens, et al., “A survey on deep learning in medical image analysis,” Med. Image Anal., vol. 42, pp. 60–88, 2017.
[47]
S. Roy, R. Kumar, V. Mittal, and D. Gupta, “Classification models for invasive ductal carcinoma progression, based on gene expression data-trained supervised machine learning,” Scientific Rep., vol. 10, no. 1, pp. 1–15, 2020. [Online]. Available: https://doi.org/10.1038/s41598-020-60740-w
[48]
B. Gecer, S. Aksoy, E. Mercan, L. G. Shapiro, D. L. Weaver, and J. G. Elmore, “Detection and classification of cancer in whole slide breast histopathology images using deep convolutional networks,” Pattern Recognit., vol. 84, pp. 345–356, 2018.
[49]
E. Mercan, S. Mehta, J. Bartlett, L. G. Shapiro, D. L. Weaver, and J. G. Elmore, “Assessment of machine learning of breast pathology structures for automated differentiation of breast cancer and high-risk proliferative lesions,” JAMA Netw. Open, vol. 2, no. 8, pp. e198 777–e198 777, 2019.
[50]
A. Kumar, et al., “Deep feature learning for histopathological image classification of canine mammary tumors and human breast cancer,” Inf. Sci., vol. 508, pp. 405–421, 2020.
[51]
Shallu Mehra, “Breast cancer histology images classification: Training from scratch or transfer learning?” ICT Exp., vol. 4, no. 4, pp. 247–254, 2018.
[52]
J. N. Kather, et al., “Predicting survival from colorectal cancer histology slides using deep learning: A retrospective multicenter study,” PLoS Med., vol. 16, no. 1, 2019, Art. no.
[53]
K. Faust, et al., “Visualizing histopathologic deep learning classification and anomaly detection using nonlinear feature space dimensionality reduction,” BMC Bioinformatics, vol. 19, no. 1, 2018, Art. no.
[54]
P. Xie, K. Zuo, Y. Zhang, F. Li, M. Yin, and K. Lu, “Interpretable classification from skin cancer histology slides using deep learning: A retrospective multicenter study,” 2019,.
[55]
M. Mateen, et al., “Fundus image classification using VGG-19 architecture with PCA and SVD,” Symmetry, vol. 11, no. 1, 2019, Art. no.
[56]
Y. Gao and K. M. Mosalam, “Deep transfer learning for image-based structural damage recognition,” Comput.-Aided Civil Infrastructure Eng., vol. 33, no. 9, pp. 748–768, 2018.
[57]
X. Liu, M. Chi, Y. Zhang, and Y. Qin, “Classifying high resolution remote sensing images by fine-tuned VGG deep networks,” in Proc. IEEE Int. Geosci. Remote Sens. Symp., 2018, pp. 7137–7140.
[58]
S. J. Pan and Q. Yang, “A survey on transfer learning,” IEEE Trans. Knowl. Data Eng., vol. 22, no. 10, pp. 1345–1359, Oct. 2010.
[59]
S. S. Udmale, S. K. Singh, R. Singh, and A. K. Sangaiah, “Multi-fault bearing classification using sensors and ConvNet-based transfer learning approach,” IEEE Sensors J., vol. 20, no. 3, pp. 1433–1444, Feb. 2020.
[60]
A. Canziani, A. Paszke, and E. Culurciello, “An analysis of deep neural network models for practical applications,” 2016,.
[61]
V. Dumoulin, J. Shlens, and M. Kudlur, “A learned representation for artistic style,” 2016,.
[62]
Y. Saeys, T. Abeel, and Y. Van de Peer, “Robust feature selection using ensemble feature selection techniques,” in Proc. Joint Eur. Conf. Mach. Learn. Knowl. Discov. Databases, 2008, pp. 313–325.
[63]
M. Long, J. Wang, G. Ding, D. Shen, and Q. Yang, “Transfer learning with graph co-regularization,” IEEE Trans. Knowl. Data Eng., vol. 26, no. 7, pp. 1805–1818, Jul. 2014.
[64]
L. Jie, T. Tommasi, and B. Caputo, “Multiclass transfer learning from unconstrained priors,” in Proc. Int. Conf. Comput. Vis., 2011, pp. 1863–1870.
[65]
D. H. Wolpert and W. G. Macready, “No free lunch theorems for optimization,” IEEE Trans. Evol. Comput., vol. 1, no. 1, pp. 67–82, Apr. 1997.

Cited By

View all
  • (2024)Addressing Data Imbalance via Image Augmentation for Automated Quality Inspection in Steel ProductionProceedings of the 2024 10th International Conference on Computer Technology Applications10.1145/3674558.3674583(174-181)Online publication date: 15-May-2024
  • (2024)Computational Techniques in PET/CT Image Processing for Breast Cancer: A Systematic Mapping ReviewACM Computing Surveys10.1145/364835956:8(1-38)Online publication date: 26-Apr-2024
  • (2024)SLIDE-Net: A Sequential Modeling Approach With Adaptive Fuzzy C-Mean Empowered Data Balancing Policy for IDC DetectionIEEE Transactions on Fuzzy Systems10.1109/TFUZZ.2024.337353432:10(5557-5570)Online publication date: 1-Oct-2024
  • Show More Cited By

Index Terms

  1. Imbalanced Breast Cancer Classification Using Transfer Learning
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image IEEE/ACM Transactions on Computational Biology and Bioinformatics
      IEEE/ACM Transactions on Computational Biology and Bioinformatics  Volume 18, Issue 1
      Jan.-Feb. 2021
      395 pages

      Publisher

      IEEE Computer Society Press

      Washington, DC, United States

      Publication History

      Published: 03 February 2021
      Published in TCBB Volume 18, Issue 1

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)19
      • Downloads (Last 6 weeks)3
      Reflects downloads up to 04 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Addressing Data Imbalance via Image Augmentation for Automated Quality Inspection in Steel ProductionProceedings of the 2024 10th International Conference on Computer Technology Applications10.1145/3674558.3674583(174-181)Online publication date: 15-May-2024
      • (2024)Computational Techniques in PET/CT Image Processing for Breast Cancer: A Systematic Mapping ReviewACM Computing Surveys10.1145/364835956:8(1-38)Online publication date: 26-Apr-2024
      • (2024)SLIDE-Net: A Sequential Modeling Approach With Adaptive Fuzzy C-Mean Empowered Data Balancing Policy for IDC DetectionIEEE Transactions on Fuzzy Systems10.1109/TFUZZ.2024.337353432:10(5557-5570)Online publication date: 1-Oct-2024
      • (2023)GC-EnCComputers in Biology and Medicine10.1016/j.compbiomed.2022.106329152:COnline publication date: 1-Jan-2023
      • (2023)Optimized Bayesian convolutional neural networks for invasive breast cancer diagnosis system▪Applied Soft Computing10.1016/j.asoc.2023.110810147:COnline publication date: 1-Nov-2023
      • (2022)A deep learning assisted image-guided framework for differentiation among tumors and hemorrhages in head imagingComputers and Electrical Engineering10.1016/j.compeleceng.2022.108330103:COnline publication date: 1-Oct-2022
      • (2022)EMOCGAN: a novel evolutionary multiobjective cyclic generative adversarial network and its application to unpaired image translationNeural Computing and Applications10.1007/s00521-021-05975-y34:24(21433-21447)Online publication date: 1-Dec-2022
      • (2021)A Novel Cloud-Assisted Secure Deep Feature Classification Framework for Cancer Histopathology ImagesACM Transactions on Internet Technology10.1145/342422121:2(1-22)Online publication date: 2-Jun-2021
      • (undefined)Cognitive Hybrid Deep Learning-based Multi-modal Sentiment Analysis for Online Product ReviewsACM Transactions on Asian and Low-Resource Language Information Processing10.1145/3615356

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media