Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3587716.3587774acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlcConference Proceedingsconference-collections
research-article

Generative and contrastive based self-supervised learning model for histopathology image analysis

Published: 07 September 2023 Publication History
  • Get Citation Alerts
  • Abstract

    In recent years, the deep neural network has achieved astonishing results in the automatic Whole slide images(WSI) processing. The current mainstream approach is inseparable from a large number of manual annotations. However, labeling such giant images with billions of pixels is very labor-intensive. So the shortage of annotation has become a bottleneck in developing Whole slide image diagnostic models. Therefore, we propose a new self-supervised learning(SSL) network to solve the problem of insufficient annotation. In our work, massive semantic information can be extracted from a large number of WSI, which significantly gets rid of our dependence on the label. At the same time, the results are further refined by a silhouette-coefficient-based recursive Spectral Clustering Bipartition, which significantly improved the classification accuracy. Moreover, our framework is highly transferable and can take on many downstream tasks in pathology. Our final results are verified on the NCT-CRC-100K and MSHIR datasets. Our code is available at https://github.com/Hongbo-Chu/generative-contrastive

    References

    [1]
    Hangbo Bao, Li Dong, and Furu Wei. 2021. Beit: Bert pre-training of image transformers. arXiv preprint arXiv:2106.08254 (2021).
    [2]
    Mathilde Caron, Piotr Bojanowski, Armand Joulin, and Matthijs Douze. 2018. Deep clustering for unsupervised learning of visual features. In Proceedings of the European conference on computer vision (ECCV). 132–149.
    [3]
    Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. 2020. A simple framework for contrastive learning of visual representations. In International conference on machine learning. PMLR, 1597–1607.
    [4]
    Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition. Ieee, 248–255.
    [5]
    Laurent Dinh, David Krueger, and Yoshua Bengio. 2014. Nice: Non-linear independent components estimation. arXiv preprint arXiv:1410.8516 (2014).
    [6]
    Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020).
    [7]
    Jean-Bastien Grill, Florian Strub, Florent Altché, Corentin Tallec, Pierre Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila Pires, Zhaohan Guo, Mohammad Gheshlaghi Azar, 2020. Bootstrap your own latent-a new approach to self-supervised learning. Advances in neural information processing systems 33 (2020), 21271–21284.
    [8]
    Stanley R Hamilton, Lauri A Aaltonen, 2000. Pathology and genetics of tumours of the digestive system. Vol. 2. IARC press Lyon:.
    [9]
    Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, and Ross Girshick. 2022. Masked autoencoders are scalable vision learners. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 16000–16009.
    [10]
    Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross Girshick. 2020. Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 9729–9738.
    [11]
    Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.
    [12]
    Yordanka Karayaneva and Diana Hintea. 2018. Object Recognition in Python and MNIST Dataset Modification and Recognition with Five Machine Learning Classifiers. Journal of Image and Graphics 6, 1 (2018), 10–20.
    [13]
    Jakob Nikolas Kather, Niels Halama, and Alexander Marx. 2018. 100,000 histological images of human colorectal cancer and healthy tissue. https://doi.org/10.5281/zenodo.1214456
    [14]
    Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
    [15]
    Diederik P Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 (2013).
    [16]
    Yunfan Li, Peng Hu, Zitao Liu, Dezhong Peng, Joey Tianyi Zhou, and Xi Peng. 2021. Contrastive clustering. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 8547–8555.
    [17]
    Meng-Jia Lian, Chih-Ling Huang, and Tzer-Min Lee. 2018. Automation characterization for oral cancer by pathological image processing with gray-level co-occurrence matrix. In 5th International Conference on Mechanics and Mechatronics Research, Tokyo.
    [18]
    Pengfei Liu, Weizhe Yuan, Jinlan Fu, Zhengbao Jiang, Hiroaki Hayashi, and Graham Neubig. 2021. Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. arXiv preprint arXiv:2107.13586 (2021).
    [19]
    Xiao Liu, Fanjin Zhang, Zhenyu Hou, Li Mian, Zhaoyu Wang, Jing Zhang, and Jie Tang. 2021. Self-supervised learning: Generative or contrastive. IEEE Transactions on Knowledge and Data Engineering (2021).
    [20]
    Stuart Lloyd. 1982. Least squares quantization in PCM. IEEE transactions on information theory 28, 2 (1982), 129–137.
    [21]
    Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, 2019. Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems 32 (2019).
    [22]
    Fabian Pedregosa, Gaël Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, 2011. Scikit-learn: Machine learning in Python. the Journal of machine Learning research 12 (2011), 2825–2830.
    [23]
    Maithra Raghu, Thomas Unterthiner, Simon Kornblith, Chiyuan Zhang, and Alexey Dosovitskiy. 2021. Do vision transformers see like convolutional neural networks?Advances in Neural Information Processing Systems 34 (2021), 12116–12128.
    [24]
    Peter J Rousseeuw. 1987. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of computational and applied mathematics 20 (1987), 53–65.
    [25]
    David Tellez, Geert Litjens, Jeroen van der Laak, and Francesco Ciompi. 2019. Neural image compression for gigapixel histopathology image analysis. IEEE transactions on pattern analysis and machine intelligence 43, 2 (2019), 567–578.
    [26]
    Nguyen Trieu and Nguyen Truong Thinh. 2022. A Study of Combining KNN and ANN for Classifying Dragon Fruits Automatically. Journal of Image and Graphics 10 (01 2022). https://doi.org/10.18178/joig.10.1.28-35
    [27]
    Aäron Van Den Oord, Nal Kalchbrenner, and Koray Kavukcuoglu. 2016. Pixel recurrent neural networks. In International conference on machine learning. PMLR, 1747–1756.
    [28]
    Laurens Van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE.Journal of machine learning research 9, 11 (2008).
    [29]
    Ulrike Von Luxburg. 2007. A tutorial on spectral clustering. Statistics and computing 17, 4 (2007), 395–416.
    [30]
    Xi Wang, Hao Chen, Caixia Gan, Huangjing Lin, Qi Dou, Efstratios Tsougenis, Qitao Huang, Muyan Cai, and Pheng-Ann Heng. 2019. Weakly supervised deep learning for whole slide lung cancer image analysis. IEEE transactions on cybernetics 50, 9 (2019), 3950–3962.
    [31]
    Yen-Ju Wu, Chun-Ming Tsai, and Frank Shih. 2016. Improving leaf classification rate via background removal and ROI extraction. Journal of Image and Graphics 4, 2 (2016), 93–98.
    [32]
    Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Jianmin Bao, Zhuliang Yao, Qi Dai, and Han Hu. 2022. Simmim: A simple framework for masked image modeling. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 9653–9663.
    [33]
    Jiangpeng Yan, Hanbo Chen, Xiu Li, and Jianhua Yao. 2022. Deep contrastive learning based tissue clustering for annotation-free histopathology image analysis. Computerized Medical Imaging and Graphics 97 (2022), 102053.

    Index Terms

    1. Generative and contrastive based self-supervised learning model for histopathology image analysis

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      ICMLC '23: Proceedings of the 2023 15th International Conference on Machine Learning and Computing
      February 2023
      619 pages
      ISBN:9781450398411
      DOI:10.1145/3587716
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 07 September 2023

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Computer vision
      2. Pathological image
      3. Self-supervised learning

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Conference

      ICMLC 2023

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 51
        Total Downloads
      • Downloads (Last 12 months)51
      • Downloads (Last 6 weeks)2

      Other Metrics

      Citations

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media