Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3449301.3449316acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicraiConference Proceedingsconference-collections
research-article

Classification of Liver Cancer Subtypes Based on Hierarchical Integrated Stacked Autoencoder

Published: 09 June 2021 Publication History

Abstract

The development of high-throughput sequencing technology provides an opportunity to obtain multi-omics data for liver cancer,However,omics data often comes from different platforms and has different attributes, it has the characteristics of high feature dimension and small sample size. This will increase the overfitting of the model and the imbalance of categories,and the cross-platform integration analysis of omics data will challenge the traditional data analysis methods. In this regard, the Hierarchical Integrated Stacked Encoder (HI-SAE) is proposed.which can achieve deeper feature learning and data integration while reducing the differences caused by the characteristics of the data itself. Finally,the integrated feature expression is used to identify the subtype of liver cancer by softmax classifier. Experiments show that the classification accuracy when using Hi-SAE method for feature learning is 3.7% higher than that when using PCA, and 7.6% higher than that when using NMF.

References

[1]
Siegel RL, Miller KD, Jemal A. Cancer statistics, 2016. CA Cancer J Clin. 2016;66(1):7–30.
[2]
Liu G, Dong C, Liu L . Integrated Multiple "-omics" Data Reveal Subtypes of Hepatocellular Carcinoma[J]. Plos One, 2016, 11(11):e0165457.
[3]
Bailey P, Chang D K, Nones K, Genomic analyses identify molecular subtypes of pancreatic cancer[J]. Nature, 2016, 531(7592): 47.
[4]
Lee JS, Chu IS, Heo J, Calvisi DF, Sun Z, Roskams T, Classification and prediction of survival in hepatocellular carcinoma by gene expression profiling. Hepatology. 2004;40(3):667–76.
[5]
Hoshida Y, Nijman SM, Kobayashi M, Chan JA, Brunet JP, Chiang DY, Integrative transcriptome analysis reveals common molecular subclasses of human hepatocellular carcinoma. Cancer Res. 2009;69(18):7385–92.
[6]
Song MA, Tiirikainen M, Kwee S, Okimoto G, Yu H, Wong LL. Elucidating the landscape of aberrant DNA methylation in hepatocellular carcinoma. PLoS One. 2013;8(2):e55761.
[7]
Guo Y, Zheng J, Shang X, Li Z. A similarity regression fusion model for integrating multi-omics data to identify cancer subtypes. Genes. 2018; 9(7):314.
[8]
Yehudit Hasin,Marcus Seldin,Aldons Lusis. Multi-omics approaches to disease[J]. Genome Biology,2017,18(1).
[9]
Rappoport Nimrod,Shamir Ron. Multi-omic and multi-view clustering algorithms: review and cancer benchmark.[J]. Nucleic acids research,2019,47(2).
[10]
Liu G, Dong C, Liu L . Integrated Multiple "-omics" Data Reveal Subtypes of Hepatocellular Carcinoma[J]. Plos One, 2016, 11(11):e0165457.
[11]
Kaur H, Bhalla S, Raghava GPS. Classification of early and late stage liver hepatocellular carcinoma patients from their genomics and epigenomics profiles. PLoS One. 2019;14(9) e0221476.
[12]
Abeer Alzubaidi,Jonathan Tepper,Ahmad Lotfi. A novel deep mining model for effective knowledge discovery from omics data[J]. Artificial Intelligence In Medicine,2020,104.
[13]
Ng A, Sparse autoencoder. CS294A Lect Notes. 2011; 72(2011):1–19.
[14]
Xu J, Xiang L, Liu Q, Stacked Sparse Autoencoder (SSAE) for Nuclei Detection on Breast Cancer Histopathology Images[J]. Medical Imaging IEEE Transactions on, 2016, 35(1):119-130.
[15]
Vincent P, Larochelle H, Lajoie I, Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion[J]. Journal of Machine Learning Research, 2010, 11(12):3371-3408.
[16]
Xu, J., Wu, P., Chen, Y. et al. A hierarchical integration deep flexible neural forest framework for cancer subtype classification by integrating multi-omics data. BMC Bioinformatics 20, 527 (2019)
[17]
Guo Y, Shang X, Li Z . Identification of cancer subtypes by integrating multiple types of transcriptomics data with deep learning in breast cancer[J]. Neurocomputing, 2019, 324(JAN.9):20-30.
[18]
Zhu Y, Qiu P, Ji Y . TCGA-Assembler: open-source software for retrieving and processing TCGA data[J]. Nature Methods, 2014, 11(6):599-600.
[19]
Bo Wang,Aziz M Mezlini,Feyyaz Demir,Marc Fiume,Zhuowen Tu,Michael Brudno,Benjamin Haibe-Kains,Anna Goldenberg. Similarity network fusion for aggregating data types on a genomic scale[J]. Nature Methods: Techniques for life scientists and chemists,2014,11(3).
[20]
Xiang Q, Dai X, Deng Y, He C, Wang J, Feng J, Missing value imputation for microarray gene expression data using histone acetylation information. BMC Bioinformatics. 2008;9:252.
[21]
Chaudhary K, Poirion O B, Lu L, Deep Learning-Based Multi-Omics IntegrationRobustly Predicts Survival in Liver Cancer[J].Clinical Cancer Research, 2017, 24(6):

Index Terms

  1. Classification of Liver Cancer Subtypes Based on Hierarchical Integrated Stacked Autoencoder
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        ICRAI '20: Proceedings of the 6th International Conference on Robotics and Artificial Intelligence
        November 2020
        288 pages
        ISBN:9781450388597
        DOI:10.1145/3449301
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 09 June 2021

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. data dimensionality reduction
        2. liver cancer subtypes
        3. multi-omics
        4. tacked autoencoders

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Conference

        ICRAI 2020

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 43
          Total Downloads
        • Downloads (Last 12 months)5
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 01 Nov 2024

        Other Metrics

        Citations

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format.

        HTML Format

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media