Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Dimensionality Reduction of Protein Mass Spectrometry Data Using Random Projection

  • Conference paper
Neural Information Processing (ICONIP 2006)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4233))

Included in the following conference series:

  • 972 Accesses

Abstract

Protein mass spectrometry (MS) pattern recognition has recently emerged as a new method for cancer diagnosis. Unfortunately, classification performance may degrade owing to the enormously high dimensionality of the data. This paper investigates the use of Random Projection in protein MS data dimensionality reduction. The effectiveness of Random Projection (RP) is analyzed and compared against Principal Component Analysis (PCA) by using three classification algorithms, namely Support Vector Machine, Feed-forward Neural Networks and K-Nearest Neighbour. Three real-world cancer data sets are employed to evaluate the performances of RP and PCA. Through the investigations, RP method demonstrated better or at least comparable classification performance as PCA if the dimensionality of the projection matrix is sufficiently large. This paper also explores the use of RP as a pre-processing step prior to PCA. The results show that without sacrificing classification accuracy, performing RP prior to PCA significantly improves the computational time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Perkins, G.L., et al.: Serum Tumor Markers. American Family Physician 68(6), 1075–1082 (2003)

    Google Scholar 

  2. Petricon, E.F., et al.: Use of Proteomic Patterns in Serum to Identify Ovarian Cancer. The Lancet 359, 572–577 (2002)

    Article  Google Scholar 

  3. Dasgupta, S.: Experiments with Random Projections. In: Proc. 16th Conf. Uncertainty in Artificial Intelligence (2000)

    Google Scholar 

  4. Bingham, E., Mannila, H.: Random Projection in Dimensionality Reduction Application to Image and Text Data. Knowledge Discovery and Data Mining, pp. 245–250 (2001)

    Google Scholar 

  5. Levner, I.: Feature Selection and Nearest Centroid Classification for Protein Mass Spectrometry. Bioinformatics 6(68) (2005)

    Google Scholar 

  6. Lilien, R.H., Farid, H., Donald, B.R.: Probabilistic Disease Classification of Expression-Dependent Proteomic Data from Mass Spectrometry of Human Serum. J. of Computational Biology 10(6), 925–946 (2003)

    Article  Google Scholar 

  7. Shen, L., Tan, E.C.: Dimension Reduction-Based Penalized Logistic Regression for Cancer Classification using Microarray Data. IEEE/ACM Trans. on Computational Biology and Bioinformatics 2(2), 166–174 (2005)

    Article  MathSciNet  Google Scholar 

  8. Purohit, P.V., Rocke, D.M.: Discriminant Models for High-Throughput Proteomics Mass Spectrometer Data. Proteomics 3, 1699–1703 (2003)

    Article  Google Scholar 

  9. Vempala, S.S.: The Random Projection Method, vol. 65. American Mathematical Society, Providence, RI (2004)

    MATH  Google Scholar 

  10. Achlioptas, D.: Database-Friendly Random Projections. In: Symposium on Principles of Database Systems, pp. 274–281 (2001)

    Google Scholar 

  11. Clinical Proteomics Program Databank, National Cancer Institute: http://home.ccr.cancer.gov/ncifdaproteomics/ppatterns.asp

  12. Conrads, T.P., et al.: High-Resolution Serum Proteomic Features for Ovarian Cancer Detection. Endocrine-Related Cancer 11, 163–178 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Loy, C.C., Lai, W.K., Lim, C.P. (2006). Dimensionality Reduction of Protein Mass Spectrometry Data Using Random Projection. In: King, I., Wang, J., Chan, LW., Wang, D. (eds) Neural Information Processing. ICONIP 2006. Lecture Notes in Computer Science, vol 4233. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11893257_86

Download citation

  • DOI: https://doi.org/10.1007/11893257_86

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-46481-5

  • Online ISBN: 978-3-540-46482-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics