Dimensionality Reduction of Protein Mass Spectrometry Data Using Random Projection

Loy, Chen Change; Lai, Weng Kin; Lim, Chee Peng

doi:10.1007/11893257_86

Chen Change Loy²⁰,
Weng Kin Lai²⁰ &
Chee Peng Lim²¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4233))

Included in the following conference series:

International Conference on Neural Information Processing

972 Accesses

Abstract

Protein mass spectrometry (MS) pattern recognition has recently emerged as a new method for cancer diagnosis. Unfortunately, classification performance may degrade owing to the enormously high dimensionality of the data. This paper investigates the use of Random Projection in protein MS data dimensionality reduction. The effectiveness of Random Projection (RP) is analyzed and compared against Principal Component Analysis (PCA) by using three classification algorithms, namely Support Vector Machine, Feed-forward Neural Networks and K-Nearest Neighbour. Three real-world cancer data sets are employed to evaluate the performances of RP and PCA. Through the investigations, RP method demonstrated better or at least comparable classification performance as PCA if the dimensionality of the projection matrix is sufficiently large. This paper also explores the use of RP as a pre-processing step prior to PCA. The results show that without sacrificing classification accuracy, performing RP prior to PCA significantly improves the computational time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Feature Selection and Machine Learning with Mass Spectrometry Data

On Eigen-matrix translation method for classification of biological data

Article 30 July 2015

Protein Folding Recognition

References

Perkins, G.L., et al.: Serum Tumor Markers. American Family Physician 68(6), 1075–1082 (2003)
Google Scholar
Petricon, E.F., et al.: Use of Proteomic Patterns in Serum to Identify Ovarian Cancer. The Lancet 359, 572–577 (2002)
Article Google Scholar
Dasgupta, S.: Experiments with Random Projections. In: Proc. 16th Conf. Uncertainty in Artificial Intelligence (2000)
Google Scholar
Bingham, E., Mannila, H.: Random Projection in Dimensionality Reduction Application to Image and Text Data. Knowledge Discovery and Data Mining, pp. 245–250 (2001)
Google Scholar
Levner, I.: Feature Selection and Nearest Centroid Classification for Protein Mass Spectrometry. Bioinformatics 6(68) (2005)
Google Scholar
Lilien, R.H., Farid, H., Donald, B.R.: Probabilistic Disease Classification of Expression-Dependent Proteomic Data from Mass Spectrometry of Human Serum. J. of Computational Biology 10(6), 925–946 (2003)
Article Google Scholar
Shen, L., Tan, E.C.: Dimension Reduction-Based Penalized Logistic Regression for Cancer Classification using Microarray Data. IEEE/ACM Trans. on Computational Biology and Bioinformatics 2(2), 166–174 (2005)
Article MathSciNet Google Scholar
Purohit, P.V., Rocke, D.M.: Discriminant Models for High-Throughput Proteomics Mass Spectrometer Data. Proteomics 3, 1699–1703 (2003)
Article Google Scholar
Vempala, S.S.: The Random Projection Method, vol. 65. American Mathematical Society, Providence, RI (2004)
MATH Google Scholar
Achlioptas, D.: Database-Friendly Random Projections. In: Symposium on Principles of Database Systems, pp. 274–281 (2001)
Google Scholar
Clinical Proteomics Program Databank, National Cancer Institute: http://home.ccr.cancer.gov/ncifdaproteomics/ppatterns.asp
Conrads, T.P., et al.: High-Resolution Serum Proteomic Features for Ovarian Cancer Detection. Endocrine-Related Cancer 11, 163–178 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Grid Computing and Bioinformatics Lab, MIMOS Berhad, 57000, Kuala Lumpur, Malaysia
Chen Change Loy & Weng Kin Lai
School of Electrical & Electronic Engineering, University of Science Malaysia, Engineering Campus, 14300, Nibong Tebal, Penang, Malaysia
Chee Peng Lim

Authors

Chen Change Loy
View author publications
You can also search for this author in PubMed Google Scholar
Weng Kin Lai
View author publications
You can also search for this author in PubMed Google Scholar
Chee Peng Lim
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of Computer Science and Engineering, The Chinese Univ. of Hong Kong, Shatin, N.T., Hong Kong
Irwin King
Department of Mechanical and Automation Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
Jun Wang
The Chinese University of Hong Kong, Shatin, N.T., Hong Kong
Lai-Wan Chan
Department of Computer Science and Engineering & Center for Cognitive Science, The Ohio State University, OH 43210, Columbus
DeLiang Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Loy, C.C., Lai, W.K., Lim, C.P. (2006). Dimensionality Reduction of Protein Mass Spectrometry Data Using Random Projection. In: King, I., Wang, J., Chan, LW., Wang, D. (eds) Neural Information Processing. ICONIP 2006. Lecture Notes in Computer Science, vol 4233. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11893257_86

Download citation

DOI: https://doi.org/10.1007/11893257_86
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-46481-5
Online ISBN: 978-3-540-46482-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Dimensionality Reduction of Protein Mass Spectrometry Data Using Random Projection

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Feature Selection and Machine Learning with Mass Spectrometry Data

On Eigen-matrix translation method for classification of biological data

Protein Folding Recognition

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Dimensionality Reduction of Protein Mass Spectrometry Data Using Random Projection

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Feature Selection and Machine Learning with Mass Spectrometry Data

On Eigen-matrix translation method for classification of biological data

Protein Folding Recognition

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation