Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Computational prediction of human proteins that can be secreted into the bloodstream

Published: 15 October 2008 Publication History

Abstract

We present a novel computational method for predicting which proteins from highly and abnormally expressed genes in diseased human tissues, such as cancers, can be secreted into the bloodstream, suggesting possible marker proteins for follow-up serum proteomic studies. A main challenging issue in tackling this problem is that our understanding about the downstream localization after proteins are secreted outside the cells is very limited and not sufficient to provide useful hints about secretion to the bloodstream. To bypass this difficulty, we have taken a data mining approach by first collecting, through extensive literature searches, human proteins that are known to be secreted into the bloodstream due to various pathological conditions as detected by previous proteomic studies, and then asking the question: ‘what do these secreted proteins have in common in terms of their physical and chemical properties, amino acid sequence and structural features that can be used to predict them__ __’ We have identified a list of features, such as signal peptides, transmembrane domains, glycosylation sites, disordered regions, secondary structural content, hydrophobicity and polarity measures that show relevance to protein secretion. Using these features, we have trained a support vector machine-based classifier to predict protein secretion to the bloodstream. On a large test set containing 98 secretory proteins and 6601 non-secretory proteins of human, our classifier achieved ~90% prediction sensitivity and ~98% prediction specificity. Several additional datasets are used to further assess the performance of our classifier. On a set of 122 proteins that were found to be of abnormally high abundance in human blood due to various cancers, our program predicted 62 as blood-secreted proteins. By applying our program to abnormally highly expressed genes in gastric cancer and lung cancer tissues detected through microarray gene expression studies, we predicted 13 and 31 as blood secreted, respectively, suggesting that they could serve as potential biomarkers for these two cancers, respectively. Our study demonstrated that our method can provide highly useful information to link genomic and proteomic studies for disease biomarker discovery. Our software can be accessed at http://csbl1.bmb.uga.edu/cgi-bin/Secretion/secretion.cgi.
Supplementary information: Supplementary data are available at Bioinformatics online.

Cited By

View all
  • (2023)Spike2CGR: an efficient method for spike sequence classification using chaos game representationMachine Language10.1007/s10994-023-06371-4112:10(3633-3658)Online publication date: 28-Aug-2023
  • (2021)DeepHBSP: A Deep Learning Framework for Predicting Human Blood-Secretory Proteins Using Transfer LearningJournal of Computer Science and Technology10.1007/s11390-021-0851-936:2(234-247)Online publication date: 1-Apr-2021
  • (2019)Identification and Functional Inference for Tumor-Associated Long Non-Coding RNAIEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)10.1109/TCBB.2017.268744216:4(1288-1301)Online publication date: 10-Dec-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Bioinformatics
Bioinformatics  Volume 24, Issue 20
October 2008
145 pages

Publisher

Oxford University Press, Inc.

United States

Publication History

Published: 15 October 2008

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Spike2CGR: an efficient method for spike sequence classification using chaos game representationMachine Language10.1007/s10994-023-06371-4112:10(3633-3658)Online publication date: 28-Aug-2023
  • (2021)DeepHBSP: A Deep Learning Framework for Predicting Human Blood-Secretory Proteins Using Transfer LearningJournal of Computer Science and Technology10.1007/s11390-021-0851-936:2(234-247)Online publication date: 1-Apr-2021
  • (2019)Identification and Functional Inference for Tumor-Associated Long Non-Coding RNAIEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)10.1109/TCBB.2017.268744216:4(1288-1301)Online publication date: 10-Dec-2019
  • (2016)PUEPro: A Computational Pipeline for Prediction of Urine Excretory ProteinsAdvanced Data Mining and Applications10.1007/978-3-319-49586-6_51(714-725)Online publication date: 12-Dec-2016

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media