research-article

Identification of structurally important amino acids in proteins by graph-theoretic measures

Authors:

Tammy M. K. Cheng,

Pietro LióAuthors Info & Claims

StReBio '09: Proceedings of the KDD-09 Workshop on Statistical and Relational Learning in Bioinformatics

Pages 7 - 11

https://doi.org/10.1145/1562090.1562092

Published: 28 June 2009 Publication History

Abstract

Identifying key residues important for maintaining a protein structure is a non-trivial problem in Computational Biology. In this paper, we present results based on a graph model representing protein structures. This model considers the structure as residue-residue interactions in order to capture protein stability. We propose the application of approximate minimum vertex cover algorithms (MVC) as a novel approach for identifying the structurally important residues, which we shall refer to as key residues. We establish that MVC based algorithms captures the essence of protein structural stability by correlation analysis with ΔΔG, the change of protein free energies due to amino acid variations. We also benchmark our approach with popular approaches for analyzing large complex networks --- betweenness, and Eigenvector centrality. Our findings are such that they do not correlate well with ΔΔG. We give explanations from the free energy point of view, which shall benefit future development measures for protein structure stability.

References

[1]

Hamosh, A., Scott, A. F., Amberger, J., Bocchini, C., Valle, D. and McKusick, V. A. 2002. Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res 30, 52--55.

[2]

Yue, P., Li, Z. and Moult, J. 2005. Loss of protein structure stability as a major causative factor in monogenic disease. J Mol Biol 353, 459--473.

[3]

Sunyaev, S., Ramensky, V. and Bork, P. 2000. Towards a structural basis of human non-synonymous single nucleotide polymorphisms. Trends Genet 16, 198--200.

[4]

Wang, Z. and Moult, J. 2001. SNPs, protein structure, and disease. Hum Mutat 17, 263--270.

[5]

Vendruscolo, M., Dokholyan, N. V., Paci, E. and Karplus, M. 2002. Small-world view of the amino acids that play a key role in protein folding. Physical Review E65, 061910.

[6]

del Sol, A., Fujihashi, H., Amoros, D. and Nussinov, R. 2006. Residues crucial for maintaining short paths in network communication mediate signaling in proteins. Mol Syst Biol. 2, 2006.0019.

[7]

del Sol, A., Araúzo-Bravo, M. J., Amoros, D. and Nussinov, R. 2007. Modular architecture of protein structures and allosteric communications: potential implications for signaling proteins and regulatory linkages. Genome Biol 8, R92.

[8]

Greene, L. and Higman, V. 2003. Uncovering network systems within protein structures. J Mol Biol 334, 781--791.

[9]

Olivier, M., Eeles, R., Hollstein, M., Khan, M. A., Harris, C. C. and Hainaut, P. 2002. The IARC TP53 database: new online mutation analysis and recommendations to users. Hum Mutat. 19, 607--614.

[10]

Freeman, L. C. 1977. A set of measures of centrality based on betweenness. Sociometry 40, 35--41.

[11]

Brin, S. and Page, L. 1998. The anatomy of a large-scale hypertextual Web search engine. Proceedings of the seventh international conference on World Wide Web 7, 107--117.

Digital Library

[12]

Kleinberg, J. M. 1999. Authoritative sources in a hyperlinked environment. J. ACM 46, 5.

Digital Library

[13]

McDonald, I. K. and Thornton, J. M. 1994. Satisfying Hydrogen Bonding Potential in Proteins. J Mol Biol 238, 777--793.

[14]

Gore, P. S., Burke, F. D. and Blundell, T. L. 2005. PROVAT: a tool for Voronoi tessellation analysis of protein structures and complexes. Bioinformatics 21, 3316--3317.

Digital Library

[15]

Capriotti, E., Fariselli, P. and Casadio, R. 2005. I-Mutant2.0: predicting stability changes upon mutation from the protein sequence or structure. Nucleic Acids Res 33, W306--310.

[16]

Böde, C., Kovács, I. A., Szalay, M. S., Palotai, R., Korcsmáros, T. and Csermely, P. 2007. Network analysis of protein dynamics. FEBS Lett. 581, 2776--2782.

Recommendations

Halogen bonding in complexes of proteins and non-natural amino acids

We have analyzed the influence of halogen bonding to the stability of complexes of proteins and non-natural amino acids.We employed conservation pattern, structural motif, secondary structure, and solvent accessibility calculation.The roles of key ...
Identification of β-barrel membrane proteins based on amino acid composition properties and predicted secondary structure

Unlike all-helices membrane proteins, @b-barrel membrane proteins can not be successfully discriminated from other proteins, especially from all-@b soluble proteins. This paper performs an analysis on the amino acid composition in membrane parts of 12 @...
Sequence Based Prediction of Protein Mutant Stability and Discrimination of Thermophilic Proteins
PRIB '08: Proceedings of the Third IAPR International Conference on Pattern Recognition in Bioinformatics

Prediction of protein stability upon amino acid substitution and discrimination of thermophilic proteins from mesophilic ones are important problems in designing stable proteins. We have developed a classification rule generator using the information ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

StReBio '09: Proceedings of the KDD-09 Workshop on Statistical and Relational Learning in Bioinformatics

June 2009

40 pages

ISBN:9781605586670

DOI:10.1145/1562090

Copyright © 2009 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 June 2009

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Seventh Framework Programme

Conference

KDD09

Sponsor:

KDD09: The 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

June 28, 2009

Paris, France

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
193
Total Downloads

Downloads (Last 12 months)1
Downloads (Last 6 weeks)0

Reflects downloads up to 09 Nov 2024

Other Metrics

View Author Metrics

Citations

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents