Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1562090.1562092acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Identification of structurally important amino acids in proteins by graph-theoretic measures

Published: 28 June 2009 Publication History

Abstract

Identifying key residues important for maintaining a protein structure is a non-trivial problem in Computational Biology. In this paper, we present results based on a graph model representing protein structures. This model considers the structure as residue-residue interactions in order to capture protein stability. We propose the application of approximate minimum vertex cover algorithms (MVC) as a novel approach for identifying the structurally important residues, which we shall refer to as key residues. We establish that MVC based algorithms captures the essence of protein structural stability by correlation analysis with ΔΔG, the change of protein free energies due to amino acid variations. We also benchmark our approach with popular approaches for analyzing large complex networks --- betweenness, and Eigenvector centrality. Our findings are such that they do not correlate well with ΔΔG. We give explanations from the free energy point of view, which shall benefit future development measures for protein structure stability.

References

[1]
Hamosh, A., Scott, A. F., Amberger, J., Bocchini, C., Valle, D. and McKusick, V. A. 2002. Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res 30, 52--55.
[2]
Yue, P., Li, Z. and Moult, J. 2005. Loss of protein structure stability as a major causative factor in monogenic disease. J Mol Biol 353, 459--473.
[3]
Sunyaev, S., Ramensky, V. and Bork, P. 2000. Towards a structural basis of human non-synonymous single nucleotide polymorphisms. Trends Genet 16, 198--200.
[4]
Wang, Z. and Moult, J. 2001. SNPs, protein structure, and disease. Hum Mutat 17, 263--270.
[5]
Vendruscolo, M., Dokholyan, N. V., Paci, E. and Karplus, M. 2002. Small-world view of the amino acids that play a key role in protein folding. Physical Review E65, 061910.
[6]
del Sol, A., Fujihashi, H., Amoros, D. and Nussinov, R. 2006. Residues crucial for maintaining short paths in network communication mediate signaling in proteins. Mol Syst Biol. 2, 2006.0019.
[7]
del Sol, A., Araúzo-Bravo, M. J., Amoros, D. and Nussinov, R. 2007. Modular architecture of protein structures and allosteric communications: potential implications for signaling proteins and regulatory linkages. Genome Biol 8, R92.
[8]
Greene, L. and Higman, V. 2003. Uncovering network systems within protein structures. J Mol Biol 334, 781--791.
[9]
Olivier, M., Eeles, R., Hollstein, M., Khan, M. A., Harris, C. C. and Hainaut, P. 2002. The IARC TP53 database: new online mutation analysis and recommendations to users. Hum Mutat. 19, 607--614.
[10]
Freeman, L. C. 1977. A set of measures of centrality based on betweenness. Sociometry 40, 35--41.
[11]
Brin, S. and Page, L. 1998. The anatomy of a large-scale hypertextual Web search engine. Proceedings of the seventh international conference on World Wide Web 7, 107--117.
[12]
Kleinberg, J. M. 1999. Authoritative sources in a hyperlinked environment. J. ACM 46, 5.
[13]
McDonald, I. K. and Thornton, J. M. 1994. Satisfying Hydrogen Bonding Potential in Proteins. J Mol Biol 238, 777--793.
[14]
Gore, P. S., Burke, F. D. and Blundell, T. L. 2005. PROVAT: a tool for Voronoi tessellation analysis of protein structures and complexes. Bioinformatics 21, 3316--3317.
[15]
Capriotti, E., Fariselli, P. and Casadio, R. 2005. I-Mutant2.0: predicting stability changes upon mutation from the protein sequence or structure. Nucleic Acids Res 33, W306--310.
[16]
Böde, C., Kovács, I. A., Szalay, M. S., Palotai, R., Korcsmáros, T. and Csermely, P. 2007. Network analysis of protein dynamics. FEBS Lett. 581, 2776--2782.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
StReBio '09: Proceedings of the KDD-09 Workshop on Statistical and Relational Learning in Bioinformatics
June 2009
40 pages
ISBN:9781605586670
DOI:10.1145/1562090
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 June 2009

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. computational biology
  2. graph theory
  3. protein stability
  4. protein structure
  5. vertex cover

Qualifiers

  • Research-article

Funding Sources

Conference

KDD09
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 193
    Total Downloads
  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media