Application of Graph Clustering and Visualisation Methods to Analysis of Biomolecular Data

Celms, Edgars; Čerāns, Kārlis; Freivalds, Kārlis; Ķikusts, Paulis; Lāce, Lelde; Melkus, Gatis; Opmanis, Mārtiņš; Rituma, Dārta; Ručevskis, Pēteris; Vīksna, Juris

doi:10.1007/978-3-319-97571-9_20

Edgars Celms¹¹,
Kārlis Čerāns¹¹,
Kārlis Freivalds¹¹,
Paulis Ķikusts¹¹,
Lelde Lāce¹¹,
Gatis Melkus¹¹,
Mārtiņš Opmanis¹¹,
Dārta Rituma¹¹,
Pēteris Ručevskis¹¹ &
…
Juris Vīksna¹¹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 838))

Included in the following conference series:

International Baltic Conference on Databases and Information Systems

587 Accesses
2 Citations

Abstract

In this paper we present an approach based on integrated use of graph clustering and visualisation methods for semi-supervised discovery of biologically significant features in biomolecular data sets. We describe several clustering algorithms that have been custom designed for analysis of biomolecular data and feature an iterated two step approach involving initial computation of thresholds and other parameters used in clustering algorithms, which is followed by identification of connected graph components, and, if needed, by adjustment of clustering parameters for processing of individual subgraphs.

We demonstrate the applications of these algorithms to two concrete use cases: (1) analysis of protein coexpression in colorectal cancer cell lines; and (2) protein homology identification from, both sequence and structural similarity, data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Graph partitioning and visualization in graph mining: a survey

Article 23 May 2022

Vienna Graph Clustering

A General Powerful Graph Pattern Matching System for Data Analysis

References

Boccaletti, S., et al.: The structure and dynamics of multilayer networks. Phys. Rep. 544, 1–122 (2014)
Article MathSciNet Google Scholar
Choudhari, J., et al.: Genomic determinants of protein abundance variation in colorectal cancer cells. Cell Rep. 20, 2201–2214 (2017)
Article Google Scholar
Enright, A., et al.: An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 30, 1575–1584 (2002)
Article Google Scholar
Fortunato, A.: Community detection in graphs. Phys. Rep. 486, 75–174 (2010)
Article MathSciNet Google Scholar
Freivalds, K., Dogrusoz, U., Kikusts, P.: Disconnected graph layout and the polyomino packing approach. In: Mutzel, P., Jünger, M., Leipert, S. (eds.) GD 2001. LNCS, vol. 2265, pp. 378–391. Springer, Heidelberg (2002). https://doi.org/10.1007/3-540-45848-4_30
Chapter MATH Google Scholar
Freivalds, K., Glagoļevs, J.: Graph compact orthogonal layout algorithm. In: Fouilhoux, P., Gouveia, L.E.N., Mahjoub, A.R., Paschos, V.T. (eds.) ISCO 2014. LNCS, vol. 8596, pp. 255–266. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-09174-7_22
Chapter Google Scholar
Grishin, N.: Fold change in evolution of protein structures. Struct. Biol. 134, 167–185 (2001)
Article Google Scholar
Higgins, D., Sievers, F.: Clustal Omega, accurate alignment of very large numbers of sequences. Methods Mol. Biol. 1079, 105–116 (2014)
Article Google Scholar
Higgins, D., et al.: ClustalW and ClustalX version 2.0. Bioinformatics 23, 2947–2948 (2007)
Article Google Scholar
Jonsson, P., et al.: Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis. BMC Bioinform. 7(1), 2 (2006)
Article Google Scholar
Kurbatova, N., Mancinska, L., Viksna, J.: Protein structure comparison based on fold evolution. Lect. Notes Inform. 115, 78–89 (2007)
Google Scholar
Kurbatova, N., Viksna, J.: Exploration of evolutionary relations between protein structures. Commun. Comput. Inf. Sci. 13, 154–166 (2008)
Google Scholar
Langfelder, P., Horwath, S.: WGCNA: an R package for weighted correlation network analysis. BMC Bioinform. 9, 559 (2008)
Article Google Scholar
Maddi, A., Eslahchi, C.: Discovering overlapped protein complexes from weighted PPI networks by removing inter-module hubs. Sci. Rep. 7, 3247 (2017)
Article Google Scholar
Nepusz, T., Yu, H., Paccanaro, A.: Detecting overlapping protein complexes in protein-protein interaction networks. Nat. Methods 9, 471–472 (2012)
Article Google Scholar
Orengo, C., et al.: New functional families in CATH to improve the mapping of conserved functional sites to 3D structures. Nucleic Acids Res. 44, 490–498 (2013)
Google Scholar
Pearson, R.: Effective protein sequence comparison. Methods Enzymol. 266, 227–258 (1996)
Article Google Scholar
Petryszak, R., et al.: Expression Atlas update - an integrated database of gene and protein expression in humans, animals and plants. Nucleic Acids Res. 44(D1), 746–752 (2016)
Article Google Scholar
Pirim, H., Eksioglu, B., Perkins, A.: Clustering high throughput biological data with B-MST, a minimum spanning tree based heuristic. Comput. Biol. Med. 62, 94–102 (2015)
Article Google Scholar
Rung, J., Schlitt, T., Brazma, A., Freivalds, K., Vilo, J.: Building and analysing genome-wide gene disruption networks. Bioinformatics 18, S202–S210 (2002)
Article Google Scholar
Schaeffer, S.: Graph clustering. Comput. Sci. Rev. 1, 27–64 (2007)
Article Google Scholar
Smith, T., Waterman, M.: Identification of common molecular subsequences. J. Mol. Biol. 147, 195–197 (1981)
Article Google Scholar
Traag, A., Doreian, P., Mrvar, A.: Partitioning signed networks. ArXiv e-prints abs/1803.02082 (2018)
van Dongen, S., Abreu-Goodger, C.: Using MCL to extract clusters from networks. In: van Helden, J., Toussaint, A., Thieffry, D. (eds.) Bacterial Molecular Networks. Methods in Molecular Biology (Methods and Protocols), vol. 804, pp. 281–295. Springer, New York (2012). https://doi.org/10.1007/978-1-61779-361-5_15
Chapter Google Scholar
Vihrovs, J., Prusis, K., Freivalds, K., Rucevskis, P., Krebs, V.: A potential field function for overlapping point set and graph cluster visualization. Commun. Comput. Inf. Sci. 550, 136–152 (2015)
Google Scholar
Viksna, J., Gilbert, D.: Assessment of the probabilities for evolutionary structural changes in protein folds. Bioinformatics 23, 832–841 (2007)
Article Google Scholar

Download references

Acknowledgements

The research was supported by ERDF project 1.1.1.1/16/A/135.

Author information

Authors and Affiliations

Institute of Mathematics and Computer Science, University of Latvia, Riga, Latvia
Edgars Celms, Kārlis Čerāns, Kārlis Freivalds, Paulis Ķikusts, Lelde Lāce, Gatis Melkus, Mārtiņš Opmanis, Dārta Rituma, Pēteris Ručevskis & Juris Vīksna

Authors

Edgars Celms
View author publications
You can also search for this author in PubMed Google Scholar
Kārlis Čerāns
View author publications
You can also search for this author in PubMed Google Scholar
Kārlis Freivalds
View author publications
You can also search for this author in PubMed Google Scholar
Paulis Ķikusts
View author publications
You can also search for this author in PubMed Google Scholar
Lelde Lāce
View author publications
You can also search for this author in PubMed Google Scholar
Gatis Melkus
View author publications
You can also search for this author in PubMed Google Scholar
Mārtiņš Opmanis
View author publications
You can also search for this author in PubMed Google Scholar
Dārta Rituma
View author publications
You can also search for this author in PubMed Google Scholar
Pēteris Ručevskis
View author publications
You can also search for this author in PubMed Google Scholar
Juris Vīksna
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Juris Vīksna .

Editor information

Editors and Affiliations

Institute of Data Science and Digital Technologies, Vilnius University, Vilnius, Lithuania
Audrone Lupeikiene
Information Systems Department, Vilnius Gediminas Technical University, Vilnius, Lithuania
Olegas Vasilecas
Institute of Data Science and Digital Technologies, Vilnius University, Vilnius, Lithuania
Gintautas Dzemyda

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Celms, E. et al. (2018). Application of Graph Clustering and Visualisation Methods to Analysis of Biomolecular Data. In: Lupeikiene, A., Vasilecas, O., Dzemyda, G. (eds) Databases and Information Systems. DB&IS 2018. Communications in Computer and Information Science, vol 838. Springer, Cham. https://doi.org/10.1007/978-3-319-97571-9_20

Download citation

DOI: https://doi.org/10.1007/978-3-319-97571-9_20
Published: 15 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-97570-2
Online ISBN: 978-3-319-97571-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Application of Graph Clustering and Visualisation Methods to Analysis of Biomolecular Data

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Graph partitioning and visualization in graph mining: a survey

Vienna Graph Clustering

A General Powerful Graph Pattern Matching System for Data Analysis

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Application of Graph Clustering and Visualisation Methods to Analysis of Biomolecular Data

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Graph partitioning and visualization in graph mining: a survey

Vienna Graph Clustering

A General Powerful Graph Pattern Matching System for Data Analysis

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation