Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Sequential algorithm for fast clique percolation

Jussi M. Kumpula, Mikko Kivelä, Kimmo Kaski, and Jari Saramäki
Phys. Rev. E 78, 026109 – Published 15 August 2008

Abstract

In complex network research clique percolation, introduced by Palla, Derényi, and Vicsek [Nature (London) 435, 814 (2005)], is a deterministic community detection method which allows for overlapping communities and is purely based on local topological properties of a network. Here we present a sequential clique percolation algorithm (SCP) to do fast community detection in weighted and unweighted networks, for cliques of a chosen size. This method is based on sequentially inserting the constituent links to the network and simultaneously keeping track of the emerging community structure. Unlike existing algorithms, the SCP method allows for detecting k-clique communities at multiple weight thresholds in a single run, and can simultaneously produce a dendrogram representation of hierarchical community structure. In sparse weighted networks, the SCP algorithm can also be used for implementing the weighted clique percolation method recently introduced by Farkas et al. [New J. Phys. 9, 180 (2007)]. The computational time of the SCP algorithm scales linearly with the number of k-cliques in the network. As an example, the method is applied to a product association network, revealing its nested community structure.

  • Figure
  • Figure
  • Figure
  • Figure
  • Received 12 May 2008

DOI:https://doi.org/10.1103/PhysRevE.78.026109

©2008 American Physical Society

Authors & Affiliations

Jussi M. Kumpula*, Mikko Kivelä, Kimmo Kaski, and Jari Saramäki

  • Department of Biomedical Engineering and Computational Science, Helsinki University of Technology, P.O. Box 9203, FIN-02015 HUT, Finland

  • *jkumpula@lce.hut.fi

Article Text (Subscription Required)

Click to Expand

References (Subscription Required)

Click to Expand
Issue

Vol. 78, Iss. 2 — August 2008

Reuse & Permissions
Access Options
Author publication services for translation and copyediting assistance advertisement

Authorization Required


×

Images

  • Figure 1
    Figure 1
    (Color online) Schematic illustration of the process for detecting the k-cliques a newly inserted link completes. The dashed line depicts the new link, inserted between nodes vi and vj. The common neighbors of nodes vi and vj are Nij={vm,vn,vp,vq}. For detecting newly formed 4-cliques, all pairs of nodes in Nij are checked to see if they are connected, that is, if they form a 2-clique. Each 2-clique in the set gives rise to a 4-clique, so in total the link lij will generate three 4-cliques. In the case k=5, only one 3-clique is found, which contains the nodes vm, vn, and vp. It will give rise to a single 5-clique including these nodes in addition to vi and vj.Reuse & Permissions
  • Figure 2
    Figure 2
    (Color online) Illustration of the algorithm for detecting k-clique communities in a simple example network. Here, k=3. (a) The original network Γ consists of three 3-cliques labeled A, B, and C. 2-cliques, i.e., nodes connected by single links, are labeled with lower case letters. (b) Bipartite network presentation of the clique structure. Note that in the bipartite network, the 3-cliques B and C, which form a 3-clique community, are connected by the shared 2-clique f. Clique A forms another 3-clique community. (c) 3-cliques detected by the first part of the algorithm as links are sequentially inserted into the network. Each new k-clique is denoted by dark nodes whereas nodes associated with existing k-cliques appear gray. (d) Corresponding updates to the (k1)-clique network Γ* as a result of the second part of the algorithm. k-clique communities correspond to connected components of this network (shaded areas).Reuse & Permissions
  • Figure 3
    Figure 3
    (Color online) Computation time of the algorithm for three values of k, as a function of the number of k-cliques (upper row) and network size (lower row). Symbols denote different test networks: GN (◼), WSN (▲), and CM (▲), see text for details. The solid line is a linear reference. For comparison, we have also plotted the computational time of the CFINDER 1.21 algorithm for the GN networks (▶). Note that CFINDER always processes all values of k.Reuse & Permissions
  • Figure 4
    Figure 4
    Dendrogram visualization of the nested k-community structure of the trading categories of the Finnish online auction site Huuto.net for k=3 (a) and k=4 (b).Reuse & Permissions
×

Sign up to receive regular email alerts from Physical Review E

Log In

Cancel
×

Search


Article Lookup

Paste a citation or DOI

Enter a citation
×