Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2505515.2505535acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Overlapping community detection using seed set expansion

Published: 27 October 2013 Publication History
  • Get Citation Alerts
  • Abstract

    Community detection is an important task in network analysis. A community (also referred to as a cluster) is a set of cohesive vertices that have more connections inside the set than outside. In many social and information networks, these communities naturally overlap. For instance, in a social network, each vertex in a graph corresponds to an individual who usually participates in multiple communities. One of the most successful techniques for finding overlapping communities is based on local optimization and expansion of a community metric around a seed set of vertices. In this paper, we propose an efficient overlapping community detection algorithm using a seed set expansion approach. In particular, we develop new seeding strategies for a personalized PageRank scheme that optimizes the conductance community score. The key idea of our algorithm is to find good seeds, and then expand these seed sets using the personalized PageRank clustering procedure. Experimental results show that this seed set expansion approach outperforms other state-of-the-art overlapping community detection methods. We also show that our new seeding strategies are better than previous strategies, and are thus effective in finding good overlapping clusters in a graph.

    References

    [1]
    Stanford Network Analysis Project. http://snap.stanford.edu/.
    [2]
    B. Abrahao, S. Soundarajan, J. Hopcroft, and R. Kleinberg. On the separability of structural classes of communities. In KDD, pages 624--632, 2012.
    [3]
    Y.-Y. Ahn, J. P. Bagrow, and S. Lehmann. Link communities reveal multiscale complexity in networks. Nature, 466:761--764, 2010.
    [4]
    R. Andersen, F. Chung, and K. Lang. Local graph partitioning using PageRank vectors. In FOCS, 2006.
    [5]
    R. Andersen and K. J. Lang. Communities from seed sets. In WWW, pages 223--232, 2006.
    [6]
    F. Bonchi, P. Esfandiar, D. F. Gleich, C. Greif, and L. V. Lakshmanan. Fast matrix computations for pairwise and columnwise commute times and Katz scores. Internet Mathematics, 8(1--2):73--112, 2012.
    [7]
    R. Burt. Structural Holes: The Social Structure of Competition. Harvard University Press, 1995.
    [8]
    M. Coscia, G. Rossetti, F. Giannotti, and D. Pedreschi. Demon: a local-first discovery method for overlapping communities. In KDD, 2012.
    [9]
    I. S. Dhillon, Y. Guan, and B. Kulis. Weighted graph cuts without eigenvectors: A multilevel approach. PAMI, 29(11):1944--1957, 2007.
    [10]
    U. Gargi, W. Lu, V. Mirrokni, and S. Yoon. Large-scale community detection on YouTube for topic discovery and exploration. In ICWSM, 2011.
    [11]
    D. F. Gleich and C. Seshadhri. Vertex neighborhoods, low conductance cuts, and good seeds for local community methods. In KDD, pages 597--605, 2012.
    [12]
    G. Karypis and V. Kumar. Multilevel k-way partitioning scheme for irregular graphs. JPDC, 48:96--129, 1998.
    [13]
    R. Khandekar, G. Kortsarz, and V. Mirrokni. Advantage of overlapping clusters for minimizing conductance. In LATIN, pages 494--505, 2012.
    [14]
    R. I. Kondor and J. D. Lafferty. Diffusion kernels on graphs and other discrete input spaces. In ICML, 2002.
    [15]
    D. Lai, X. Wu, H. Lu, and C. Nardini. Learning overlapping communities in complex networks via non-negative matrix factorization. Int. J. Mod Phys C, 22(10):1173--1190, 2011.
    [16]
    J. Leskovec, K. J. Lang, A. Dasgupta, and M. W. Mahoney. Community structure in large networks: Natural cluster sizes and the absence of large well-defined clusters. Internet Mathematics, 6(1):29--123, 2009.
    [17]
    M. W. Mahoney, L. Orecchia, and N. K. Vishnoi. A local spectral method for graphs: With applications to improving graph partitions and exploring data graphs locally. JMLR, 13:2339--2365, 2012.
    [18]
    N. Mishra, R. Schreiber, I. Stanton, and R. E. Tarjan. Clustering social networks. In WAW, 2007.
    [19]
    A. Mislove, H. S. Koppula, K. P. Gummadi, P. Druschel, and B. Bhattacharjee. Growth of the Flickr social network. In The First Workshop on Online Social Networks, 2008.
    [20]
    G. Palla, I. Derényi, I. Farkas, and T. Vicsek. Uncovering the overlapping community structure of complex networks in nature and society. Nature, 435:814--818, 2005.
    [21]
    J.-Y. Pan, H.-J. Yang, C. Faloutsos, and P. Duygulu. Automatic multimedia cross-modal correlation discovery. In KDD, pages 653--658, 2004.
    [22]
    B. S. Rees and K. B. Gallagher. Overlapping community detection by collective friendship group inference. In ASONAM, pages 375--379, 2010.
    [23]
    H. Shen, X. Cheng, K. Cai, and M.-B. Hu. Detect overlapping and hierarchical community structure in networks. Phys. A, 388(8):1706--1712, 2009.
    [24]
    H. H. Song, B. Savas, T. W. Cho, V. Dave, I. Dhillon, Y. Zhang, and L. Qiu. Clustered embedding of massive social networks. In SIGMETRICS, 2012.
    [25]
    J. J. Whang, X. Sui, and I. S. Dhillon. Scalable and memory-efficient clustering of large-scale social networks. In ICDM, pages 705--714, 2012.
    [26]
    J. Xie, S. Kelley, and B. K. Szymanski. Overlapping community detection in networks: the state of the art and comparative study. ACM Computing Surveys, 2013.
    [27]
    J. Yang and J. Leskovec. Overlapping community detection at scale: a nonnegative matrix factorization approach. In WSDM, pages 587--596, 2013.
    [28]
    S. Zhang, R.-S. Wang, and X.-S. Zhang. Identification of overlapping community structure in complex networks using fuzzy c-means clustering. Phys. A, 374(1):483 -- 490, 2007.

    Cited By

    View all
    • (2023)Boosting Multitask Learning on Graphs through Higher-Order Task AffinitiesProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599265(1213-1222)Online publication date: 6-Aug-2023
    • (2023)Towards hypergraph cognitive networks as feature-rich models of knowledgeEPJ Data Science10.1140/epjds/s13688-023-00409-212:1Online publication date: 16-Aug-2023
    • (2023)Overlapping community detection with adaptive density peaks clustering and iterative partition strategyExpert Systems with Applications10.1016/j.eswa.2022.119213213(119213)Online publication date: Mar-2023
    • Show More Cited By

    Index Terms

    1. Overlapping community detection using seed set expansion

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge Management
      October 2013
      2612 pages
      ISBN:9781450322638
      DOI:10.1145/2505515
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 27 October 2013

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. clustering
      2. community detection
      3. overlapping clusters
      4. seed set expansion
      5. seeds

      Qualifiers

      • Research-article

      Conference

      CIKM'13
      Sponsor:
      CIKM'13: 22nd ACM International Conference on Information and Knowledge Management
      October 27 - November 1, 2013
      California, San Francisco, USA

      Acceptance Rates

      CIKM '13 Paper Acceptance Rate 143 of 848 submissions, 17%;
      Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

      Upcoming Conference

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)79
      • Downloads (Last 6 weeks)7

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Boosting Multitask Learning on Graphs through Higher-Order Task AffinitiesProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599265(1213-1222)Online publication date: 6-Aug-2023
      • (2023)Towards hypergraph cognitive networks as feature-rich models of knowledgeEPJ Data Science10.1140/epjds/s13688-023-00409-212:1Online publication date: 16-Aug-2023
      • (2023)Overlapping community detection with adaptive density peaks clustering and iterative partition strategyExpert Systems with Applications10.1016/j.eswa.2022.119213213(119213)Online publication date: Mar-2023
      • (2023)A novel overlapping community detection strategy based on Core-Bridge seedsInternational Journal of Machine Learning and Cybernetics10.1007/s13042-023-02020-315:6(2131-2147)Online publication date: 4-Dec-2023
      • (2023)Local community detection based on influence maximization in dynamic networksApplied Intelligence10.1007/s10489-022-04403-553:15(18294-18318)Online publication date: 27-Jan-2023
      • (2023)The Parameterized Complexity of s-Club with Triangle and Seed ConstraintsTheory of Computing Systems10.1007/s00224-023-10135-x67:5(1050-1081)Online publication date: 12-Aug-2023
      • (2023)AutoGF: Runtime Graph Filter Tuning for Community Node RankingComplex Networks and Their Applications XI10.1007/978-3-031-21131-7_15(189-202)Online publication date: 26-Jan-2023
      • (2022)Know thy tools! Limits of popular algorithms used for topic reconstructionQuantitative Science Studies10.1162/qss_a_002173:4(1054-1078)Online publication date: 20-Dec-2022
      • (2022)Uncovering Local Hierarchical Overlapping Communities at ScaleIEEE Transactions on Big Data10.1109/TBDATA.2019.29404508:2(432-445)Online publication date: 1-Apr-2022
      • (2022)A Community-Driven Deep Collaborative Approach for Recommender SystemsIEEE Access10.1109/ACCESS.2022.323032310(131144-131152)Online publication date: 2022
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media