Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Put Three and Three Together: Triangle-Driven Community Detection

Published: 29 January 2016 Publication History

Abstract

Community detection has arisen as one of the most relevant topics in the field of graph data mining due to its applications in many fields such as biology, social networks, or network traffic analysis. Although the existing metrics used to quantify the quality of a community work well in general, under some circumstances, they fail at correctly capturing such notion. The main reason is that these metrics consider the internal community edges as a set, but ignore how these actually connect the vertices of the community. We propose the Weighted Community Clustering (WCC), which is a new community metric that takes the triangle instead of the edge as the minimal structural motif indicating the presence of a strong relation in a graph. We theoretically analyse WCC in depth and formally prove, by means of a set of properties, that the maximization of WCC guarantees communities with cohesion and structure. In addition, we propose Scalable Community Detection (SCD), a community detection algorithm based on WCC, which is designed to be fast and scalable on SMP machines, showing experimentally that WCC correctly captures the concept of community in social networks using real datasets. Finally, using ground-truth data, we show that SCD provides better quality than the best disjoint community detection algorithms of the state of the art while performing faster.

References

[1]
Y. Y. Ahn, J. P. Bagrow, and S. Lehmann. 2010. Link communities reveal multiscale complexity in networks. Nature 466, 7307 (2010), 761--764.
[2]
J. P Bagrow. 2012. Communities and bottlenecks: Trees and treelike networks have high modularity. Phy. Rev. E 85, 6 (2012), 066118.
[3]
V. D. Blondel, J. L. Guillaume, R. Lambiotte, and E. Lefebvre. 2008. Fast unfolding of communities in large networks. JSTAT 2008, 10 (2008), 10008.
[4]
A. Clauset, M. E. J. Newman, and C. Moore. 2004. Finding community structure in very large networks. Phy. Rev. E 70, 6 (2004), 066111.
[5]
Dblp. 2012. Retrieved September 2012 from http://dblp.uni-trier.de/xml/.
[6]
Aurelien Decelle, Florent Krzakala, Cristopher Moore, and Lenka Zdeborová. 2011. Inference and phase transitions in the detection of modules in sparse networks. Phys. Rev. Lett. 107, 6 (2011), 065701.
[7]
E. Di Giacomo, W. Didimo, L. Grilli, and G. Liotta. 2007. Graph visualization techniques for web clustering engines. IEEE Trans. Vis. Comput. Graph. 31, 2 (2007), 294--304.
[8]
G. W. Flake, S. Lawrence, and C. L. Giles. 2000. Efficient identification of web communities. In SIGKDD. ACM, 150--160.
[9]
S. Fortunato. 2010. Community detection in graphs. Phys. Rep. 486, 3--5 (2010), 75--174.
[10]
S. Fortunato and M. Barthélemy. 2007. Resolution limit in community detection. PNAS 104, 1 (2007), 36--41.
[11]
M. Girvan and M. E. J. Newman. 2002. Community structure in social and biological networks. PNAS 99, 12 (2002), 7821--7826.
[12]
B. H. Good, Y. A. de Montjoye, and A. Clauset. 2010. Performance of modularity maximization in practical contexts. Phy. Rev. E 81, 4 (2010), 046106.
[13]
Joan Guisado-Gámez, David Dominguez-Sal, and Josep-Lluis Larriba-Pey. 2014. Massive query expansion by exploiting graph knowledge bases for image retrieval. In International Conference on Multimedia Retrieval, ICMR '14, Glasgow, UK. (April 01--04, 2014). 33.
[14]
R. Kannan, S. Vempala, and A. Vetta. 2004. On clusterings: good, bad and spectral. JACM 51, 3 (2004), 497--515.
[15]
A. Lancichinetti. 2009. Community detection algorithms: a comparative analysis. Phy. Rev. E 80, 5 (2009), 056117.
[16]
A. Lancichinetti, F. Radicchi, J. J. Ramasco, and S. Fortunato. 2011. Finding statistically significant communities in networks. PloS one 6, 4 (2011), e18961.
[17]
J. Leskovec, L. Backstrom, R. Kumar, and A. Tomkins. 2008. Microscopic evolution of social networks. In SIGKDD. ACM, 462--470.
[18]
J. Leskovec, K. J. Lang, and M. Mahoney. 2010. Empirical comparison of algorithms for network community detection. In WWW. ACM, 631--640.
[19]
D. Liben-Nowell and J. Kleinberg. 2007. The link-prediction problem for social networks. ASIST 58, 7 (2007), 1019--1031.
[20]
M. McPherson, L. Smith-Lovin, and J. M. Cook. 2001. Birds of a feather: homophily in social networks. Annu. Rev. Sociol. 1, 1 (2001), 415--444.
[21]
A. Medus, G. Acuna, and C. O. Dorso. 2005. Detection of community structures in networks via global optimization. Phys. A 358, 2--4 (2005), 593--604.
[22]
M. E. J. Newman. 2001. The structure of scientific collaboration networks. PNAS 98, 2 (2001), 404--409.
[23]
M. E. J. Newman and M. Girvan. 2004. Finding and evaluating community structure in networks. Phy. Rev. E 69, 2 (2004), 026113.
[24]
M. E. J. Newman and J. Park. 2003. Why social networks are different from other types of networks. Phy. Rev. E 68, 3 (2003), 036122.
[25]
A. Padrol-Sureda, G. Perarnau-Llobet, J. Pfeifle, and V. Muntés-Mulero. 2010. Overlapping community search for social networks. In ICDE. IEEE, 992--995.
[26]
G. Palla, I. Derényi, I. Farkas, and T. Vicsek. 2005. Uncovering the overlapping community structure of complex networks in nature and society. Nature 435, 7043 (2005), 814--818.
[27]
A. Prat-Pérez, D. Dominguez-Sal, and J. L. Larriba-Pey. 2011. Social based layouts for the increase of locality in graph operations. In DASFAA. Springer Berlin Heidelberg, 558--569.
[28]
A. Prat-Pérez, D. Dominguez-Sal, and J. L. Larriba-Pey. 2014. High quality, scalable and parallel community detection for large real graphs. In WWW. ACM, 225--236.
[29]
Filippo Radicchi. 2014. A paradox in community detection. Europhys. Lett. 106, 3 (2014), 38001.
[30]
F. Radicchi, C. Castellano, F. Cecconi, V. Loreto, and D. Parisi. 2004. Defining and identifying communities in networks. PNAS 101, 9 (2004), 2658--2663.
[31]
U. N. Raghavan, R. Albert, and S. Kumara. 2007. Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E 76, 3 (2007), 036106.
[32]
M. Rosvall and C. T. Bergstrom. 2008. Maps of random walks on complex networks reveal community structure. PNAS 105, 4 (2008), 1118--1123.
[33]
V. Satuluri, S. Parthasarathy, and Y. Ruan. 2011. Local graph sparsification for scalable clustering. In SIGMOD. ACM, 721--732.
[34]
X. Shi, L. A. Adamic, and M. J. Strauss. 2007. Networks of strong ties. Phys. A, Stat. Mechanics Appl. 378, 1 (2007), 33--47.
[35]
M. Sozio and A. Gionis. 2010. The community-search problem and how to plan a successful cocktail party. In KDD. ACM, 939--948.
[36]
T. Wang, B. Yang, J. Gao, D. Yang, S. Tang, H. Wu, K. Liu, and J. Pei. 2009. Mobileminer: a real world case study of data mining in mobile communication. In SIGMOD. ACM, 1083--1086.
[37]
J. Yang and J. Leskovec. 2012. Defining and evaluating network communities based on ground-truth. In ICDM. IEEE, 745--754.
[38]
J. Yang and J. Leskovec. 2013. Overlapping community detection at scale: a nonnegative matrix factorization approach. In WSDM. ACM, 587--596.
[39]
Jaewon Yang and Jure Leskovec. 2014. Overlapping communities explain core-periphery organization of networks. Proc. IEEE 102, 12 (2014), 1892--1902.

Cited By

View all
  • (2025)Double negative sampled graph adversarial representation learning with motif-based structural attention networkNeurocomputing10.1016/j.neucom.2024.129030619(129030)Online publication date: Mar-2025
  • (2025)LeaDCDInformation Sciences: an International Journal10.1016/j.ins.2024.121341686:COnline publication date: 1-Jan-2025
  • (2025)HAGCN: A hybrid-order brain network-based graph convolution learning framework with multi-head attention for brain disorder classificationBiomedical Signal Processing and Control10.1016/j.bspc.2024.106944100(106944)Online publication date: Mar-2025
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Knowledge Discovery from Data
ACM Transactions on Knowledge Discovery from Data  Volume 10, Issue 3
February 2016
358 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/2888412
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 January 2016
Accepted: 01 May 2015
Revised: 01 May 2015
Received: 01 April 2014
Published in TKDD Volume 10, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Community detection
  2. parallel algorithm
  3. scalable algorithm
  4. social networks
  5. triangles

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)22
  • Downloads (Last 6 weeks)2
Reflects downloads up to 27 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Double negative sampled graph adversarial representation learning with motif-based structural attention networkNeurocomputing10.1016/j.neucom.2024.129030619(129030)Online publication date: Mar-2025
  • (2025)LeaDCDInformation Sciences: an International Journal10.1016/j.ins.2024.121341686:COnline publication date: 1-Jan-2025
  • (2025)HAGCN: A hybrid-order brain network-based graph convolution learning framework with multi-head attention for brain disorder classificationBiomedical Signal Processing and Control10.1016/j.bspc.2024.106944100(106944)Online publication date: Mar-2025
  • (2024)Motif-Based Contrastive Learning for Community DetectionIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2024.336787335:9(11706-11719)Online publication date: Sep-2024
  • (2024)Alleviating the resolution limit problem in spatial community detection: a local network structure-based methodInternational Journal of Geographical Information Science10.1080/13658816.2024.2421778(1-23)Online publication date: 30-Oct-2024
  • (2024)Motif-based community detection in heterogeneous multilayer networksScientific Reports10.1038/s41598-024-59120-514:1Online publication date: 16-Apr-2024
  • (2024)A novel three-phase expansion algorithm for uncovering communities in social networks using local influence and similarity in embedding spaceDecision Analytics Journal10.1016/j.dajour.2024.10047211(100472)Online publication date: Jun-2024
  • (2024)An Improved Label Propagation Algorithm Based on Motif and Critical Node for Community DetectionAdvanced Intelligent Computing Technology and Applications10.1007/978-981-97-5678-0_11(121-133)Online publication date: 1-Aug-2024
  • (2023)Parallel Overlapping Community Detection Algorithm on GPUIEEE Transactions on Big Data10.1109/TBDATA.2022.31803609:2(677-687)Online publication date: 1-Apr-2023
  • (2023)SMOG: Accelerating Subgraph Matching on GPUs2023 IEEE High Performance Extreme Computing Conference (HPEC)10.1109/HPEC58863.2023.10363569(1-7)Online publication date: 25-Sep-2023
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media