Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3106426.3106493acmconferencesArticle/Chapter ViewAbstractPublication PageswiConference Proceedingsconference-collections
research-article
Public Access

Assessing the suitability of network community detection to available meta-data using rank stability

Published: 23 August 2017 Publication History

Abstract

In the last two decades, we have witnessed the widespread use of structural analysis of data. The area, generally called Network Science, concentrates on understanding complex phenomena by looking for properties that emerge from the relationships between the pieces of data instead of the traditional mining of the data itself. A commonly used structural analysis in networks consists of finding subgraphs whose density of connections within the subgraph surpasses that of outside connections; called Community Detection. Many techniques have been proposed to find communities as well as benchmarks to evaluate the algorithms ability to find these substructures. Until recently, the literature has mostly neglected the fact that these communities often represent common characteristic of the elements in the community. For instance, in a social network, communities could represent: people who follow the same particular sport, people from the same classroom, authors working in the same field of study, to name a few. The problem here is one of community detection selection as a function of the ground truth provided by available meta-data. In this work, we propose the use of rank stability (entropy of ranks) to assess communities identified using different techniques from the perspective of meta-data. We validate our approach using a large-scale data set of on-line social interactions across multiple community detection techniques.

References

[1]
Anne Condon and Richard M. Karp. 1999. Algorithms for graph partitioning on the planted partition model. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 1671 (1999), 221--232.
[2]
Leon Danon, Albert Díaz-Guilera, Jordi Duch, and Alex Arenas. 2005. Comparing community structure identification. Journal of Statistical Mechanics: Theory and Experiment 2005, 09 (9 2005), P09008--P09008.
[3]
Nicolas Dugue and Anthony Perez. 2015. Directed Louvain : maximizing modularity in directed networks. (2015), 0--14.
[4]
Santo Fortunato. 2010. Community detection in graphs. Physics Reports 486, 3--5 (2 2010), 75--174.
[5]
Santo Fortunato and Marc Barthélemy. 2007. Resolution limit in community detection. Pnas 104, 1 (2007), 36--41.
[6]
Santo Fortunato and Darko Hric. 2016. Community detection in networks: A user guide. Physics Reports 659 (2016), 1--44.
[7]
Gourab Ghoshal and Albert-László Barabási. 2011. Ranking stability and super-stable nodes in complex networks. Nature Communications 2 (2011), 394.
[8]
M Girvan and M E J Newman. 2002. Community structure in social and biological networks. Proceedings of the National Academy of Sciences of the United States of America 99, 12 (6 2002), 7821--6.
[9]
William L. Hamilton, Justine Zhang, Cristian Danescu-Niculescu-Mizil, Dan Jurafsky, and Jure Leskovec. 2017. Loyalty in Online Communities. (2017). http://arxiv.org/abs/1703.03386
[10]
Darko Hric, Richard K. Darst, and Santo Fortunato. 2014. Community detection in networks: Structural communities versus ground truth. Physical Review E - Statistical, Nonlinear, and Soft Matter Physics 90, 6 (2014), 1--19.
[11]
Brian Karrer and M. E J Newman. 2011. Stochastic blockmodels and community structure in networks. Physical Review E - Statistical, Nonlinear, and Soft Matter Physics 83, 1 (2011), 1--10.
[12]
Andrea Lancichinetti and Santo Fortunato. 2009. Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. Physical Review E 80, 1 (7 2009), 016118.
[13]
Andrea Lancichinetti, Santo Fortunato, and Filippo Radicchi. 2008. Benchmark graphs for testing community detection algorithms. Physical Review E - Statistical, Nonlinear, and Soft Matter Physics 78, 4 (2008), 1--5.
[14]
Andrea Lancichinetti, Santo Fortunato, and Filippo Radicchi. 2008. Benchmark graphs for testing community detection algorithms. Physical Review E 80, 4 (11 2008), 1--5.
[15]
Jure Leskovec, Lars Backstrom, Ravi Kumar, and Andrew Tomkins. 2008. Microscopic evolution of social networks. Kdd (2008), 462.
[16]
Jure Leskovec, Kevin J. Lang, Anirban Dasgupta, and Michael W. Mahoney. 2009. Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters. Internet Mathematics 6, 1 (1 2009), 29--123.
[17]
Jure Leskovec, Kevin J. Lang, and Michael Mahoney. 2010. Empirical comparison of algorithms for network community detection. Proceedings of the 19th international conference on World wide web - WWW '10 (2010), 631.
[18]
M. Newman and M. Girvan. 2004. Finding and evaluating community structure in networks. Physical Review E 69, 2 (2 2004), 026113.
[19]
Mark EJ Newman. 2002. Assortative mixing in networks. Physical review letters 89, 20 (2002), 208701.
[20]
M. E. J. Newman. 2003. The Structure and Function of Complex Networks. SIAM Rev. 45, 2 (1 2003), 167--256.
[21]
M. E. J. Newman and Aaron Clauset. 2016. Structure and inference in annotated networks. Nature Communications 7, May (6 2016), 11863.
[22]
Randal S. Olson and Zachary P. Neal. 2015. Navigating the massive world of reddit: using backbone networks to map user interests in social media. PeerJ Computer Science 1 (2015), e4.
[23]
Gergely Palla, Imre Derényi, Ills Farkas, and Tams Vicsek. 2005. Uncovering the overlapping community structure of complex networks in nature and society. Nature 435, 7043 (6 2005), 814--8.
[24]
Leto Peel, Daniel B. Larremore, and Aaron Clauset. 2016. The ground truth about metadata and community detection in networks. arXiv (2016), 1--24.
[25]
Tiago P. Peixoto. 2014. Hierarchical block structures and high-resolution model selection in large networks. Physical Review X 4, 1 (2014), 1--18.
[26]
M. Rosvall and C. T. Bergstrom. 2008. Maps of random walks on complex networks reveal community structure. Proceedings of the National Academy of Sciences 105, 4 (1 2008), 1118--1123.
[27]
Philipp Singer, Fabian Flöck, Clemens Meinhart, Elias Zeitfogel, and Markus Strohmaier. 2014. Evolution of Reddit: From the Front Page of the Internet to a Self-referential Community? Proceedings of the 23rd International Conference on World Wide Web - WWW '14 Companion (2 2014), 517--522.
[28]
Jierui Xie, Boleslaw K. Szymanski, and Xiaoming Liu. 2011. SLPA: Uncovering overlapping communities in social networks via a speaker-listener interaction dynamic process. Proceedings - IEEE International Conference on Data Mining, ICDM (2011), 344--349.
[29]
Jaewon Yang and Jure Leskovec. 2015. Defining and evaluating network communities based on ground-truth. Knowledge and Information Systems 42, 1 (2015), 181--213.
[30]
Zhao Yang, Ren Algesheimer, and Claudio J. Tessone. 2016. A Comparative Analysis of Community Detection Algorithms on Artificial Networks. Scientific Reports 6, July (8 2016), 30750.

Cited By

View all
  • (2020)Characterizing the Dynamics of Academic Affiliations: A Network Science ApproachComplex Networks XI10.1007/978-3-030-40943-2_33(393-404)Online publication date: 22-Feb-2020
  • (2020)Network-Based Delineation of Health Service Areas: A Comparative Analysis of Community Detection AlgorithmsComplex Networks XI10.1007/978-3-030-40943-2_30(359-370)Online publication date: 22-Feb-2020
  • (2019)Unveiling the Interplay between the TLR4/MD2 Complex and HSP70 in the Human Cardiovascular System: A Computational ApproachInternational Journal of Molecular Sciences10.3390/ijms2013312120:13(3121)Online publication date: 26-Jun-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WI '17: Proceedings of the International Conference on Web Intelligence
August 2017
1284 pages
ISBN:9781450349512
DOI:10.1145/3106426
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 August 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. community detection
  2. meta-data
  3. rank stability

Qualifiers

  • Research-article

Funding Sources

Conference

WI '17
Sponsor:

Acceptance Rates

WI '17 Paper Acceptance Rate 118 of 178 submissions, 66%;
Overall Acceptance Rate 118 of 178 submissions, 66%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)61
  • Downloads (Last 6 weeks)7
Reflects downloads up to 23 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2020)Characterizing the Dynamics of Academic Affiliations: A Network Science ApproachComplex Networks XI10.1007/978-3-030-40943-2_33(393-404)Online publication date: 22-Feb-2020
  • (2020)Network-Based Delineation of Health Service Areas: A Comparative Analysis of Community Detection AlgorithmsComplex Networks XI10.1007/978-3-030-40943-2_30(359-370)Online publication date: 22-Feb-2020
  • (2019)Unveiling the Interplay between the TLR4/MD2 Complex and HSP70 in the Human Cardiovascular System: A Computational ApproachInternational Journal of Molecular Sciences10.3390/ijms2013312120:13(3121)Online publication date: 26-Jun-2019
  • (2019)Knowledge Graph Enhanced Community Detection and CharacterizationProceedings of the Twelfth ACM International Conference on Web Search and Data Mining10.1145/3289600.3291031(51-59)Online publication date: 30-Jan-2019
  • (2019)Community detection in civil society online networks: Theoretical guide and empirical assessmentSocial Networks10.1016/j.socnet.2019.07.00159(120-133)Online publication date: Oct-2019
  • (2019)A data-driven network approach for characterization of political parties’ ideology dynamicsApplied Network Science10.1007/s41109-019-0161-04:1Online publication date: 16-Jul-2019
  • (2018)Entropy in Network Community as an Indicator of Language Structure in Emoji Usage: A Twitter Study Across Various Thematic DatasetsComplex Networks and Their Applications VII10.1007/978-3-030-05411-3_27(328-337)Online publication date: 2-Dec-2018
  • (2017)Representing Emoji Usage Using Directed Networks: A Twitter Case StudyComplex Networks & Their Applications VI10.1007/978-3-319-72150-7_67(829-842)Online publication date: 27-Nov-2017

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media