Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

CC-GA: : A clustering coefficient based genetic algorithm for detecting communities in social networks

Published: 01 February 2018 Publication History

Highlights

Using clustering coefficient for initial population results in high modularity.
Proposed community structure based mutation allows fast convergence.
CC-GA produces competitive results to nine existing algorithms on various networks.

Abstract

A community structure is an integral part of a social network. Detecting such communities plays an important role in a wide range of applications, including but not limited to cluster analysis, recommendation systems and understanding the behaviour of complex systems. Researchers have derived many algorithms to discover the community structures of networks. Discovering communities is a challenging task, and there is no single algorithm that produces the best results for all networks. Therefore, despite many elegant solutions, discovering communities remains an active area of research. In this paper, we propose a novel algorithm, the Clustering Coefficient-based Genetic Algorithm (CC-GA), for detecting them in social and complex networks. Researchers have used several genetic algorithms to detect communities, but the proposed algorithm is novel in terms of both the generation of the initial population and the mutation method, and these improve its efficiency and accuracy. Experiments on a variety of real-world datasets and a comparison to state-of-the-art genetic and non-genetic-based algorithms show improved results.

References

[1]
A.-L. Barabasi, Z.N. Oltvai, Network biology: understanding the cell's functional organization, Nat. Rev. Genet. 5 (2) (2004) 101–113.
[2]
M. Girvan, M.E. Newman, Community structure in social and biological networks, Proc. Natl. Acad. Sci. U. S. A. 99 (12) (2002) 7821–7826.
[3]
G. Palla, A.-L. Barabási, T. Vicsek, Quantifying social group evolution, Nature 446 (7136) (2007) 664–667.
[4]
J.-P. Onnela, J. Saramäki, J. Hyvönen, G. Szabó, D. Lazer, K. Kaski, J. Kertész, A.-L. Barabási, Structure and tie strengths in mobile communication networks, Proc. Natl. Acad. Sci. U. S. A. 104 (18) (2007) 7332–7336.
[5]
A.-L. Barabási, N. Gulbahce, J. Loscalzo, Network medicine: a network-based approach to human disease, Nat. Rev. Genet. 12 (1) (2011) 56–68.
[6]
D.J. Watts, S.H. Strogatz, Collective dynamics of ‘small-world’ networks, Nature 393 (6684) (1998) 440.
[7]
R. Guimera, S. Mossa, A. Turtschi, L.A. Amaral, The worldwide air transportation network: anomalous centrality, community structure, and cities’ global roles, Proc. Natl. Acad. Sci. U. S. A. 102 (22) (2005) 7794–7799.
[8]
A.-L. Barabási, R. Albert, Emergence of scaling in random networks, Science 286 (5439) (1999) 509–512.
[9]
M.E. Newman, M. Girvan, Finding and evaluating community structure in networks, Phys. Rev. E 69 (2) (2004) 026113.
[10]
A. Daud, N.R. Aljohani, R.A. Abbasi, Z. Rafique, T. Amjad, H. Dawood, K.H. Alyoubi, Finding rising stars in co-author networks via weighted mutual influence, Proceedings of the 26th International Conference on World Wide Web Companion, WWW ’17 Companion, International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland (2017) 33–41,.
[11]
J. Kanwal, O. Maqbool, R. Abbasi, A.Q. Abbasi, Network analysis of software change history for understanding software evolution, 17th IEEE International Multi Topic Conference 2014 (2014) 229–234,.
[12]
M. Marra, A. Emrouznejad, W. Ho, J.S. Edwards, The value of indirect ties in citation networks: SNA analysis with OWA operator weights, Inf. Sci. 314 (2015) 135–151.
[13]
R.A. Abbasi, O. Maqbool, M. Mushtaq, N.R. Aljohani, A. Daud, J.S. Alowibdi, B. Shahzad, Saving lives using social media: analysis of the role of twitter for personal blood donation requests and dissemination, Telemat. Inform. (2017),. http://www.sciencedirect.com/science/article/pii/S0736585316303835.
[14]
M.A. Jarwar, R.A. Abbasi, M. Mushtaq, O. Maqbool, N.R. Aljohani, A. Daud, J.S. Alowibdi, J. Cano, S. García, I. Chong, Communiments: a framework for detecting community based sentiments for events, Int. J. Seman. Web Inf. Syst. (IJSWIS) 13 (2) (2017) 87–108,.
[15]
L. Bai, X. Cheng, J. Liang, Y. Guo, Fast graph clustering with a new description model for community detection, Inf. Sci. 388–389 (2017) 37–47,. http://www.sciencedirect.com/science/article/pii/S0020025517301184.
[16]
M. Hajiabadi, H. Zare, H. Bobarshad, IEDC: an integrated approach for overlapping and non-overlapping community detection, Knowl.-Based Syst. 123 (2017) 188–199,. http://www.sciencedirect.com/science/article/pii/S0950705117300850.
[17]
A. Hollocou, J. Maudet, T. Bonald, M. Lelarge, A linear streaming algorithm for community detection in very large networks, ArXiv e-printsarXiv:1703.02955.
[18]
W. Liu, X. Jiang, M. Pellegrini, X. Wang, Discovering communities in complex networks by edge label propagation, Sci. Rep. 6 (2016) 22470,. http://europepmc.org/articles/PMC4772381.
[19]
F. Zou, D. Chen, S. Li, R. Lu, M. Lin, Community detection in complex networks: multi-objective discrete backtracking search optimization algorithm with decomposition, Appl. Soft Comput. 53 (2017) 285–295,. http://www.sciencedirect.com/science/article/pii/S1568494617300091.
[20]
A. Lancichinetti, S. Fortunato, Consensus clustering in complex networks, Sci. rep. (2012) 2.
[21]
L.M. Naeni, R. Berretta, P. Moscato, Ma-net: a reliable memetic algorithm for community detection by modularity optimization, in: Proceedings of the 18th Asia Pacific Symposium on Intelligent and Evolutionary Systems, Volume 1, Springer, 2015, pp. 311–323.
[22]
M.E. Newman, Modularity and community structure in networks, Proc. Natl. Acad. Sci. U. S. A. 103 (23) (2006) 8577–8582.
[23]
R. Shang, J. Bai, L. Jiao, C. Jin, Community detection based on modularity and an improved genetic algorithm, Physica A: Stat. Mech. Appl. 392 (5) (2013) 1215–1231.
[24]
M. Rosvall, C.T. Bergstrom, Maps of random walks on complex networks reveal community structure, Proc. Natl. Acad. Sci. U. S. A. 105 (4) (2008) 1118–1123.
[25]
W. Li, C. Huang, M. Wang, X. Chen, Stepping community detection algorithm based on label propagation and similarity, Physica A: Stat. Mech. Appl. 472 (2017) 145–155,. http://www.sciencedirect.com/science/article/pii/S0378437117300225.
[26]
N. Binesh, M. Rezghi, Fuzzy clustering in community detection based on nonnegative matrix factorization with two novel evaluation criteria, Appl. Soft Comput. (2017),. http://www.sciencedirect.com/science/article/pii/S1568494616306378.
[27]
X. Ma, D. Dong, Evolutionary nonnegative matrix factorization algorithms for community detection in dynamic networks, IEEE Trans. Knowl. Data Eng. PP (99) (2017) 1,.
[28]
X. Liu, W. Wang, D. He, P. Jiao, D. Jin, C.V. Cannistraci, Semi-supervised community detection based on non-negative matrix factorization with node popularity, Inf. Sci. 381 (2017) 304–321,. http://www.sciencedirect.com/science/article/pii/S0020025516318709.
[29]
X. Shi, H. Lu, G. Jia, Adaptive overlapping community detection with bayesian nonnegative matrix factorization, Database Systems for Advanced Applications: 22nd International Conference, DASFAA 2017, Suzhou, China, March 27–30, 2017, Proceedings, Part II, Springer International Publishing, Cham (2017) 339–353,.
[30]
J.H. Chin, K. Ratnavelu, A semi-synchronous label propagation algorithm with constraints for community detection in complex networks, Sci. Rep. (2017) 7.
[31]
S. Fortunato, Community detection in graphs, Phys. Rep. 486 (3) (2010) 75–174.
[32]
Z. Yang, R. Algesheimer, C.J. Tessone, A comparative analysis of community detection algorithms on artificial networks, Sci. Rep. (2016) 6.
[33]
T.N. Bui, C. Jones, Finding good approximate vertex and edge partitions is np-hard, Inf. Process. Lett. 42 (3) (1992) 153–159,. http://www.sciencedirect.com/science/article/pii/002001909290140Q.
[34]
H. Banati, N. Arora, Modeling evolutionary group search optimization approach for community detection in social networks, in: Proceedings of the Third International Symposium on Women in Computing and Informatics, ACM, 2015, pp. 109–117.
[35]
C. Pizzuti, A multiobjective genetic algorithm to find communities in complex networks, IEEE Trans. Evol. Comput. 16 (3) (2012) 418–430.
[36]
M. Tasgin, A. Herdagdelen, H. Bingol, Community detection in complex networks using genetic algorithms, arXiv preprint arXiv:0711.0491.
[37]
Y. Park, M. Song, A genetic algorithm for clustering problems, Proceedings of the Third Annual Conference on Genetic Programming (1998) 568–575.
[38]
F. Zou, D. Chen, S. Li, R. Lu, M. Lin, Community detection in complex networks: multi-objective discrete backtracking search optimization algorithm with decomposition, Appl. Soft. Comput. (2017).
[39]
K. Deng, X. Liu, W. Li, Community detection in complex networks based on improved genetic algorithm and local optimization, Int. J. Grid Distrib. Comput. 9 (10) (2016) 357–374.
[40]
P. Wu, L. Pan, Multi-objective community detection based on memetic algorithm, PLOS ONE 10 (5) (2015) e0126845.
[41]
R. Francisquini, V. Rosset, M.C. Nascimento, Ga-lp: a genetic algorithm based on label propagation to detect communities in directed networks, Expert Syst. Appl. 74 (2017) 127–138,. http://www.sciencedirect.com/science/article/pii/S0957417416306911.
[42]
Z. Ding, X. Zhang, D. Sun, B. Luo, Overlapping community detection based on network decomposition, Sci. Rep. (2016) 6.
[43]
A.I. Hafez, N.I. Ghali, A.E. Hassanien, A.A. Fahmy, Genetic algorithms for community detection in social networks, in: 2012 12th International Conference on Intelligent Systems Design and Applications (ISDA), IEEE, 2012, pp. 460–465.
[44]
P. Mazur, K. Zmarzlowski, A. Orlowski, Genetic algorithms approach to community detection, Acta Phys. Pol.-Ser. A Gen. Phys. 117 (4) (2010) 703.
[45]
X. Wang, C.-D. Wang, J.-H. Lai, Modularity optimization by global-local search, in: 2016 International Joint Conference on Neural Networks (IJCNN), IEEE, 2016, pp. 840–846.
[46]
M.A. Tabarzad, A. Hamzeh, A heuristic local community detection method (HLCD), Appl. Intell. 46 (1) (2017) 62–78,.
[47]
J.H. Holland, Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control and Artificial Intelligence, MIT Press, Cambridge, MA, USA, 1992.
[48]
J. Handl, J. Knowles, An evolutionary approach to multiobjective clustering, IEEE Trans. Evol. Comput. 11 (1) (2007) 56–76.
[49]
L. Bai, X. Cheng, J. Liang, Y. Guo, Fast graph clustering with a new description model for community detection, Inf. Sci. (2017).
[50]
X. Liu, W. Wang, D. He, P. Jiao, D. Jin, C.V. Cannistraci, Semi-supervised community detection based on non-negative matrix factorization with node popularity, Inf. Sci. 381 (2017) 304–321.
[51]
E. Khaji, A.S. Mohammadi, A heuristic method to generate better initial population for evolutionary methods, arXiv preprint arXiv:1406.4518.
[52]
M. Latapy, Main-memory triangle computations for very large (sparse (power-law)) graphs, Theor. Comput. Sci. 407 (1) (2008) 458–473.
[53]
M.E. Newman, Fast algorithm for detecting community structure in networks, Phys. Rev. E 69 (6) (2004) 066133.
[54]
M.E. Newman, A measure of betweenness centrality based on random walks, Soc. Netw. 27 (1) (2005) 39–54.
[55]
C. Pizzuti, Community detection in social networks with genetic algorithms, in: Proceedings of the 10th Annual Conference on Genetic and Evolutionary Computation, ACM, 2008, pp. 1137–1138.
[56]
Y. Li, G. Liu, S.-Y. Lao, A genetic algorithm for community detection in complex networks, J. Cent. South Univ. 20 (2013) 1269–1276.
[57]
W.W. Zachary, An information flow model for conflict and fission in small groups, J. Anthropol. Res. (1977) 452–473.
[58]
D. Lusseau, K. Schneider, O.J. Boisseau, P. Haase, E. Slooten, S.M. Dawson, The bottlenose dolphin community of doubtful sound features a large proportion of long-lasting associations, Behav. Ecol. Sociobiol. 54 (4) (2003) 396–405.
[59]
P.M. Gleiser, L. Danon, Community structure in jazz, Adv. Complex Syst. 06 (04) (2003) 565–573,.
[60]
S.S. Shen-Orr, R. Milo, S. Mangan, U. Alon, Network motifs in the transcriptional regulation network of Escherichia coli, Nat. Genet. 31 (1) (2002) 64–68.
[61]
M.E. Newman, Finding community structure in networks using the eigenvectors of matrices, Phys. Rev. E 74 (3) (2006) 036104.
[62]
J.J. McAuley, J. Leskovec, Learning to discover social circles in ego networks., NIPS, Vol. 2012 (2012) 548–556.
[63]
L. Salwinski, C.S. Miller, A.J. Smith, F.K. Pettit, J.U. Bowie, D. Eisenberg, The database of interacting proteins: 2004 update, Nucl. Acids Res. 32 (Suppl. 1) (2004) D449–D451.
[64]
J. Leskovec, J. Kleinberg, C. Faloutsos, Graph evolution: densification and shrinking diameters, ACM Trans. Knowl. Discov. Data 1 (1) (2007),.
[65]
M. Bogu ná, R. Pastor-Satorras, A. Díaz-Guilera, A. Arenas, Models of social networks based on social distance attachment, Phys. Rev. E 70 (5) (2004) 056122.
[66]
A. Gog, D. Dumitrescu, B. Hirsbrunner, Community detection in complex networks using collaborative evolutionary algorithms, in: European Conference on Artificial Life, Springer, 2007, pp. 886–894.
[67]
S. Li, Y. Chen, H. Du, M.W. Feldman, A genetic algorithm with local search strategy for improved detection of community structure, Complexity 15 (4) (2010) 53–60.
[68]
U.N. Raghavan, R. Albert, S. Kumara, Near linear time algorithm to detect community structures in large-scale networks, Phys. Rev. E 76 (3) (2007) 036106.
[69]
W. Liu, M. Pellegrini, X. Wang, Detecting communities based on network topology, Sci. Rep. (2014) 4.
[70]
Y.-Y. Ahn, J.P. Bagrow, S. Lehmann, Link communities reveal multiscale complexity in networks, Nature 466 (7307) (2010) 761–764.
[71]
S. Gregory, Finding overlapping communities in networks by label propagation, New J. Phys. 12 (10) (2010) 103018.
[72]
M.E. Newman, The structure of scientific collaboration networks, Proc. Natl. Acad. Sci. U. S. A. 98 (2) (2001) 404–409.
[73]
A. Lancichinetti, S. Fortunato, F. Radicchi, Benchmark graphs for testing community detection algorithms, Phys. Rev. E 78 (4) (2008) 046110.
[74]
A. Lancichinetti, S. Fortunato, J. Kertész, Detecting the overlapping and hierarchical community structure in complex networks, New J. Phys. 11 (3) (2009) 033015.
[75]
J. Leskovec, J. Kleinberg, C. Faloutsos, Graph evolution: densification and shrinking diameters, ACM Trans. Knowl. Discov. Data (TKDD) 1 (1) (2007) 2.

Cited By

View all

Index Terms

  1. CC-GA: A clustering coefficient based genetic algorithm for detecting communities in social networks
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image Applied Soft Computing
      Applied Soft Computing  Volume 63, Issue C
      Feb 2018
      306 pages

      Publisher

      Elsevier Science Publishers B. V.

      Netherlands

      Publication History

      Published: 01 February 2018

      Author Tags

      1. Community detection
      2. Graph clustering
      3. Genetic algorithm
      4. Artificial intelligence
      5. Social network analysis

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 11 Sep 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)IAFCO: an intelligent agent-based framework for combinatorial optimizationThe Journal of Supercomputing10.1007/s11227-023-05852-680:8(10863-10930)Online publication date: 1-May-2024
      • (2023)Detecting Communities in Complex Networks Using an Adaptive Genetic Algorithm and Node Similarity-Based EncodingComplexity10.1155/2023/47965362023Online publication date: 1-Jan-2023
      • (2023)Dynamic community detection including node attributesExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.119791223:COnline publication date: 1-Aug-2023
      • (2023)Random search immune algorithm for community detectionSoft Computing - A Fusion of Foundations, Methodologies and Applications10.1007/s00500-023-07999-z27:12(8061-8090)Online publication date: 3-Apr-2023
      • (2023)Leiden Fitness-Based Genetic Algorithm with Niching for Community Detection in Large Social NetworksPRICAI 2023: Trends in Artificial Intelligence10.1007/978-981-99-7022-3_39(423-435)Online publication date: 15-Nov-2023
      • (2022)A Community Detection Algorithm Based on Correlation Analysis of Connection PatternProceedings of the 2022 14th International Conference on Machine Learning and Computing10.1145/3529836.3529949(347-357)Online publication date: 18-Feb-2022
      • (2022)Fuzzy Community Detection Based on Elite Symbiotic Organisms Search and Node Neighborhood InformationIEEE Transactions on Fuzzy Systems10.1109/TFUZZ.2021.308723530:7(2500-2514)Online publication date: 1-Jul-2022
      • (2022)On the Co-authorship network analysis in the Process Mining research CommunityExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.117853206:COnline publication date: 15-Nov-2022
      • (2022)NodeRank: Finding influential nodes in social networks based on interestsThe Journal of Supercomputing10.1007/s11227-021-03947-678:2(2098-2124)Online publication date: 1-Feb-2022
      • (2022)An improved cuckoo search optimization algorithm with genetic algorithm for community detection in complex networksMultimedia Tools and Applications10.1007/s11042-022-12409-x81:18(25205-25231)Online publication date: 1-Jul-2022
      • Show More Cited By

      View Options

      View options

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media