Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Community detection for emerging social networks

Published: 01 November 2017 Publication History

Abstract

Many famous online social networks, e.g., Facebook and Twitter, have achieved great success in the last several years. Users in these online social networks can establish various connections via both social links and shared attribute information. Discovering groups of users who are strongly connected internally is defined as the community detection problem. Community detection problem is very important for online social networks and has extensive applications in various social services. Meanwhile, besides these popular social networks, a large number of new social networks offering specific services also spring up in recent years. Community detection can be even more important for new networks as high quality community detection results enable new networks to provide better services, which can help attract more users effectively. In this paper, we will study the community detection problem for new networks, which is formally defined as the "New Network Community Detection" problem. New network community detection problem is very challenging to solve for the reason that information in new networks can be too sparse to calculate effective similarity scores among users, which is crucial in community detection. However, we notice that, nowadays, users usually join multiple social networks simultaneously and those who are involved in a new network may have been using other well-developed social networks for a long time. With full considerations of network difference issues, we propose to propagate useful information from other well-established networks to the new network with efficient information propagation models to overcome the shortage of information problem. An effective and efficient method, Cat (Cold stArT community detector), is proposed in this paper to detect communities for new networks using information from multiple heterogeneous social networks simultaneously. Extensive experiments conducted on real-world heterogeneous online social networks demonstrate that Cat can address the new network community detection problem effectively.

References

[1]
Banfield, J., Raftery, A.: Model-based Gaussian and non-Gaussian clustering Biometrics (1993)
[2]
Belkin, M., Niyogi, P.: Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput. 15(6), 1373---1396 (2003)
[3]
Chakrabarti, D.: Autopart: Parameter-Free Graph Partitioning and Outlier Detection PKDD (2004)
[4]
Cimiano, P., Hotho, A., Staab, S.: Comparing Conceptual, Divise and Agglomerative Clustering for Learning Taxonomies from Text ECAI (2004)
[5]
Gács, P., Lovász, L.: Complexity of algorithms Lecture Notes (1999)
[6]
Hastie, T., Tibshirani, R., Friedman, J.: Hierarchical Clustering The Elements of Statistical Learning (2009)
[7]
Hopner, F., Hoppner, F., Klawonn, F.: Fuzzy Cluster Analysis: Methods for Classification, Data Analysis and Image Recognition (1999)
[8]
Huang, Z.: Extensions to the k-means algorithm for clustering large data sets with categorical values Data Mining Knowledge Discovery (1998)
[9]
Jin, S., Zhang, J., Yu, P., Yang, S., Li, A.: Synergistic partitioning in multiple large scale social networks IEEE BigData (2014)
[10]
Khorasgani, R.R., Chen, J., Zaïane. O.R.: Top Leaders Community Detection Approach in Information Networks 4Th SNA-KDD Workshop on Social Network Mining and Analysis, Washington DC. Citeseer (2010)
[11]
King, B.: Step-Wise Clustering procedures journal of the american statistical association (1967)
[12]
Kong, X., Zhang, J., Yu, P.: Inferring Anchor Links across Multiple Heterogeneous Social Networks CIKM (2013)
[13]
Kumpula, J.M., Kivelä, M., Kaski, K., Saramäki, J.: Sequential algorithm for fast clique percolation. Phys. Rev. E 78(2), 026109 (2008)
[14]
Kwak, H., Lee, C., Park, H., Moon, S.: What is twitter, a social network or a news media? WWW (2010)
[15]
Leskovec, J., Lang, K., Dasgupta, A., Mahoney, M.: Statistical Properties of Community Structure in Large Social and Information Networks WWW (2008)
[16]
Lin, C., Cho, Y., Hwang, W., Pei, P., Zhang, A.: Clustering Methods in a Protein-Protein Interaction Network (2007)
[17]
Lin, W., Kong, X., Yu, P., Wu, Q., Jia, Y., Li, C.: Community detection in incomplete information networks WWW (2012)
[18]
Liu, Y., Li, Z., Xiong, H., Gao, X., Wu, J.: Understanding of Internal Clustering Validation Measures ICDM (2010)
[19]
Luxburg, U.V.: A tutorial on spectral clustering. Stat. Comput. 17(4), 395---416 (2007)
[20]
Meila, M., Shi, J.: A Random Walks View of Spectral Segmentation AISTATS (2001)
[21]
Mishra, N., Schreiber, R., Stanton, I., Tarjan, R.: Clustering Social Networks WAW (2007)
[22]
Ng, A.Y., Jordan, M.I., Weiss, Y., et al.: On spectral clustering: Analysis and an algorithm. Adv. Neural Inf. Proces. Syst. 2, 849---856 (2002)
[23]
Noulas, A., Scellato, S., Mascolo, C., Pontil, M.: An Empirical Study of Geographic User Activity Patterns in Foursquare ICWSM (2011)
[24]
Palla, G., Derényi, I., Farkas, I., Vicsek, T.: Uncovering the overlapping community structure of complex networks in nature and society. Nature 435(7043), 814---818 (2005)
[25]
Pan, S., Yang, Q.: A survey on transfer learning TKDE (2010)
[26]
Panigrahy, R., Najork, M., Xie, Y.: How User Behavior is Related to Social Affinity WSDM (2012)
[27]
Petersen, P.: Linear algebra (2012)
[28]
Prat-Pérez, A., Dominguez-Sal, D., Brunat, J.M., Larriba-Pey, J.-L.: Shaping Communities out of Triangles Proceedings of the 21St ACM International Conference on Information and Knowledge Management, Pages 1677---1681. 1$~~~1$ (2012)
[29]
Prat-Pérez, A., Dominguez-Sal, D., Larriba-Pey, J.-L.: High Quality, Scalable and Parallel Community Detection for Large Real Graphs Proceedings of the 23Rd International Conference on World Wide Web, pp 225---236 (2014)
[30]
Raghavan, U.N., Albert, R., Kumara, S.: Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E 76(3), 036106 (2007)
[31]
Richardson, M., Domingos, P.: Mining Knowledge-Sharing Sites for Viral Marketing KDD (2002)
[32]
Shi, J., Malik, J.: NorMalized cuts and image segmentation TPAMI (2000)
[33]
Shi, J., Malik, J.: NorMalized cuts and image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 22(8), 888---905 (2000)
[34]
Sneath, P., Sokal, R.: Numerical taxonomy the principles and practice of numerical classification (1973)
[35]
Sun, Y., Aggarwal, C., Han, J.: Relation strength-aware clustering of heterogeneous information networks with incomplete attributes VLDB (2012)
[36]
Tang, J., Gao, H., Hu, X., Liu, H.: Exploiting Homophily Effect for Trust Prediction WSDM (2013)
[37]
Tobias, J., Planqué, R., Cram, D., Seddon, N.: Species interactions and the structure of complex communication networks. PNAS (2014)
[38]
Trusov, M., Bodapati, A., Bucklin, R.: Determining influential users in internet social networks journal of marketing research (2010)
[39]
van Wijk, B., Stam, C., Daffertshofer, A.: Comparing brain networks of different size and connectivity density using graph theory PLos ONE (2010)
[40]
Wang, L., Lou, T., Tang, J., Hopcroft, J.: Detecting Community Kernels in Large Social Networks ICDM (2011)
[41]
Wang, M., Wang, C., Jeffrey, X.Y., Zhang, J.: Community detection in social networks: an in-depth benchmarking study with a procedure-oriented framework. Proc. VLDB Endowment 8(10), 998---1009 (2015)
[42]
Wang, S., Zhang, Z., Li, J.: A scalable cur matrix decomposition algorithm: Lower time complexity and tighter bound. CoRR (2012)
[43]
Ward, J.: Hierarchical grouping to optimize an objective function Journal of the American Statistical Association (1963)
[44]
Watts, D., Strogatz, S.: Collective dynamics of 'small-world' networks. Nature (1998)
[45]
Zhan, Q., Zhang, S., Wang, J., Yu, P., Xie, J.: Influence Maximization across Partially Aligned Heterogenous Social Networks PAKDD (2015)
[46]
Zhang, J., Kong, X., Yu, P.: Predicting social links for new users across aligned heterogeneous social networks ICDM (2013)
[47]
Zhang, J., Kong, X., Yu, P.: Transferring Heterogeneous Links across Location-Based Social Networks WSDM (2014)
[48]
Zhang, J., Shao, W., Wang, S., Kong, X., Yu, P.: Pna: Partial network alignment with generic stable matching IEEE IRI (2015)
[49]
Zhang, J., Yu, P.: Community Detection for Emerging Networks SDM (2015)
[50]
Zhang, J., Yu, P.: Integrated anchor and social link predictions across partially aligned social networks IJCAI (2015)
[51]
Zhang, J., Yu, P.: Mcd: Mutual Clustering across Multiple Heterogeneous Networks IEEE Bigdata Congress (2015)
[52]
Zhang, J., Yu, P., Zhou, Z.: Meta-Path Based Multi-Network Collective Link Prediction KDD (2014)
[53]
Zhou, Y., Cheng, H., Yu, J.: Graph clustering based on structural/attribute similarities VLDB (2009)
[54]
Zhou, Y., Liu, L.: Social Influence Based Clustering of Heterogeneous Information Networks KDD (2013)

Cited By

View all
  • (2023)TOAK: A Topology-oriented Attack Strategy for Degrading User Identity Linkage in Cross-network LearningProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615084(2208-2218)Online publication date: 21-Oct-2023
  • (2023)CANA: Causal-enhanced Social Network AlignmentProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614799(2219-2228)Online publication date: 21-Oct-2023
  • (2023)AsyLinkNeurocomputing10.1016/j.neucom.2022.10.027515:C(174-184)Online publication date: 1-Jan-2023
  • Show More Cited By

Index Terms

  1. Community detection for emerging social networks
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image World Wide Web
    World Wide Web  Volume 20, Issue 6
    November 2017
    387 pages

    Publisher

    Kluwer Academic Publishers

    United States

    Publication History

    Published: 01 November 2017

    Author Tags

    1. Cold start problem
    2. Community detection
    3. Data mining
    4. Transfer learning

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 01 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)TOAK: A Topology-oriented Attack Strategy for Degrading User Identity Linkage in Cross-network LearningProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615084(2208-2218)Online publication date: 21-Oct-2023
    • (2023)CANA: Causal-enhanced Social Network AlignmentProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614799(2219-2228)Online publication date: 21-Oct-2023
    • (2023)AsyLinkNeurocomputing10.1016/j.neucom.2022.10.027515:C(174-184)Online publication date: 1-Jan-2023
    • (2021)User Identification Based on Integrating Multiple User Information across Online Social NetworksSecurity and Communication Networks10.1155/2021/55334172021Online publication date: 1-Jan-2021
    • (2021)Locate Who You AreProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3482134(3413-3417)Online publication date: 26-Oct-2021
    • (2021)A fast local community detection algorithm in complex networksWorld Wide Web10.1007/s11280-021-00931-124:6(1929-1955)Online publication date: 1-Nov-2021

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media