Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Effective community search for large attributed graphs

Published: 01 August 2016 Publication History

Abstract

Given a graph G and a vertex qG, the community search query returns a subgraph of G that contains vertices related to q. Communities, which are prevalent in attributed graphs such as social networks and knowledge bases, can be used in emerging applications such as product advertisement and setting up of social events. In this paper, we investigate the attributed community query (or ACQ), which returns an attributed community (AC) for an attributed graph. The AC is a subgraph of G, which satisfies both structure cohesiveness (i.e., its vertices are tightly connected) and keyword cohesiveness (i.e., its vertices share common keywords). The AC enables a better understanding of how and why a community is formed (e.g., members of an AC have a common interest in music, because they all have the same keyword "music"). An AC can be "personalized"; for example, an ACQ user may specify that an AC returned should be related to some specific keywords like "research" and "sports".
To enable efficient AC search, we develop the CL-tree index structure and three algorithms based on it. We evaluate our solutions on four large graphs, namely Flickr, DBLP, Tencent, and DBpedia. Our results show that ACs are more effective and efficient than existing community retrieval approaches. Moreover, an AC contains more precise and personalized information than that of existing community search and detection methods.

References

[1]
https://en.wikipedia.org/wiki/Disjoint-set_data_structure.
[2]
V. Batagelj and M. Zaversnik. An o(m) algorithm for cores decomposition of networks. arXiv, 2003.
[3]
G. Bhalotia et al. Keyword searching and browsing in databases using banks. In ICDE, 2002.
[4]
W. Cui, Y. Xiao, H. Wang, Y. Lu, and W. Wang. Online search of overlapping communities. In SIGMOD, 2013.
[5]
W. Cui, Y. Xiao, H. Wang, and W. Wang. Local search of communities in large graphs. In SIGMOD, 2014.
[6]
B. Ding, J. X. Yu, S. Wang, L. Qin, X. Zhang, and X. Lin. Finding top-k min-cost connected trees in databases. In ICDE, 2007.
[7]
S. N. Dorogovtsev et al. K-core organization of complex networks. Physical review letters, 2006.
[8]
W. Fan, J. Li, S. Ma, N. Tang, Y. Wu, and Y. Wu. Graph pattern matching: from intractable to polynomial time. PVLDB, 2010.
[9]
W. Fan, X. Wang, Y. Wu, and J. Xu. Association rules with graph patterns. PVLDB, 8(12):1502--1513, 2015.
[10]
Y. Fang, H. Zhang, Y. Ye, and X. Li. Detecting hot topics from twitter: A multiview approach. Journal of Information Science, 40(5):578--593, 2014.
[11]
S. Fortunato. Community detection in graphs. Physics Reports, 486(3):75--174, 2010.
[12]
J. Han, M. Kamber, and J. Pei. Data mining: concepts and techniques. Elsevier, 2011.
[13]
J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. In SIGMOD, 2000.
[14]
H. He, H. Wang, J. Yang, and P. S. Yu. Blinks: ranked keyword searches on graphs. In SIGMOD, 2007.
[15]
X. Huang, H. Cheng, L. Qin, W. Tian, and J. X. Yu. Querying k-truss community in large and dynamic graphs. In SIGMOD, 2014.
[16]
V. Kacholia et al. Bidirectional expansion for keyword search on graph databases. In VLDB, 2005.
[17]
M. Kargar and A. An. Keyword search in graphs: Finding r-cliques. PVLDB, 4(10):681--692, 2011.
[18]
R.-H. Li, L. Qin, J. X. Yu, and R. Mao. Influential community search in large networks. In PVLDB, 2015.
[19]
R.-H. Li, J. X. Yu, and R. Mao. Efficient core maintenance in large dynamic graphs. TKDE, 2014.
[20]
Y. Liu, A. Niculescu-Mizil, and W. Gryc. Topic-link lda: joint models of topic and author community. In ICML, 2009.
[21]
R. M. Nallapati, A. Ahmed, E. P. Xing, and W. W. Cohen. Joint latent topic models for text and citations. In KDD, 2008.
[22]
M. Newman et al. Finding and evaluating community structure in networks. Physical review E, 2004.
[23]
Y. Ruan, D. Fuhry, and S. Parthasarathy. Efficient community detection in large networks using content and links. In WWW, 2013.
[24]
M. Sachan et al. Using content and interactions for discovering communities in social networks. In WWW, 2012.
[25]
S. B. Seidman. Network structure and minimum degree. Social networks, 5(3):269--287, 1983.
[26]
M. Sozio and A. Gionis. The community-search problem and how to plan a successful cocktail party. In KDD, 2010.
[27]
B. Thomee et al. The new data and new challenges in multimedia research. arXiv:1503.01817, 2015.
[28]
H. Tong, C. Faloutsos, B. Gallagher, and T. Eliassi-Rad. Fast best-effort pattern matching in large attributed graphs. In KDD, 2007.
[29]
Z. Xu, Y. Ke, Y. Wang, H. Cheng, and J. Cheng. A model-based approach to attributed graph clustering. In SIGMOD, 2012.
[30]
Y. Fang et al. Effective community search for large attributed graphs (technical report). http://www.cs.hku.hk/research/techreps/document/TR-2016-01.pdf.
[31]
J. Yang, J. McAuley, and J. Leskovec. Community detection in networks with node attributes. In ICDM, 2013.
[32]
T. Yang, R. Jin, Y. Chi, and S. Zhu. Combining link and content for community detection: a discriminative approach. In KDD, 2009.
[33]
J. X. Yu, L. Qin, and L. Chang. Keyword search in databases. Synthesis Lectures on Data Management, 2009.
[34]
Y. Zhou, H. Cheng, and J. X. Yu. Graph clustering based on structural/attribute similarities. VLDB, 2009.

Cited By

View all
  • (2024)Distributed Shortest Distance Labeling on Large-Scale GraphsProceedings of the VLDB Endowment10.14778/3675034.367505317:10(2641-2653)Online publication date: 1-Jun-2024
  • (2024)QTCS: Efficient Query-Centered Temporal Community SearchProceedings of the VLDB Endowment10.14778/3648160.364816317:6(1187-1199)Online publication date: 1-Feb-2024
  • (2024)Scalable Spatio-temporal Top-k Interaction Queries on Dynamic CommunitiesACM Transactions on Spatial Algorithms and Systems10.1145/364837410:1(1-25)Online publication date: 16-Feb-2024
  • Show More Cited By

Index Terms

  1. Effective community search for large attributed graphs
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 9, Issue 12
    August 2016
    345 pages
    ISSN:2150-8097
    Issue’s Table of Contents

    Publisher

    VLDB Endowment

    Publication History

    Published: 01 August 2016
    Published in PVLDB Volume 9, Issue 12

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)23
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 14 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Distributed Shortest Distance Labeling on Large-Scale GraphsProceedings of the VLDB Endowment10.14778/3675034.367505317:10(2641-2653)Online publication date: 1-Jun-2024
    • (2024)QTCS: Efficient Query-Centered Temporal Community SearchProceedings of the VLDB Endowment10.14778/3648160.364816317:6(1187-1199)Online publication date: 1-Feb-2024
    • (2024)Scalable Spatio-temporal Top-k Interaction Queries on Dynamic CommunitiesACM Transactions on Spatial Algorithms and Systems10.1145/364837410:1(1-25)Online publication date: 16-Feb-2024
    • (2024)GraphZeppelin: How to Find Connected Components (Even When Graphs Are Dense, Dynamic, and Massive)ACM Transactions on Database Systems10.1145/364384649:3(1-31)Online publication date: 16-May-2024
    • (2024)Efficient Distributed Hop-Constrained Path Enumeration on Large-Scale GraphsProceedings of the ACM on Management of Data10.1145/36392772:1(1-25)Online publication date: 26-Mar-2024
    • (2024)ProCom: A Few-shot Targeted Community Detection AlgorithmProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671749(3414-3424)Online publication date: 25-Aug-2024
    • (2024)Scalable Community Search over Large-scale Graphs based on Graph TransformerProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657771(1680-1690)Online publication date: 10-Jul-2024
    • (2024)Neural Attributed Community Search at Billion ScaleProceedings of the ACM on Management of Data10.1145/36267381:4(1-25)Online publication date: 26-Apr-2024
    • (2024)Privacy-Preserving Approximate Minimum Community Search on Large NetworksIEEE Transactions on Information Forensics and Security10.1109/TIFS.2024.337620119(4146-4160)Online publication date: 11-Mar-2024
    • (2024)Range constrained group query on attribute social graphDistributed and Parallel Databases10.1007/s10619-024-07439-342:3(337-375)Online publication date: 30-Mar-2024
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media