Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3289600.3291031acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article
Public Access

Knowledge Graph Enhanced Community Detection and Characterization

Published: 30 January 2019 Publication History

Abstract

Recent studies show that by combining network topology and node attributes, we can better understand community structures in complex networks. However, existing algorithms do not explore "contextually" similar node attribute values, and therefore may miss communities defined with abstract concepts. We propose a community detection and characterization algorithm that incorporates the contextual information of node attributes described by multiple domain-specific hierarchical concept graphs. The core problem is to find the context that can best summarize the nodes in communities, while also discovering communities aligned with the context summarizing communities. We formulate the two intertwined problems, optimal community-context computation, and community discovery, with a coordinate-ascent based algorithm that iteratively updates the nodes' community label assignment with a community-context and computes the best context summarizing nodes of each community. Our unique contributions include (1) a composite metric on Informativeness and Purity criteria in searching for the best context summarizing nodes of a community; (2) a node similarity measure that incorporates the context-level similarity on multiple node attributes; and (3) an integrated algorithm that drives community structure discovery by appropriately weighing edges. Experimental results on public datasets show nearly 20 percent improvement on F-measure and Jaccard for discovering underlying community structure over the current state-of-the-art of community detection methods. Community structure characterization was also accurate to find appropriate community types for four datasets.

References

[1]
Sören Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, and Zachary Ives. 2007. Dbpedia: A nucleus for a web of open data. The semantic web (2007), 722--735.
[2]
Shreyansh Bhatt, Brandon Minnery, Srikanth Nadella, Beth Bullemer, Valerie Shalin, and Amit Sheth. 2017. Enhancing crowd wisdom using measures of diversity computed from social media data. In Proceedings of the International Conference on Web Intelligence. ACM, 907--913.
[3]
Vincent D Blondel, Jean-Lou Guillaume, Renaud Lambiotte, and Étienne Lefebvre. 2011. The Louvain method for community detection in large networks. J of Statistical Mechanics: Theory and Experiment, Vol. 10 (2011), P10008.
[4]
Cecile Bothorel, Juan David Cruz, Matteo Magnani, and Barbora Micenkova. 2015. Clustering attributed graphs: models, measures and methods. Network Science, Vol. 3, 3 (2015), 408--444.
[5]
Yoon-Sik Cho, Greg Ver Steeg, Emilio Ferrara, and Aram Galstyan. 2016. Latent space model for multi-modal social data. In Proceedings of the 25th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 447--458.
[6]
Asmae El Kassiri and Fatima-Zahra Belouadha. 2015. Towards a unified semantic model for online social networks analysis and interoperability. In Intelligent Systems: Theories and Applications (SITA), 2015 10th International Conference on. IEEE, 1--6.
[7]
Guillaume Erétéo, Michel Buffa, Fabien Gandon, and Olivier Corby. 2009. Analysis of a real online social network using semantic web frameworks. The Semantic Web-ISWC 2009 (2009), 180--195.
[8]
Daniel G Goldstein, Randolph Preston McAfee, and Siddharth Suri. 2014. The wisdom of smaller, smarter crowds. In Proceedings of the fifteenth ACM conference on Economics and computation. ACM, 471--488.
[9]
Ryan Hartman, Josemar Faustino, Diego Pinheiro, and Ronaldo Menezes. 2017. Assessing the suitability of network community detection to available meta-data using rank stability. In Proceedings of the International Conference on Web Intelligence. ACM, 162--169.
[10]
Richard J Hazler and Sharon A Denham. 2002. Social isolation of youth at risk: Conceptualizations and practical implications. Journal of Counseling & Development, Vol. 80, 4 (2002), 403--409.
[11]
Dongxiao He, Zhiyong Feng, Di Jin, Xiaobao Wang, and Weixiong Zhang. 2017. Joint Identification of Network Communities and Semantics via Integrative Modeling of Network Topologies and Node Contents. (2017). https://aaai.org/ocs/index.php/AAAI/AAAI17/paper/view/14612
[12]
Thanh Ho and Phuc Do. 2015. Discovering Communities of Users on Social Networks Based on Topic Model Combined with Kohonen Network. In Knowledge and Systems Engineering (KSE), 2015 Seventh International Conference on. IEEE, 268--273.
[13]
Sergey Ioffe. 2010. Improved consistent sampling, weighted minhash and l1 sketching. In Data Mining (ICDM), 2010 IEEE 10th International Conference on. IEEE, 246--255.
[14]
Caiyan Jia, Yafang Li, Matthew B Carson, Xiaoyang Wang, and Jian Yu. 2017. Node Attribute-enhanced Community Detection in Complex Networks. Scientific Reports, Vol. 7 (2017).
[15]
Sarasi Lalithsena, Pavan Kapanipathi, and Amit Sheth. 2016. Harnessing relationships for domain-specific subgraph extraction: A recommendation use case. In Big Data (Big Data), 2016 IEEE International Conference on. IEEE, 706--715.
[16]
Jure Leskovec and Julian J Mcauley. 2012. Learning to discover social circles in ego networks. In Advances in neural information processing systems. 539--547.
[17]
Chunshan Li, William K Cheung, Yunming Ye, Xiaofeng Zhang, Dianhui Chu, and Xin Li. 2015. The author-topic-community model for author interest profiling and community discovery. Knowledge and Information Systems, Vol. 44, 2 (2015), 359--383.
[18]
Liyuan Liu, Linli Xu, Zhen Wangy, and Enhong Chen. 2015. Community detection based on structure and content: A content propagation perspective. In Data Mining (ICDM), 2015 IEEE International Conference on. IEEE, 271--280.
[19]
Shelley McKeown, Reeshma Haji, and Neil Ferguson. 2016. Understanding Peace and Conflict Through Social Identity Theory. Contemporary Global Perspectives. Switzerland: Springer (2016).
[20]
Pablo N Mendes, Max Jakob, Andrés Garc'ia-Silva, and Christian Bizer. 2011. DBpedia spotlight: shedding light on the web of documents. In Proceedings of the 7th international conference on semantic systems. ACM, 1--8.
[21]
Galileo Mark Namata, Brian Staats, Lise Getoor, and Ben Shneiderman. 2007. A dual-view approach to interactive network visualization. In Proceedings of the sixteenth ACM conference on Conference on information and knowledge management. ACM, 939--942.
[22]
Mark EJ Newman. 2006. Modularity and community structure in networks. Proceedings of the national academy of sciences, Vol. 103, 23 (2006), 8577--8582.
[23]
Mark EJ Newman and Aaron Clauset. 2016. Structure and inference in annotated networks. Nature communications, Vol. 7 (2016).
[24]
Guillermo Palma, Maria-Esther Vidal, and Louiqa Raschid. 2014. Drug-target interaction prediction using semantic similarity and edge partitioning. In International Semantic Web Conference. Springer, 131--146.
[25]
Simon Pool, Francesco Bonchi, and Matthijs van Leeuwen. 2014. Description-driven community detection. ACM Transactions on Intelligent Systems and Technology (TIST), Vol. 5, 2 (2014), 28.
[26]
David Sánchez and Montserrat Batet. 2012. A new model to compute the information content of concepts from taxonomic knowledge. International Journal on Semantic Web and Information Systems (IJSWIS), Vol. 8, 2 (2012), 34--50.
[27]
James Surowiecki. 2005. The wisdom of crowds . Anchor.
[28]
Chenguang Wang, Yangqiu Song, Ahmed El-Kishky, Dan Roth, Ming Zhang, and Jiawei Han. 2015. Incorporating world knowledge to document clustering via heterogeneous information networks. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1215--1224.
[29]
Pengfei Wang, Jiafeng Guo, and Yanyan Lan. 2014. Modeling retail transaction data for personalized shopping recommendation. In Proceedings of the 23rd ACM international conference on conference on information and knowledge management. ACM, 1979--1982.
[30]
Xiao Wang, Di Jin, Xiaochun Cao, Liang Yang, and Weixiong Zhang. 2016. Semantic Community Identification in Large Attribute Networks. (2016). https://www.aaai.org/ocs/index.php/AAAI/AAAI16/paper/view/11964
[31]
Jaewon Yang, Julian McAuley, and Jure Leskovec. 2013. Community detection in networks with node attributes. In Data Mining (ICDM), 2013 IEEE 13th international conference on. IEEE, 1151--1156.
[32]
Wei Ye, Linfei Zhou, Xin Sun, Claudia Plant, and Christian Böhm. 2017. Attributed Graph Clustering with Unimodal Normalized Cut. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 601--616.
[33]
Yuan Zhang, Elizaveta Levina, Ji Zhu, and others. 2016. Community detection in networks with node features. Electronic Journal of Statistics, Vol. 10, 2 (2016), 3153--3178.

Cited By

View all
  • (2025)Knowledge graph representation learning: A comprehensive and experimental overviewComputer Science Review10.1016/j.cosrev.2024.10071656(100716)Online publication date: May-2025
  • (2024)Community Deception in Attributed NetworksIEEE Transactions on Computational Social Systems10.1109/TCSS.2022.321372211:1(228-237)Online publication date: Feb-2024
  • (2024)Influence Role Recognition and LLM-Based Scholar Recommendation in Academic Social Networks2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA)10.1109/DSAA61799.2024.10722780(1-11)Online publication date: 6-Oct-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WSDM '19: Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining
January 2019
874 pages
ISBN:9781450359405
DOI:10.1145/3289600
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 January 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. community detection
  2. graph clustering
  3. knowledge graph

Qualifiers

  • Research-article

Funding Sources

Conference

WSDM '19

Acceptance Rates

WSDM '19 Paper Acceptance Rate 84 of 511 submissions, 16%;
Overall Acceptance Rate 498 of 2,863 submissions, 17%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)263
  • Downloads (Last 6 weeks)46
Reflects downloads up to 24 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2025)Knowledge graph representation learning: A comprehensive and experimental overviewComputer Science Review10.1016/j.cosrev.2024.10071656(100716)Online publication date: May-2025
  • (2024)Community Deception in Attributed NetworksIEEE Transactions on Computational Social Systems10.1109/TCSS.2022.321372211:1(228-237)Online publication date: Feb-2024
  • (2024)Influence Role Recognition and LLM-Based Scholar Recommendation in Academic Social Networks2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA)10.1109/DSAA61799.2024.10722780(1-11)Online publication date: 6-Oct-2024
  • (2023)Probabilistic Coarsening for Knowledge Graph EmbeddingsAxioms10.3390/axioms1203027512:3(275)Online publication date: 6-Mar-2023
  • (2023)Interpretable Signed Link Prediction With Signed Infomax Hyperbolic GraphIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2021.313903535:4(3991-4002)Online publication date: 1-Apr-2023
  • (2022)Knowledge Graph-Based Framework for Decision Making Process with Limited InteractionMathematics10.3390/math1021398110:21(3981)Online publication date: 26-Oct-2022
  • (2022)Significant Subgraph Detection in Multi-omics Networks for Disease Pathway IdentificationFrontiers in Big Data10.3389/fdata.2022.8946325Online publication date: 22-Jun-2022
  • (2022)New Approaches to Extract Information From Posts on COVID-19 Published on RedditInternational Journal of Information Technology & Decision Making10.1142/S021962202250021321:05(1385-1431)Online publication date: 19-May-2022
  • (2022)Community Detection via Local Learning Based on Generalized Metric With Neighboring RegularizationIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2020.300301952:1(498-510)Online publication date: Jan-2022
  • (2022) i MetaverseKG: I ndustrial Metaverse Knowledge Graph to Promote Interoperability in Design and Engineering Applications IEEE Internet Computing10.1109/MIC.2022.321208526:6(59-67)Online publication date: 1-Nov-2022
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media