Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2911451.2914807acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Constructing and Mining Web-scale Knowledge Graphs

Published: 07 July 2016 Publication History

Abstract

Recent years have witnessed a proliferation of large-scale knowledge graphs, from purely academic projects such as YAGO to major commercial projects such as Google's Knowledge Graph and Microsoft's Satori. Whereas there is a large body of research on mining homogeneous graphs, this new generation of information networks are highly heterogeneous, with thousands of entity and relation types and billions of instances of those types (graph vertices and edges). In this tutorial, we present the state of the art in constructing, mining, and growing knowledge graphs. The purpose of the tutorial is to equip newcomers to this exciting field with an understanding of the basic concepts, tools and methodologies, open research challenges, as well as pointers to available datasets and relevant literature. Knowledge graphs have become an enabling resource for a plethora of new knowledge-rich applications. Consequently, the tutorial will also discuss the role of knowledge bases in empowering a range of web applications, from web search to social networks to digital assistants. A publicly available knowledge base (Freebase) will be used throughout the tutorial to exemplify the different techniques.

References

[1]
M. Banko, M. J. Cafarella, S. Soderland, M. Broadhead, and O. Etzioni. Open information extraction for the web. In IJCAI, volume 7, pages 2670--2676, 2007.
[2]
A. Bordes, N. Usunier, A. Garcia-Duran, J. Weston, and O. Yakhnenko. Translating embeddings for modeling multi-relational data. In Advances in Neural Information Processing Systems, pages 2787--2795, 2013.
[3]
A. Bordes, J. Weston, R. Collobert, and Y. Bengio. Learning structured embeddings of knowledge bases. In Conference on Artificial Intelligence, number EPFL-CONF-192344, 2011.
[4]
X. Dong, E. Gabrilovich, G. Heitz, W. Horn, N. Lao, K. Murphy, T. Strohmann, S. Sun, and W. Zhang. Knowledge vault: A web-scale approach to probabilistic knowledge fusion. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 601--610. ACM, 2014.
[5]
X. L. Dong, E. Gabrilovich, G. Heitz, W. Horn, K. Murphy, S. Sun, and W. Zhang. From data fusion to knowledge fusion. Proceedings of the VLDB Endowment, 7(10):881--892, 2014.
[6]
A. Fader, S. Soderland, and O. Etzioni. Identifying relations for open information extraction. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 1535--1545. Association for Computational Linguistics, 2011.
[7]
P. G. Ipeirotis and E. Gabrilovich. Quizz: Targeted crowdsourcing with a billion (potential) users. In Proceedings of the 23rd international conference on World Wide Web, pages 143--154. ACM, 2014.
[8]
C. Kemp, J. B. Tenenbaum, T. L. Griffiths, T. Yamada, and N. Ueda. Learning systems of concepts with an infinite relational model. In AAAI, volume 3, page 5, 2006.
[9]
N. Lao, T. Mitchell, and W. W. Cohen. Random walk inference and learning in a large scale knowledge base. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 529--539. Association for Computational Linguistics, 2011.
[10]
M. Mintz, S. Bills, R. Snow, and D. Jurafsky. Distant supervision for relation extraction without labeled data. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2, pages 1003--1011. ACL, 2009.
[11]
M. Nickel, V. Tresp, and H.-P. Kriegel. A three-way model for collective learning on multi-relational data. In Proceedings of the 28th international conference on machine learning (ICML-11), pages 809--816, 2011.
[12]
M. Richardson and P. Domingos. Markov logic networks. Machine learning, 62(1-2):107--136, 2006.
[13]
S. Riedel, L. Yao, B. M. Marlin, and A. McCallum. Relation extraction with matrix factorization and universal schemas. In Joint Human Language Technology Conference/Annual Meeting of the North American Chapter of the ACL (HLT-NAACL '13), ACL, 2013.
[14]
S. Riedel, L. Yao, A. McCallum, and B. M. Marlin. Relation extraction with matrix factorization and universal schemas. In L. Vanderwende, H. D. III, and K. Kirchhoff, editors, HLT-NAACL, pages 74--84. ACL, 2013.
[15]
M. Schmitz, R. Bart, S. Soderland, O. Etzioni, et al. Open language learning for information extraction. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pages 523--534. ACL, 2012.
[16]
M. Surdeanu, J. Tibshirani, R. Nallapati, and C. D. Manning. Multi-instance multi-label learning for relation extraction. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pages 455--465. ACL, 2012.
[17]
R. West, E. Gabrilovich, K. Murphy, S. Sun, R. Gupta, and D. Lin. Knowledge base completion via search-based question answering. In Proceedings of the 23rd international conference on World Wide Web, pages 515--526. ACM, 2014.

Cited By

View all
  • (2023)Knowledge graph embedding by relational rotation and complex convolution for link predictionExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.119122214:COnline publication date: 15-Mar-2023
  • (2019)Cross-platform rating prediction method based on review topicFuture Generation Computer Systems10.1016/j.future.2019.06.021101:C(236-245)Online publication date: 1-Dec-2019
  • (2018)Designing a multilingual knowledge graph as a service for cultural heritageProceedings of the 2018 International Conference on Dublin Core and Metadata Applications10.5555/3308533.3308538(29-40)Online publication date: 10-Sep-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '16: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval
July 2016
1296 pages
ISBN:9781450340694
DOI:10.1145/2911451
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 July 2016

Permissions

Request permissions for this article.

Check for updates

Author Tag

  1. knowledge graphs

Qualifiers

  • Research-article

Conference

SIGIR '16
Sponsor:

Acceptance Rates

SIGIR '16 Paper Acceptance Rate 62 of 341 submissions, 18%;
Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)20
  • Downloads (Last 6 weeks)3
Reflects downloads up to 30 Aug 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Knowledge graph embedding by relational rotation and complex convolution for link predictionExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.119122214:COnline publication date: 15-Mar-2023
  • (2019)Cross-platform rating prediction method based on review topicFuture Generation Computer Systems10.1016/j.future.2019.06.021101:C(236-245)Online publication date: 1-Dec-2019
  • (2018)Designing a multilingual knowledge graph as a service for cultural heritageProceedings of the 2018 International Conference on Dublin Core and Metadata Applications10.5555/3308533.3308538(29-40)Online publication date: 10-Sep-2018
  • (2018)Towards FAIRer Biological Knowledge Networks Using a Hybrid Linked Data and Graph Database ApproachJournal of Integrative Bioinformatics10.1515/jib-2018-002315:3Online publication date: 7-Aug-2018

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media