research-article

A Joint Model for Representation Learning of Tibetan Knowledge Graph Based on Encyclopedia

Authors:

Xiaobing Zhao

Transactions on Asian and Low-Resource Language Information Processing, Volume 20, Issue 2

Article No.: 27, Pages 1 - 17

https://doi.org/10.1145/3447248

Published: 30 March 2021

Abstract

Learning representations of knowledge graphs is critical to natural language processing. English knowledge graph representation has been studied extensively. For low-resource languages such as Tibetan, however, how to represent sparse knowledge graphs remains a key problem. In this article, to address the scarcity of Tibetan knowledge graphs, we extend the Tibetan knowledge graph with triples from high-resource language knowledge graphs and with Point of Information map information. To improve representation learning of the Tibetan knowledge graph, we propose a joint model that merges structure and entity description information, based on the Translating Embeddings (TransE) and Convolutional Neural Network (CNN) models. In addition, to address segmentation errors, we use both character and word embeddings to capture more complex information in Tibetan. Finally, experimental results show that our model represents the Tibetan knowledge graph better than the baseline.
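As a rough illustration of how a translation-based joint model can combine structure and description information, the sketch below computes the standard TransE energy ||h + r - t|| and sums it over structure-based and description-based entity vectors. The summed form follows the joint energy used by Xie et al. (2016) for entity descriptions; that the model in this article uses the same form is an assumption, since the abstract does not give the formula. The names `transe_energy`, `joint_energy`, and `margin_loss` are hypothetical, and the description vectors `h_d` and `t_d` stand in for the output of a CNN encoder that is not implemented here.

```python
import math
from typing import Sequence

Vector = Sequence[float]


def transe_energy(h: Vector, r: Vector, t: Vector) -> float:
    """TransE energy ||h + r - t||_2; lower means a more plausible triple."""
    return math.sqrt(sum((hi + ri - ti) ** 2 for hi, ri, ti in zip(h, r, t)))


def joint_energy(h_s: Vector, t_s: Vector,
                 h_d: Vector, t_d: Vector,
                 r: Vector) -> float:
    """Assumed joint energy: one structure term plus three description terms.

    h_s, t_s: structure-based head and tail embeddings.
    h_d, t_d: description-based embeddings (placeholders for CNN outputs).
    """
    e_struct = transe_energy(h_s, r, t_s)
    e_desc = (transe_energy(h_d, r, t_d)
              + transe_energy(h_d, r, t_s)
              + transe_energy(h_s, r, t_d))
    return e_struct + e_desc


def margin_loss(pos_energy: float, neg_energy: float, margin: float = 1.0) -> float:
    """Margin-based ranking loss between a true triple and a corrupted one."""
    return max(0.0, margin + pos_energy - neg_energy)


if __name__ == "__main__":
    # Toy 2-dimensional vectors, chosen only for illustration.
    h, r, t = [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]
    t_corrupt = [4.0, 5.0]
    pos = joint_energy(h, t, h, t, r)
    neg = joint_energy(h, t_corrupt, h, t_corrupt, r)
    print(pos, neg, margin_loss(pos, neg))
```

Sharing one relation vector `r` across all four terms keeps the structure-based and description-based entity vectors in the same space, which is what allows the two information sources to be trained jointly under a single ranking loss.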

Index Terms

1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
  2. Machine learning
    1. Machine learning approaches
      1. Neural networks

Published In

ACM Transactions on Asian and Low-Resource Language Information Processing Volume 20, Issue 2

March 2021

313 pages

ISSN:2375-4699

EISSN:2375-4702

DOI:10.1145/3454116

Editor:
Imed Zitouni
Google, USA

Copyright © 2021 Association for Computing Machinery.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 March 2021

Accepted: 01 January 2021

Revised: 01 January 2021

Received: 01 January 2020

Published in TALLIP Volume 20, Issue 2

Qualifiers

Research-article
Refereed

Funding Sources

National Natural Science Foundation of China
