Abstract
Nowadays, reasoning over knowledge graphs (KGs) has been widely adapted to empower retrieval systems, recommender systems, and question answering systems, generating a surge in research interest. However, recently developed reasoning methods usually lack interpretability, and can hardly tackle the large-scale action space problem over KGs. Inspired by the ability of human hierarchical decision making, we propose a multi-hop reasoning framework with deep reinforcement learning (RL) to fill this gap, which incorporates meta information into hierarchical reasoning over KGs. We first use optimization-based meta learning method to initialize parameters for RL agents, allowing for efficient adaptation for tasks in a few gradient steps. Then, a hierarchical RL framework is designed to decompose reasoning tasks into several sub-tasks and solve them separately, performed more efficient and natural than directly solving the entire problem. We further evaluated our model through different tasks on five real world datasets. The experimental results indicate that our method outperforms state-of-the-art baseline models without losing interpretability.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Fang Y, Wang H, Zhao L, Yu F, Wang C (2020) Dynamic knowledge graph based fake-review detection. Appl Intell 50(12):4281–4295
Li Z, Liu H, Zhang Z, Liu T, Xiong NN (2021) Learning knowledge graph embedding with heterogeneous relation attention networks. IEEE Transactions on Neural Networks and Learning Systems PP(99)
Zhang Z, Li Z, Liu H, Xiong NN (2020) Multi-scale dynamic convolutional network for knowledge graph embedding. IEEE Trans Knowl Data Eng PP(99):1–1
Wang S, Du Z, Ding M, Rodríguez-patón A, Song T (2022) KG-DTI: a knowledge graph based deep learning method for drug-target interaction predictions and alzheimer’s disease drug repositions. Appl Intell 52(1):846–857
Bollacker KD, Evans C, Paritosh PK, Sturge T, Taylor J (2008) Freebase: a collaboratively created graph database for structuring human knowledge. In: Proceedings of the ACM SIGMOD international conference on management of data, SIGMOD 2008, June 10-12, 2008. ACM, pp 1247–1250
Vrandečić D, Krötzsch M (2014) Wikidata: a free collaborative knowledgebase. Commun ACM 57(10):78–85
Liu H, Zheng C, Li D, Shen X, Lin K, Wang J, Zhang Z, Zhang Z, Xiong NN (2022) EDMF: Efficient deep matrix factorization with review feature learning for industrial recommender system. IEEE Trans Ind Inform 18(7):4361–4371
Yi X, Mingjing L, Junyong L, Xiaohui C, Gang Z (2022) Iterative rule-guided reasoning over sparse knowledge graphs with deep reinforcement learning. Information Processing & Management 59(5):103040. https://doi.org/10.1016/j.ipm.2022.103040, https://www.sciencedirect.com/science/article/pii/S0306457322001492
Bordes A, Usunier N, Garcia-Durán A, Weston J, Yakhnenko O (2013) Translating embeddings for modeling multi-relational data. In: Proceedings of the 26th International Conference on Neural Information Processing Systems, NIPS 2013 - vol 2. Curran Associates Inc, pp 2787–2795
Sun Z, Deng Z, Nie J, Tang J (2019) Rotate: Knowledge graph embedding by relational rotation in complex space. In: 7Th international conference on learning representations, ICLR 2019, may 6-9. Openreview.net, 2019
Vashishth S, Sanyal S, Nitin V, Agrawal N, Talukdar PP (2020) Interacte: Improving convolution-based knowledge graph embeddings by increasing feature interactions. In: The thirty-fourth AAAI conference on artificial intelligence, AAAI 2020, february 7-12, 2020. AAAI Press, pp 3009–3016
Feng J, Wei Q, Cui J, Chen J (2022) Novel translation knowledge graph completion model based on 2d convolution. Appl Intell 52(3):3266–3275
Galárraga LA, Teflioudi C, Hose K, Suchanek FM (2013) AMIE: Association rule mining under incomplete evidence in ontological knowledge bases. In: The 22nd international world wide web conference, WWW 2013, may 13-17, 2013. International world wide web conferences steering committee / ACM, pp 413– 422
Lei D, Jiang G, Gu X, Sun K, Mao Y, Ren X (2020) Learning collaborative agents with rule guidance for knowledge graph reasoning. In: Proceedings of the 2020 conference on empirical methods in natural language processing, EMNLP 2020, November 16-20, 2020. Assoc Comput Linguist, pp 8541–8547
Meilicke C, Chekol MW, Fink M, Stuckenschmidt H (2020) Reinforced anytime bottom up rule learning for knowledge graph completion. arXiv:2004.04412
Wang H, Jiang S, Yu Z (2020) Modeling of complex internal logic for knowledge base completion. Appl Intell 50(10):3336–3349
Hildebrandt M, Serna JAQ, Ma Y, Ringsquandl M, Joblin M, Tresp V (2020) Reasoning on knowledge graphs with debate dynamics. In: The 34th AAAI conference on artificial intelligence, AAAI 2020, february 7-12, 2020. AAAI Press, pp 4123– 4131
Wan G, Du B (2021) Gaussianpath: a bayesian multi-hop reasoning framework for knowledge graph reasoning. In: Thirty-fifth AAAI conference on artificial intelligence, AAAI 2021, february 2-9, 2021. AAAI Press, pp 4393–4401
Das R, Dhuliawala S, Zaheer M, Vilnis L, Durugkar I, Krishnamurthy A, Smola A, McCallum A (2018) Go for a walk and arrive at the answer: Reasoning over paths in knowledge bases using reinforcement learning. In: The 6th international conference on learning representations, ICLR 2018, april 30 - may 3, 2018, conference track proceedings. Openreview.net
Lv X, Han X, Hou L, Li J, Liu Z, Zhang W, Zhang Y, Kong H, Wu S (2020) Dynamic anticipation and completion for multi-hop reasoning over sparse knowledge graph. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, EMNLP 2020, November 16-20, 2020. Association for Computational Linguistics, pp 5694–5703
Hou Z, Jin X, Li Z, Bai L (2021) Rule-aware reinforcement learning for knowledge graph reasoning. In: Findings of the association for computational linguistics: ACL/IJCNLP 2021, august 1-6, 2021. Association for computational linguistics, pp 4687–4692
Chen L, Cui J, Tang X, Qian Y, Li Y, Zhang Y (2022) Rlpath: a knowledge graph link prediction method using reinforcement learning based attentive relation path searching and representation learning. Appl Intell 52(4):4715–4726
Wan G, Pan S, Gong C, Zhou C, Haffari G (2020) Reasoning like human: Hierarchical reinforcement learning for knowledge graph reasoning. In: Proceedings of the twenty-ninth international joint conference on artificial intelligence, IJCAI 2020, Online, pp 1926–1932
Wang Q, Mao Z, Wang B, Guo L (2017) Knowledge graph embedding: a survey of approaches and applications. IEEE Trans Knowl Data Eng 29(12):2724–2743
Nickel M, Tresp V, Kriegel H (2011) A three-way model for collective learning on multi-relational data. In: Proceedings of the 28th international conference on machine learning, ICML 2011, June 28 - July 2, 2011. Omnipress, pp 809–816
Dettmers T, Minervini P, Stenetorp P, Riedel S (2018) Convolutional 2d knowledge graph embeddings. In: Proceedings of the 32th AAAI Conference on Artificial Intelligence, AAAI 2018, February 2-7, 2018. AAAI Press, pp 1811–1818
Omran PG, Wang K, Wang Z (2018) Scalable rule learning via learning representation. In: Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI 2018, July 13-19, 2018, Stockholm, Sweden, pp 2149– 2155
Yang F, Yang Z, Cohen WW (2017) Differentiable learning of logical rules for knowledge base reasoning. In: Advances in neural information processing systems 30: Annual conference on neural information processing systems 2017, december 4-9, 2017, Long Beach, CA, USA, pp 2319–2328
Xiong W, Hoang T, Wang WY (2017) Deeppath: a reinforcement learning method for knowledge graph reasoning. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017, September 9-11, 2017. Association for Computational Linguistics, pp 564–573
Fu C, Chen T, Qu M, Jin W, Ren X (2019) Collaborative policy learning for open knowledge graph reasoning. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing, EMNLP 2019, November 3-7, 2019. Association for Computational Linguistics, pp 2672–2681
Lin XV, Socher R, Xiong C (2018) Multi-hop knowledge graph reasoning with reward shaping. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018, October 31 - November 4, 2018. Association for Computational Linguistics, pp 3243–3253
Niu G, Zhang Y, Li B, Cui P, Liu S, Li J, Zhang X (2020) Rule-Guided Compositional representation learning on knowledge graphs. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 34, pp 2950–2958
Carlson A, Betteridge J, Kisiel B, Settles B Jr, ERH, Mitchell TM (2010) Toward an architecture for never-ending language learning. In: Proceedings of the 24th AAAI conference on artificial intelligence, AAAI 2010, July 11-15, 2010. AAAI Press
Maas AL, Hannun AY, Ng AY, et al. (2013) Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of the 30th international conference on machine learning, ICML 2013, vol 30. Citeseer, p 3
Finn C, Abbeel P, Levine S (2017) Model-agnostic meta-learning for fast adaptation of deep networks. In: Proceedings of the 34th International Conference on Machine Learning, ICML 2017, 6-11 August 2017. Proceedings of machine learning research, vol 70. PMLR, pp 1126–1135
Williams RJ (1992) Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach Learn 8(3-4):229–256
Toutanova K, Chen D, Pantel P, Poon H, Choudhury P, Gamon M (2015) Representing text for joint embedding of text and knowledge bases. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP 2015, September 17-21, 2015. The Association for Computational Linguistics, pp 1499–1509
Trouillon T, Welbl J, Riedel S, Gaussier É, Bouchard G (2016) Complex embeddings for simple link prediction. In: Proceedings of the 33nd International Conference on Machine Learning, ICML 2016, June 19-24, 2016. JMLR Workshop and Conference Proceedings, vol 48. New York, NY, USA pp 2071–2080
Kingma DP, Ba J (2015) Adam: a method for stochastic optimization. In: The 3rd international conference on learning representations, ICLR 2015, may 7-9 2015. Conference Track Proceedings, San Diego, CA, USA
Acknowledgements
The authors would like to express appreciation for the financial support provided by the National Natural Science Foundation of China (41801313) and the Science and technology project of Henan Province (222102210081, 222300420590).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Xia, Y., Luo, J., Lan, M. et al. Reason more like human: Incorporating meta information into hierarchical reinforcement learning for knowledge graph reasoning. Appl Intell 53, 13293–13308 (2023). https://doi.org/10.1007/s10489-022-04147-2
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-04147-2