Unsupervised Deep Cross-Language Entity Alignment

Jiang, Chuanyu; Qian, Yiming; Chen, Lijun; Gu, Yang; Xie, Xia

doi:10.1007/978-3-031-43421-1_1

Chuanyu Jiang¹²,
Yiming Qian¹³,
Lijun Chen¹⁴,
Yang Gu¹⁵ &
…
Xia Xie¹²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 14172))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

1268 Accesses
1 Citations

Abstract

Cross-lingual entity alignment is the task of finding the same semantic entities from different language knowledge graphs. In this paper, we propose a simple and novel unsupervised method for cross-language entity alignment. We utilize the deep learning multi-language encoder combined with a machine translator to encode knowledge graph text, which reduces the reliance on label data. Unlike traditional methods that only emphasize global or local alignment, our method simultaneously considers both alignment strategies. We first view the alignment task as a bipartite matching problem and then adopt the re-exchanging idea to accomplish alignment. Compared with the traditional bipartite matching algorithm that only gives one optimal solution, our algorithm generates ranked matching results which enabled many potentials downstream tasks. Additionally, our method can adapt two different types of optimization (minimal and maximal) in the bipartite matching process, which provides more flexibility. Our evaluation shows, we each scored 0.966, 0.990, and 0.996 Hits@1 rates on the $\textrm{DBP15K}$ dataset in Chinese, Japanese, and French to English alignment tasks. We outperformed the state-of-the-art method in unsupervised and semi-supervised categories. Compared with the state-of-the-art supervised method, our method outperforms 2.6% and 0.4% in Ja-En and Fr-En alignment tasks while marginally lower by 0.2% in the Zh-En alignment task.

C. Jiang and Y. Qian—Contribute equally to this work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Cross-Knowledge Graph Entity Alignment via Neural Tensor Network

Research on Chinese-Korean Entity Alignment Method Combining TransH and GAT

A semi-supervised neighborhood matching model for global entity alignment

Article 21 January 2023

Notes

1.
Our source code is available in https://github.com/chuanyus/UDCEA.

References

Bordes, A., Usunier, N., Garcia-Durán, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. In: Proceedings of the 26th International Conference on Neural Information Processing Systems (NIPS), pp. 2787–2795 (2013)
Google Scholar
Cao, Y., Liu, Z., Li, C., Liu, Z., Li, J., Chua, T.S.: Multi-channel graph neural network for entity alignment. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 1452–1461 (2019)
Google Scholar
Cohen, W.W., Richman, J.: Learning to match and cluster large high-dimensional data sets for data integration. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 575–480 (2002)
Google Scholar
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL): Human Language Technologies, Volume 1 (Long and Short Papers), pp. 4171–4186 (2019)
Google Scholar
Feng, F., Yang, Y., Cer, D., Arivazhagan, N., Wang, W.: Language-agnostic BERT sentence embedding, In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 878–891 (2022)
Google Scholar
Huang, H., et al.: Cross-knowledge-graph entity alignment via relation prediction. Knowl. -Based Syst. 240(15), 107813 (2022)
Article Google Scholar
Jean-Mary, Y.R., Shironoshita, E.P., Kabuka, M.R.: Ontology matching with semantic verification. J. Web Semant. 7(3), 235–251 (2009)
Article Google Scholar
Ji, G., He, S., Xu, L., Liu, K., Zhao, J.: Knowledge graph embedding via dynamic mapping matrix. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics (ACL) and the 7th International Joint Conference on Natural Language Processing (IJCNLP), pp. 687–696 (2015)
Google Scholar
Ji, S., Pan, S., Cambria, E., Marttinen, P., Yu, P.S.: A survey on knowledge graphs: representation, acquisition, and applications. IEEE Trans. Neural Netw. Learn. Syst. 33(2), 494–514 (2021)
Article MathSciNet Google Scholar
Jiang, W., Liu, Y., Deng, X.: Fuzzy entity alignment via knowledge embedding with awareness of uncertainty measure. Neurocomputing 468, 97–110 (2022)
Article Google Scholar
Jiménez-Ruiz, E., Grau, B.C.: LogMap: logic-based and scalable ontology matching. In: The International Semantic Web Conference (ISWC), pp. 273–288 (2001)
Google Scholar
Jonker, R., Volgenant, A.: A shortest augmenting path algorithm for dense and sparse linear assignment problems. Computing 38, 325–340 (1987)
Article MathSciNet MATH Google Scholar
Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. In: 5th International Conference on Learning Representations (ICLR) (2017)
Google Scholar
Kondrak, G.: N-gram similarity and distance. In: International Conference on String Processing and Information Retrieval, pp. 115–126 (2005)
Google Scholar
Li, C., Cao, Y., Hou, L., Shi, J., Li, J., Chua, T.S.: Semi-supervised entity alignment via joint knowledge embedding model and cross-graph model. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 2723–2732 (2019)
Google Scholar
Lin, Y., Liu, Z., Sun, M., Liu, Y., Zhu, X.: Learning entity and relation embeddings for knowledge graph completion. In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, pp. 2181–2187 (2015)
Google Scholar
Liu, F., Chen, M., Roth, D., Collier, N.: Visual pivoting for (unsupervised) entity alignment. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 4257–4266 (2021)
Google Scholar
Liu, X., et al.: SelfKG: self-supervised entity alignment in knowledge graphs. In: Proceedings of the ACM Web Conference (WWW), pp. 860–870 (2022)
Google Scholar
Liu, Y., et al.: RoBERTa: a robustly optimized BERT pretraining approach. arXiv:1907.11692 (2019)
Luo, S., Yu, S.: An accurate unsupervised method for joint entity alignment and dangling entity detection. In: Findings of the Association for Computational Linguistics: ACL 2022, pp. 2330–2339 (2022)
Google Scholar
Mao, X., Wang, W., Wu, Y., Lan, M.: From alignment to assignment: frustratingly simple unsupervised entity alignment. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 2843–2853 (2021)
Google Scholar
Mao, X., Wang, W., Wu, Y., Lan, M.: LightEA: a scalable, robust, and interpretable entity alignment framework via three-view label propagation. In: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 825–838 (2022)
Google Scholar
Mao, X., Wang, W., Xu, H., Lan, M., Wu, Y.: MRAEA: an efficient and robust entity alignment approach for cross-lingual knowledge graph. In: Proceedings of the 13th International Conference on Web Search and Data Mining (WSDM), pp. 420–428 (2020)
Google Scholar
Mao, X., Wang, W., Xu, H., Wu, Y., Lan, M.: Relational reflection entity alignment. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management (CIKM), pp. 1095–1104 (2020)
Google Scholar
Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543 (2014)
Google Scholar
Reimers, N., Gurevych, I.: Sentence-BERT: sentence embeddings using siamese BERT-networks. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 3982–3992 (2019)
Google Scholar
Reimers, N., Gurevych, I.: Making monolingual sentence embeddings multilingual using knowledge distillation. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 4512–4525 (2020)
Google Scholar
Singhal, A.: Introducing the knowledge graph: Things, not strings (2012). https://www.blog.google/products/search/introducing-knowledge-graph-things-not/, eB/OL
Sinkhorn, R.: A relationship between arbitrary positive matrices and doubly stochastic matrices. Ann. Math. Stat. 35(2), 876–879 (1964)
Article MathSciNet MATH Google Scholar
Sun, Z., Hu, W., Li, C.: Cross-lingual entity alignment via joint attribute-preserving embedding. In: The International Semantic Web Conference (ISWC), pp. 628–644 (2017)
Google Scholar
Sun, Z., et al.: Knowledge graph alignment network with gated multi-hop neighborhood aggregation. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 222–229 (2020)
Google Scholar
Sun, Z., et al.: A benchmarking study of embedding-based entity alignment for knowledge graphs. Proc. VLDB Endowment 13(11), 2326–2340 (2020)
Article Google Scholar
Tang, X., Zhang, J., Chen, B., Yang, Y., Chen, H., Li, C.: BERT-INT: a BERT-based interaction model for knowledge graph alignment. In: Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI), pp. 3174–3180 (2021)
Google Scholar
Veličković, P., Cucurull, G., Casanova, A., Romero, A., Liò, P., Bengio, Y.: Graph attention networks. In: 6th International Conference on Learning Representations (ICLR) (2018)
Google Scholar
Wang, Z., Zhang, J., Feng, J., Chen, Z.: Transh: knowledge graph embedding by translating on hyperplanes. In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, pp. 1112–1119 (2014)
Google Scholar
Wang, Z., Lv, Q., Lan, X., Zhang, Y.: Cross-lingual knowledge graph alignment via graph convolutional networks. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 349–357 (2018)
Google Scholar
Wang, Z., Yang, J., Ye, X.: Knowledge graph alignment with entity-pair embedding. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1672–1680. Association for Computational Linguistics (2020)
Google Scholar
Wu, Y., Liu, X., Feng, Y., Wang, Z., Yan, R., Zhao, D.: Relation-aware entity alignment for heterogeneous knowledge graphs. In: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI), pp. 5278–5284 (2019)
Google Scholar
Wu, Y., Liu, X., Feng, Y., Wang, Z., Zhao, D.: Jointly learning entity and relation representations for entity alignment. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 240–249 (2019)
Google Scholar
Xin, K., et al.: Ensemble semi-supervised entity alignment via cycle-teaching. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 4281–4289 (2022)
Google Scholar
Yang, H.W., Zou, Y., Shi, P., Lu, W., Lin, J., Sun, X.: Aligning cross-lingual entities with multi-aspect information. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 4431–4441 (2019)
Google Scholar
Zeng, K., et al.: Interactive contrastive learning for self-supervised entity alignment. In: Proceedings of the 31st ACM International Conference on Information & Knowledge Management (CIKM), pp. 2465–2475 (2022)
Google Scholar
Zhu, B., Bao, T., Han, J., Han, R., Liu, L., Peng, T.: Cross-lingual knowledge graph entity alignment by aggregating extensive structures and specific semantics. J. Ambient Intell. Humanized Comput. 14, 12609–12616 (2022)
Google Scholar
Zhu, Q., et al.: Collective multi-type entity alignment between knowledge graphs. In: Proceedings of The Web Conference 2020 (WWW), pp. 2241–2252 (2020)
Google Scholar
Zhu, Q., Zhou, X., Wu, J., Tan, J., Guo, L.: Neighborhood-aware attentional representation for multilingual knowledge graphs. In: Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI), pp. 1943–1949 (2019)
Google Scholar
Zhu, R., Ma, M., Wang, P.: RAGA: relation-aware graph attention networks for global entity alignment. In: Advances in Knowledge Discovery and Data Mining (PAKDD), pp. 501–513 (2021)
Google Scholar
Zhu, Y., Liu, H., Wu, Z., Du, Y.: Relation-aware neighborhood matching model for entity alignment. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 4749–4756 (2021)
Google Scholar
Zou, X.: A survey on application of knowledge graph. J. Phys. Conf. Ser. 1487(1), 012016 (2020)
Article Google Scholar

Download references

Acknowledgement

This work is supported by Hainan Province Science and Technology Special Fund (No. ZDKJ2021042) and A*STAR under its Artificial Intelligence in Medicine Transformation (AIMx) Program (Award H20C6a0032) National Natural Science Foundation of China (No.62362023).

Author information

Authors and Affiliations

School of Computer Science and Technology, Hainan University, Haikou, China
Chuanyu Jiang & Xia Xie
Institute of High Performance Computing (IHPC), Agency for Science, Technology and Research (A*STAR), 1 Fusionopolis Way, #16-16 Connexis, Singapore, 138632, Republic of Singapore
Yiming Qian
School of Cyberspace Security, Hainan University, Haikou, China
Lijun Chen
School of Information and Communication Engineering, Hainan University, Haikou, China
Yang Gu

Authors

Chuanyu Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Yiming Qian
View author publications
You can also search for this author in PubMed Google Scholar
Lijun Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yang Gu
View author publications
You can also search for this author in PubMed Google Scholar
Xia Xie
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xia Xie .

Editor information

Editors and Affiliations

University of Michigan, Ann Arbor, MI, USA
Danai Koutra
University of Vienna, Vienna, Austria
Claudia Plant
Max Planck Institute for Software Systems, Kaiserslautern, Germany
Manuel Gomez Rodriguez
Politecnico di Torino, Turin, Italy
Elena Baralis
CENTAI, Turin, Italy
Francesco Bonchi

Ethics declarations

This work doesn’t involve any unethical tasks, all the experiment was done in public datasets.

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jiang, C., Qian, Y., Chen, L., Gu, Y., Xie, X. (2023). Unsupervised Deep Cross-Language Entity Alignment. In: Koutra, D., Plant, C., Gomez Rodriguez, M., Baralis, E., Bonchi, F. (eds) Machine Learning and Knowledge Discovery in Databases: Research Track. ECML PKDD 2023. Lecture Notes in Computer Science(), vol 14172. Springer, Cham. https://doi.org/10.1007/978-3-031-43421-1_1

Download citation

DOI: https://doi.org/10.1007/978-3-031-43421-1_1
Published: 18 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-43420-4
Online ISBN: 978-3-031-43421-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the ECML PKDD community (opens in a new tab)

Unsupervised Deep Cross-Language Entity Alignment

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Cross-Knowledge Graph Entity Alignment via Neural Tensor Network

Research on Chinese-Korean Entity Alignment Method Combining TransH and GAT

A semi-supervised neighborhood matching model for global entity alignment

Notes

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Ethics declarations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Subscribe and save

Buy Now

Navigation

Unsupervised Deep Cross-Language Entity Alignment

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Cross-Knowledge Graph Entity Alignment via Neural Tensor Network

Research on Chinese-Korean Entity Alignment Method Combining TransH and GAT

A semi-supervised neighborhood matching model for global entity alignment

Notes

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Ethics declarations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation