DOI: 10.1145/3340531.3412768
Open access

The Enslaved Dataset: A Real-world Complex Ontology Alignment Benchmark using Wikibase

Published: 19 October 2020


Ontology alignment plays a critical role in helping heterogeneous resources interoperate. It has been studied for over a decade, and in that time researchers have developed many alignment systems and methods for finding simple 1:1 equivalence matches between two ontologies. However, very few alignment systems focus on finding complex correspondences, and even those that do leave considerable room for improvement in finding complex relations. One reason for this limitation may be that few applicable alignment benchmarks contain complex relationships of the kind that would attract researchers' interest. In this paper, we propose a real-world dataset from the Enslaved project as a potential complex alignment benchmark. The benchmark consists of two resources, the Enslaved Ontology and a Wikibase repository holding a large amount of instance data from the Enslaved project, together with a manually created reference alignment between them. The alignment was developed in consultation with domain experts in the digital humanities. It includes not only simple 1:1 equivalence correspondences but also more complex m:n equivalence and subsumption correspondences, and it is provided in both Expressive and Declarative Ontology Alignment Language (EDOAL) format and rule syntax. The Enslaved benchmark has been incorporated into the Ontology Alignment Evaluation Initiative (OAEI) 2020 and is completely free for public use, to assist researchers in developing and evaluating their complex alignment algorithms.
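
To make the distinction between simple 1:1 and complex m:n correspondences concrete, the following is a minimal Python sketch of how one alignment entry might be represented. It is an illustration under stated assumptions, not the benchmark's actual data model: the `Correspondence` class, the `src:`/`tgt:` entity names, and the rule-like rendering are all hypothetical, and the output is neither EDOAL nor the rule syntax distributed with the dataset.

```python
from dataclasses import dataclass
from typing import Tuple

# Assumed textual arrows for the two relation types named in the abstract.
RELATIONS = {"equivalence": "<->", "subsumption": "->"}


@dataclass(frozen=True)
class Correspondence:
    """One entry of an alignment between a source and a target ontology."""

    source: Tuple[str, ...]  # entity names on the source side (hypothetical)
    target: Tuple[str, ...]  # entity names on the target side (hypothetical)
    relation: str            # "equivalence" or "subsumption"

    def __post_init__(self) -> None:
        if self.relation not in RELATIONS:
            raise ValueError(f"unknown relation: {self.relation!r}")
        if not self.source or not self.target:
            raise ValueError("both sides need at least one entity")

    @property
    def cardinality(self) -> str:
        # "1" for a single entity on a side, "m"/"n" for several.
        left = "1" if len(self.source) == 1 else "m"
        right = "1" if len(self.target) == 1 else "n"
        return f"{left}:{right}"

    @property
    def is_complex(self) -> bool:
        # Assumption: "complex" means more than one entity on either side.
        return self.cardinality != "1:1"

    def as_rule(self) -> str:
        # Simplified rule-like rendering, for illustration only.
        arrow = RELATIONS[self.relation]
        return f"{' AND '.join(self.source)} {arrow} {' AND '.join(self.target)}"


# Placeholder entities; they do not come from the Enslaved Ontology or Wikibase.
simple = Correspondence(("src:ClassA",), ("tgt:ClassB",), "equivalence")
complex_ = Correspondence(
    ("src:ClassA", "src:propertyP"),
    ("tgt:ClassB", "tgt:propertyQ"),
    "subsumption",
)

for c in (simple, complex_):
    print(c.cardinality, c.is_complex, c.as_rule())
```

Keeping each side as a tuple of entity names lets the same structure cover 1:1, 1:n, m:1, and m:n cases; a real benchmark entry would additionally carry the logical constructors that relate the entities on each side.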

Supplementary Material

MP4 File (3340531.3412768.mp4)
This video presents the resource track paper entitled "The Enslaved Dataset: A Real-world Complex Ontology Alignment Benchmark using Wikibase".






Published In

CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management
October 2020
3619 pages
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]



Association for Computing Machinery

New York, NY, United States

Author Tags

  1. benchmark
  2. knowledge graph
  3. ontology alignment
  4. wikibase


  • Research-article

CIKM '20

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Article Metrics

  • Downloads (Last 12 months): 155
  • Downloads (Last 6 weeks): 26
Reflects downloads up to 29 Jan 2025


Cited By

  • (2023) OAG: Linking Entities Across Large-Scale Heterogeneous Knowledge Graphs. IEEE Transactions on Knowledge and Data Engineering 35:9 (9225-9239). https://doi.org/10.1109/TKDE.2022.3222168. Online publication date: 1-Sep-2023
  • (2023) The Wikibase Approach to the Enslaved.Org Hub Knowledge Graph. The Semantic Web – ISWC 2023 (419-434). https://doi.org/10.1007/978-3-031-47243-5_23. Online publication date: 27-Oct-2023
  • (2023) Whyis 2: An Open Source Framework for Knowledge Graph Development and Research. The Semantic Web (538-554). https://doi.org/10.1007/978-3-031-33455-9_32. Online publication date: 28-May-2023
  • (2022) Develop an Ontology for E-Commerce based on a Web Application to Assist Color-blind people. 2022 2nd International Conference on Digital Futures and Transformative Technologies (ICoDT2) (1-5). https://doi.org/10.1109/ICoDT255437.2022.9787475. Online publication date: 24-May-2022
  • (2022) Methodology for Creating a Community Corpus Using a Wikibase Knowledge Graph. Knowledge Graphs and Semantic Web (285-297). https://doi.org/10.1007/978-3-031-21422-6_21. Online publication date: 13-Nov-2022
  • (2021) Wikibase as an Infrastructure for Knowledge Graphs: The EU Knowledge Graph. The Semantic Web – ISWC 2021 (631-647). https://doi.org/10.1007/978-3-030-88361-4_37. Online publication date: 24-Oct-2021
