Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3106426.3106428acmconferencesArticle/Chapter ViewAbstractPublication PageswiConference Proceedingsconference-collections
research-article

An evaluation of models for runtime approximation in link discovery

Published: 23 August 2017 Publication History

Abstract

Time-efficient link discovery is of central importance to implement the vision of the Semantic Web. Some of the most rapid Link Discovery approaches rely internally on planning to execute link specifications. In newer works, linear models have been used to estimate the runtime of the fastest planners. However, no other category of models has been studied for this purpose so far. In this paper, we study non-linear runtime estimation functions for runtime estimation. In particular, we study exponential and mixed models for the estimation of the runtimes of planners. To this end, we evaluate three different models for runtime on six datasets using 500 link specifications. We show that exponential and mixed models achieve better fits when trained but are only to be preferred in some cases. Our evaluation also shows that the use of better runtime approximation models has a positive impact on the overall execution of link specifications.

References

[1]
Sören Auer, Jens Lehmann, Axel-Cyrille Ngonga Ngomo, and Amrapali Zaveri. 2013. Introduction to linked data and its lifecycle on the web. In Reasoning Web. Semantic Technologies for Intelligent Data Access. Springer, 1--90.
[2]
Ron Avnur and Joseph M. Hellerstein. 2000. Eddies: Continuously Adaptive Query Processing. In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data (SIGMOD '00). ACM, New York, NY, USA, 261--272.
[3]
Surajit Chaudhuri. 1998. An overview of query optimization in relational systems. In Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems (PODS '98). ACM, 34--43.
[4]
Richard L. Cole and Goetz Graefe. 1994. Optimization of Dynamic Query Evaluation Plans. In Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data (SIGMOD '94). ACM, New York, NY, USA, 150--160.
[5]
Pierre Courrieu. 2008. Fast computation of Moore-Penrose inverse matrices. arXiv preprint arXiv:0804.4809 (2008).
[6]
Goetz Graefe. 1993. Query Evaluation Techniques for Large Databases. ACM Comput. Surv. 25, 2 (June 1993), 73--169.
[7]
Yannis E. Ioannidis, Raymond T. Ng, Kyuseok Shim, and Timos K. Sellis. 1992. Parametric Query Optimization. In Proceedings of the 18th International Conference on Very Large Data Bases (VLDB '92). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 103--114.
[8]
R. Isele, A. Jentzsch, and C. Bizer. 2011. Efficient Multidimensional Blocking for Link Discovery without losing Recall. In WebDB.
[9]
Zachary G. Ives, Daniela Florescu, Marc Friedman, Alon Levy, and Daniel S. Weld. 1999. An Adaptive Query Execution System for Data Integration. In Proceedings of the 1999 ACM SIGMOD International Conference on Management of Data (SIGMOD '99). ACM, New York, NY, USA, 299--310.
[10]
Hanna Köpcke, Andreas Thor, and Erhard Rahm. 2010. Evaluation of entity resolution approaches on real-world match problems. PVLDB 3, 1 (2010), 484--493.
[11]
Jorge J Moré. 1978. The Levenberg-Marquardt algorithm: implementation and theory. In Numerical analysis. Springer, 105--116.
[12]
Axel-Cyrille Ngonga Ngomo and Klaus Lyko. 2012. Eagle: Efficient active learning of link specifications using genetic programming. In The Semantic Web: Research and Applications. Springer, 149--163.
[13]
Axel-Cyrille Ngonga Ngomo. 2012. On Link Discovery using a Hybrid Approach. Journal on Data Semantics 1, 4 (December 2012), 203 -- 217.
[14]
Axel-Cyrille Ngonga Ngomo. 2014. HELIOS - Execution Optimization for Link Discovery. In The Semantic Web - ISWC 2014 - 13th International Semantic Web Conference, Riva del Garda, Italy, October 19-23, 2014. Proceedings, Part I. Springer, 17--32.
[15]
Andriy Nikolov, Mathieu d'Aquin, and Enrico Motta. 2012. Unsupervised learning of link discovery configuration. In 9th Extended Semantic Web Conference (ESWC 2012). http://oro.open.ac.uk/33434/
[16]
Xing Niu, Shu Rong, Yunlong Zhang, and Haofen Wang. 2011. Zhishi.links results for OAEI 2011. In OM.
[17]
Muhammad Saleem, Maulik R Kamdar, Aftab Iqbal, Shanmukha Sampath, Helena F Deus, and Axel-Cyrille Ngonga Ngomo. 2014. Big linked cancer data: Integrating linked tcga and . Web Semantics: Science, Services and Agents on the World Wide Web 27 (2014), 34--41.
[18]
Abraham Silberschatz, Henry Korth, and S. Sudarshan. 2006. Database Systems Concepts (5 ed.). McGraw-Hill, Inc., New York, NY, USA.
[19]
Immanuel Trummer and Christoph Koch. 2014. Multi-objective Parametric Query Optimization. Proc. VLDB Endow. 8, 3 (Nov. 2014), 221--232.
[20]
Tolga Urhan, Michael J. Franklin, and Laurent Amsaleg. 1998. Cost-based Query Scrambling for Initial Delays. In Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data (SIGMOD '98). ACM, New York, NY, USA, 130--141.
[21]
Chuan Xiao, Wei Wang, and Xuemin Lin. 2008. Ed-Join: an efficient algorithm for similarity joins with edit distance constraints. Proceedings of the VLDB Endowment 1, 1 (2008), 933--944.
[22]
Chuan Xiao, Wei Wang, and Xuemin Lin. 2008. Ed-Join: An Efficient Algorithm for Similarity Joins with Edit Distance Constraints. Proc. VLDB Endow. 1, 1 (Aug. 2008), 933--944.
[23]
Chuan Xiao, Wei Wang, Xuemin Lin, and Jeffrey Xu Yu. 2008. Efficient Similarity Joins for Near Duplicate Detection. In Proceedings of the 17th International Conference on World Wide Web (WWW '08). ACM, New York, NY, USA, 131--140.

Cited By

View all
  • (2019)A Probabilistic Algorithm to Predict Missing Facts from Knowledge GraphsDatabase and Expert Systems Applications10.1007/978-3-030-27615-7_11(149-158)Online publication date: 3-Aug-2019
  • (2018)Dynamic Planning for Link DiscoveryThe Semantic Web10.1007/978-3-319-93417-4_16(240-255)Online publication date: 3-Jun-2018

Index Terms

  1. An evaluation of models for runtime approximation in link discovery

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      WI '17: Proceedings of the International Conference on Web Intelligence
      August 2017
      1284 pages
      ISBN:9781450349512
      DOI:10.1145/3106426
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 23 August 2017

      Permissions

      Request permissions for this article.

      Check for updates

      Badges

      • Best Student Paper

      Author Tags

      1. link discovery
      2. link specifications
      3. runtime approximation
      4. taylor series

      Qualifiers

      • Research-article

      Funding Sources

      • HOBBIT, QAMEL, SAKE

      Conference

      WI '17
      Sponsor:

      Acceptance Rates

      WI '17 Paper Acceptance Rate 118 of 178 submissions, 66%;
      Overall Acceptance Rate 118 of 178 submissions, 66%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 21 Sep 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2019)A Probabilistic Algorithm to Predict Missing Facts from Knowledge GraphsDatabase and Expert Systems Applications10.1007/978-3-030-27615-7_11(149-158)Online publication date: 3-Aug-2019
      • (2018)Dynamic Planning for Link DiscoveryThe Semantic Web10.1007/978-3-319-93417-4_16(240-255)Online publication date: 3-Jun-2018

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media