Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1754239.1754250acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article

Building data warehouses with semantic data

Published: 22 March 2010 Publication History

Abstract

The Semantic Web has become a new environment that enables organizations to attach semantic annotations taken from ontologies to the information they generate. As a result, large amounts of complex, semi-structured and heterogeneous semantic data repositories are being made available, making necessary new data warehouse tools for analyzing the Semantic Web. In this paper, we present a semi-automatic method for the identification and extraction of valid facts aimed at analyzing semantic data expressed as instance stores in RDF/OWL. The starting point of the method is a multidimensional (MD) star schema (i.e., subject of analysis, dimensions and measures) designed by the analyst by picking up concepts and properties from the ontology. The method exploits the semantics and theoretical foundations of Description Logics to derive valid combinations of instances into fact tuples. Moreover, some specific index structures are applied to the ontology in order to reach scalability and effectiveness.

References

[1]
F. Baader, D. Calvanese, D. L. McGuinness, D. Nardi, and P. F. Patel-Schneider, editors. The Description Logic Handbook: Theory, Implementation, and Applications. Cambridge University Press, 2003.
[2]
C. Chen, X. Yan, F. Zhu, J. Han, and P. S. Yu. Graph OLAP: Towards Online Analytical Processing on Graphs. In ICDM, pages 103--112. IEEE Computer Society, 2008.
[3]
E. F. Codd, S. B. Codd, and C. T. Salley. Providing OLAP (On-Line Analytical Processing) to User Analysts: An IT Mandate. E. F. Codd and Ass., 1993.
[4]
R. Kimball and M. Ross. The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling. Wiley, 2nd edition, April 2002.
[5]
J.-N. Mazón and J. Trujillo. A model driven modernization approach for automatically deriving multidimensional models in data warehouses. In ER, volume 4801 of LNCS, pages 56--71. Springer, 2007.
[6]
V. Nebot and R. Berlanga. Efficient retrieval of ontology fragments using an interval labeling scheme. Inf. Sci., 179(24):4151--4173, 2009.
[7]
V. Nebot, R. Berlanga, J. M. Pérez, M. J. Aramburu, and T. B. Pedersen. Multidimensional Integrated Ontologies: A Framework for Designing Semantic Data Warehouses. JoDS XIII, 5530:1--35, 2009.
[8]
M. Niinimaki and T. Niemi. An ETL Process for OLAP Using RDF/OWL Ontologies. JoDS XIII, 2009.
[9]
J. M. Pérez, R. Berlanga, M. J. Aramburu, and T. B. Pedersen. Integrating Data Warehouses with Web Data: A Survey. IEEE Trans. Knowl. Data Eng., 20(7):940--955, 2008.
[10]
O. Romero and A. Abelló. Automating multidimensional design from ontologies. In DOLAP '07, pages 1--8, New York, NY, USA, 2007. ACM.
[11]
D. Skoutas and A. Simitsis. Ontology-based conceptual design of ETL processes for both structured and semi-structured data. Int. J. Semantic Web Inf. Syst., 3(4): 1--24, 2007.

Cited By

View all
  • (2019)Data Cube Is Dead, Long Life to Data Cube in the Age of Web DataBig Data Analytics10.1007/978-3-030-37188-3_4(44-64)Online publication date: 12-Dec-2019
  • (2015)baconProceedings of the 5th International Conference on Web Intelligence, Mining and Semantics10.1145/2797115.2797126(1-6)Online publication date: 13-Jul-2015
  • (2015)An integrated personalization framework for SaaS-based cloud servicesFuture Generation Computer Systems10.1016/j.future.2015.05.01153:C(157-173)Online publication date: 1-Dec-2015
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
EDBT '10: Proceedings of the 2010 EDBT/ICDT Workshops
March 2010
290 pages
ISBN:9781605589909
DOI:10.1145/1754239
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 March 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. ETL processes
  2. data warehouses
  3. semantic web

Qualifiers

  • Research-article

Conference

EDBT/ICDT '10
EDBT/ICDT '10: EDBT/ICDT '10 joint conference
March 22 - 26, 2010
Lausanne, Switzerland

Acceptance Rates

Overall Acceptance Rate 7 of 10 submissions, 70%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Data Cube Is Dead, Long Life to Data Cube in the Age of Web DataBig Data Analytics10.1007/978-3-030-37188-3_4(44-64)Online publication date: 12-Dec-2019
  • (2015)baconProceedings of the 5th International Conference on Web Intelligence, Mining and Semantics10.1145/2797115.2797126(1-6)Online publication date: 13-Jul-2015
  • (2015)An integrated personalization framework for SaaS-based cloud servicesFuture Generation Computer Systems10.1016/j.future.2015.05.01153:C(157-173)Online publication date: 1-Dec-2015
  • (2014)Model-Driven Data Warehouse AutomationAdvances and Applications in Model-Driven Engineering10.4018/978-1-4666-4494-6.ch011(240-267)Online publication date: 2014
  • (2014)Towards a Configurable Database Design: A Case of Semantic Data WarehousesOn the Move to Meaningful Internet Systems: OTM 2014 Conferences10.1007/978-3-662-45563-0_47(760-767)Online publication date: 2014
  • (2014)Requirements Driven Data Warehouse Design: We Can Go FurtherLeveraging Applications of Formal Methods, Verification and Validation. Specialized Techniques and Applications10.1007/978-3-662-45231-8_49(588-603)Online publication date: 2014
  • (2013)XML Mining for Semantic WebData Mining10.4018/978-1-4666-2455-9.ch031(625-649)Online publication date: 2013
  • (2013)Enriching hierarchies in multidimensional model of data warehouse using WORDNET2013 International Conference on Research and Innovation in Information Systems (ICRIIS)10.1109/ICRIIS.2013.6716725(296-301)Online publication date: Nov-2013
  • (2012)XML Mining for Semantic WebXML Data Mining10.4018/978-1-61350-356-0.ch014(317-342)Online publication date: 2012
  • (2012)Semantic Web Technologies for Business IntelligenceBusiness Intelligence Applications and the Web10.4018/978-1-61350-038-5.ch014(310-339)Online publication date: 2012
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media