Abstract
The amount of data that is being made available on the Web is increasing. This provides business organisations with the opportunity to acquire large datasets in order to offer novel information services or to better market existing products and services. Much of this data is now publicly available (e.g., thanks to initiatives such as Open Government Data). The challenge from a corporate perspective is to make sense of the third party data and transform it so that it can more easily integrate with their existing corporate data or with datasets with a different provenance. This paper presents research-in-progress aimed at semantically transforming raw data on U.K. registered companies. The approach adopted is based on BORO (a 4D foundational ontology and re-engineering method) and the target technological platform is Neo4J (a graph database). The primary challenges encountered are (1) re-engineering the raw data into a 4D ontology and (2) representing the 4D ontology into a graph database. The paper will discuss such challenges and explain the transformation process that is currently being adopted.
Chapter PDF
Similar content being viewed by others
Keywords
References
Jain, P., Hitzler, P., Sheth, A.P., Verma, K., Yeh, P.Z.: Ontology Alignment for Linked Open Data. In: Patel-Schneider, P.F., Pan, Y., Hitzler, P., Mika, P., Zhang, L., Pan, J.Z., Horrocks, I., Glimm, B. (eds.) ISWC 2010, Part I. LNCS, vol. 6496, pp. 402–417. Springer, Heidelberg (2010)
Partridge, C.: Business Objects: Re-Engineering for Re-Use. Butterworth-Heinemann (1996)
Partridge, C., Mitchell, A., de Cesare, S.: Guidelines for developing ontological architectures in modelling and simulation. In: Tolk, A. (ed.) Ontology, Epistemology, & Teleology for Model. & Simulation. ISRL, vol. 44, pp. 27–57. Springer, Heidelberg (2013)
Neo4J, http://www.neo4j.org
McAfee, A., Brynjolfsson, E.: Big Data: The Management Revolution. Harvard Business Review (October 2012)
Minister of State for the Cabinet Office and Paymaster General, Open Data White Paper: Unleashing the Potential, http://data.gov.uk/sites/default/files/Open_data_White_Paper.pdf (last accessed on April 03, 2013)
Pereira, A.L., Appel, A.P.: Modeling and Storing Complex Network with Graph-Tree. New Trends in Databases and Information Systems. Advances in Intelligent Systems and Computing 185, 305–315 (2013)
Sider, T.: Four-Dimensionalism: An Ontology of Persistence and Time. Oxford University Press, USA (2002)
Angles, R., Gutierrez, C.: Survey of Graph Database Models. ACM Computing Surveys 40(1) (2008)
IDEAS Group: The IDEAS Model, http://www.ideasgroup.org/foundation/
Partridge, C., Mitchell, A., de Cesare, S.: Guidelines for Developing Ontological Architectures in Modelling and Simulation. In: Tolk, A. (ed.) Ontology, Epistemology, & Teleology for Model. & Simulation. ISRL, vol. 44, pp. 27–57. Springer, Heidelberg (2013)
Vassiliadis, P.: A Survey of Extract-Transform-Load Technology. International Journal of Data Warehousing and Mining 5(3), 1–27 (2009)
Skoutas, D., Simitsis, A.: Ontology-based Conceptual Design of ETL Processes for both Structured and Semi-structured Data. International Journal on Semantic Web and Information Systems 3(4) (2007)
Shadbolt, N., O’Hara, K., Berners-Lee, T., Gibbins, N., Glaser, H., Hall, W., Schraefel, M.C.: Linked Open Government Data: Lessons from Data.gov.uk. IEEE Intelligent Systems (May/June 2012)
Alani, H., Dupplaw, D., Sheridan, J., O’Hara, K., Darlington, J., Shadbolt, N.R., Tullo, C.: Unlocking the Potential of Public Sector Information with Semantic Web Technology. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L.J.B., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC/ISWC 2007. LNCS, vol. 4825, pp. 708–721. Springer, Heidelberg (2007)
Oberle, D., Ankolekar, A., Hitzler, P., Cimiano, P., Sintek, M., Kiesel, M., Mougouie, B., Baumann, S., Vembu, S., Romanelli, M., Buitelaar, P., Engel, R., Sonntag, D., Reithinger, N., Loos, B., Zorn, H.P., Micelli, V., Porzel, R., Schmidt, C., Weiten, M., Burkhardt, F., Zhou, J.: DOLCE ergo SUMO: On foundational and domain models in the SmartWeb Integrated Ontology (SWIntO). Web Semantics: Science, Services and Agents on the World Wide Web 5, 156–174 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
de Cesare, S., Foy, G., Partridge, C. (2013). Re-engineering Data with 4D Ontologies and Graph Databases. In: Franch, X., Soffer, P. (eds) Advanced Information Systems Engineering Workshops. CAiSE 2013. Lecture Notes in Business Information Processing, vol 148. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38490-5_29
Download citation
DOI: https://doi.org/10.1007/978-3-642-38490-5_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38489-9
Online ISBN: 978-3-642-38490-5
eBook Packages: Computer ScienceComputer Science (R0)