Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Online analytical processsing on graph data

Published: 01 January 2020 Publication History

Abstract

Online Analytical Processing (OLAP) comprises tools and algorithms that allow querying multidimensional databases. It is based on the multidimensional model, where data can be seen as a cube such that each cell contains one or more measures that can be aggregated along dimensions. In a “Big Data” scenario, traditional data warehousing and OLAP operations are clearly not sufficient to address current data analysis requirements, for example, social network analysis. Furthermore, OLAP operations and models can expand the possibilities of graph analysis beyond the traditional graph-based computation. Nevertheless, there is not much work on the problem of taking OLAP analysis to the graph data model.
This paper proposes a formal multidimensional model for graph analysis, that considers the basic graph data, and also background information in the form of dimension hierarchies. The graphs in this model are node- and edge-labelled directed multi-hypergraphs, called graphoids, which can be defined at several different levels of granularity using the dimensions associated with them. Operations analogous to the ones used in typical OLAP over cubes are defined over graphoids. The paper presents a formal definition of the graphoid model for OLAP, proves that the typical OLAP operations on cubes can be expressed over the graphoid model, and shows that the classic data cube model is a particular case of the graphoid data model. Finally, a case study supports the claim that, for many kinds of OLAP-like analysis on graphs, the graphoid model works better than the typical relational OLAP alternative, and for the classic OLAP queries, it remains competitive.

References

[1]
R. Angles, A Comparison of Current Graph Database Models, in: Proceedings of ICDE Workshops, Arlington, VA, USA, 2012, pp. 171–177.
[2]
R. Angles, M. Arenas, P. Barceló, A. Hogan, J.L. Reutter and D. Vrgoc, Foundations of modern query languages for graph databases, ACM Comput. Surv. 50(5) (2017), 68:1–68:40.
[3]
R. Angles and C. Gutierrez, Survey of graph database models, ACM Comput. Surv. 40(1) (2008), 1:1–1:39.
[4]
S.-M.-R. Beheshti, B. Benatallah, H.R.M. Nezhad and M. Allahbakhsh, A framework and a language for on-line analytical processing on graphs, in: WISE 2012, Springer, 2012, pp. 213–227.
[5]
C. Chen, X. Yan, F. Zhu, J. Han and P. Yu, Graph OLAP: A multi-dimensional framework for graph data analysis, Knowl. Inf. Syst. 21(1) (2009), 41–63.
[6]
J. Cohen, B. Dolan, M. Dunlap, J. Hellerstein and C. Welton, MAD Skills: New analysis practices for big data, Proceedings of the VLDB Endowment 2(2) (2009), 1481–1492.
[7]
A. Cuzzocrea, L. Bellatreche and I.-Y. Song, Data Warehousing and OLAP over Big Data: Current Challenges and Future Research Directions, in: Proceedings of DOLAP, New York, NY, USA, ACM, 2013, pp. 67–70.
[8]
B. Denis, A. Ghrab and S. Skhiri, A distributed approach for graph-oriented multidimensional analysis, in: IEEE International Conference on Big Data, 2013, pp. 9–16.
[9]
L. Etcheverry and A. Vaisman, QB4OLAP: A vocabulary for OLAP cubes on the semantic web, in: Proceedings of COLD, Boston, USA, 2012. CEUR-WS.org.
[10]
A. Ghrab, O. Romero, S. Skhiri, A.A. Vaisman and E. Zimányi, GRAD: Modeling and Querying Data Warehouses, in: Proceedings of ADBIS, Poitiers, France, 2015, pp. 92–105.
[11]
L.I. Gómez, B. Kuijpers and A.A. Vaisman, Performing OLAP over graph data: Query language, implementation, and a case study, in: Proceedings of BIRTE, Munich, Germany, August 28, 2017, pp. 6:1–6:8.
[12]
L.I. Gómez, B. Kuijpers and A.A. Vaisman, Analytical queries on semantic trajectories using graph databases, TGIS Trans. Geog. Inf. Syst. 23(5) (2019).
[13]
O. Hartig, Reconciliation of RDF* and property graphs, CoRR, abs/1409.3288, 2014.
[14]
H. He and A. Singh, Query language and access methods for graph databases, in: Managing and Mining Graph Data, Vol. 40 of Advances in Database Systems, Springer US, 2010, pp. 125–160.
[15]
R. Kimball, The Data Warehouse Toolkit, J. Wiley and Sons, 1996.
[16]
M.B. Kraiem, J. Feki, K. Khrouf, F. Ravat and O. Teste, Modeling and OLAPing social media: the case of twitter, Social Netw. Analys. Mining 5(1) (2015), 47:1–47:15.
[17]
B. Kuijpers and A.A. Vaisman, An algebra for OLAP, Intelligent Data Analysis 21(5) (2017).
[18]
Q. Qu, F. Zhu, X. Yan, J. Han, P.S. Yu and H. Li, Efficient topological OLAP on information networks, in: Proceedings of DASFAA – Vol. Part I, Springer, 2011, pp. 389–403.
[19]
N.U. Rehman, A. Weiler and M.H. Scholl, OLAPing social media: the case of twitter, in: Advances in Social Networks Analysis and Mining 2013, ASONAM ’13, Niagara, ON, Canada, 2013, pp. 1139–1146.
[20]
I. Robinson, J. Webber and E. Eifrém, Graph Databases, O’Reilly Media, 2013.
[21]
M. Rudolf, H. Voigt, C. Bornhövd and W. Lehner, Synopsys: Foundations for multidimensional graph analytics, in: Proceedings of BIRTE, Hangzhou, China, Revised Selected Papers, September 1, 2014, pp. 159–166.
[22]
B. Tang, S. Han, M.L. Yiu, R. Ding and D. Zhang, Extracting top-k insights from multi-dimensional data, in: Proceedings of ACM SIGMOD, Chicago, IL, USA, May 14–19, 2017, pp. 1509–1524.
[23]
Y. Tian, R.A. Hankins and J.M. Patel, Efficient aggregation for graph summarization, in: Proceedings ACM SIGMOD, ACM, 2008, pp. 567–580.
[24]
A. Vaisman, F. Besteiro and M. Valverde, Modelling and querying star and snowflake warehouses using graph databases, in: Proceedings of ADBIS Conference 2019, Bled, Slovenia, Sept. 8–11, 2019.
[25]
A.A. Vaisman, Publishing OLAP cubes on the semantic web, in: EBIS’15, Lecture Notes in Business Information Processing, Springer, 2015, pp. 32–61.
[26]
A.A. Vaisman and E. Zimányi, Data Warehouse Systems: Design and Implementation, Springer, 2014.
[27]
P. Wang, B. Wu and B. Wang, TSMH graph cube: A novel framework for large scale multi-dimensional network analysis, In: IEEE DSAA, Paris, France, October 19–21, 2015, pp. 1–10.
[28]
Z. Wang, Q. Fan, H. Wang, K.-L. Tan, D. Agrawal and A.E. Abbadi, Pagrol: Parallel graph OLAP over large-scale attributed graphs, in: Proceeding of IEEE ICDE, 2014, pp. 496–507.
[29]
X. Wu, B. Wu and B. Wang, P&D graph cube: Model and parallel materialization for multidimensional heterogeneous network, in: Proceeding of CyberC 2017, Nanjing, China, October 12–14, 2017, pp. 95–104.
[30]
M. Yin, B. Wu and Z. Zeng, HMGraph OLAP: a novel framework for multi-dimensional heterogeneous network analysis, in: Proceedings of DOLAP, ACM, 2012, pp. 137–144.
[31]
P. Zhao, X. Li, D. Xin and J. Han, Graph Cube: on warehousing and OLAP multidimensional networks, in: Proceedings of ACM SIGMOD, ACM, 2011, pp. 853–864.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Intelligent Data Analysis
Intelligent Data Analysis  Volume 24, Issue 3
2020
240 pages

Publisher

IOS Press

Netherlands

Publication History

Published: 01 January 2020

Author Tags

  1. OLAP
  2. data warehousing
  3. graph database
  4. big data
  5. graph aggregation

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

Cited By

View all

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media