Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Specifying OLAP Cubes on XML Data

Published: 02 December 2001 Publication History

Abstract

On-Line Analytical Processing (OLAP) enables analysts to gain insight about data through fast and interactive access to a variety of possible views on information, organized in a dimensional model. The demand for data integration is rapidly becoming larger as more and more information sources appear in modern enterprises. In the data warehousing approach, selected information is extracted in advance and stored in a repository, yielding good query performance. However, in many situations a logical (rather than physical) integration of data is preferable. Previous web-based data integration efforts have focused almost exclusively on the logical level of data models, creating a need for techniques focused on the conceptual level. Also, previous integration techniques for web-based data have not addressed the special needs of OLAP tools such as handling dimensions with hierarchies. Extensible Markup Language (XML) is fast becoming the new standard for data representation and exchange on the World Wide Web. The rapid emergence of XML data on the web, e.g., business-to-business (B2B) e-commerce, is making it necessary for OLAP and other data analysis tools to handle XML data as well as traditional data formats.
Based on a real-world case study, this paper presents an approach to specification of OLAP DBs based on web data. Unlike previous work, this approach takes special OLAP issues such as dimension hierarchies and correct aggregation of data into account. Also, the approach works on the conceptual level, using Unified Modeling Language (UML) as a basis for so-called UML snowflake diagrams that precisely capture the multidimensional structure of the data. An integration architecture that allows the logical integration of XML and relational data sources for use by OLAP tools is also presented.

References

[1]
Abiteboul, S. (1997). Querying Semistructured Data. In Proceeding of the Sixth International Conference on Database Theory (pp. 1-18).]]
[2]
Abiteboul, S. et al. (1999). Tools for Data Translation and Integration. Data Engineering Bulletin, 22(1), 3-8.]]
[3]
Bonifati, A. et al. (2000). Comparative Analyses of Five XML Query Languages. SIGMOD Record, 29(1), 68-79.]]
[4]
Cattell, R. (2000). The Object Database Standard: ODMG 3.0. San Mateo, CA: Morgan-Kaufmann.]]
[5]
Chamberlin, D. et al. (2000). Quilt: An XML Query Language for Heterogeneous Data Sources. In Proceedings of the Third International Workshop on the Web and Databases (pp. 53-62).]]
[6]
Computer Associates Corporation. (2001). ERwin Product Brochure. www.cai.com/products/alm/erwin/ erwin_pd.pdf]]
[7]
Deutsch, A. et al. (1999). Storing Semistructured Data with STORED. In Proceedings of ACM SIGMOD Conference (pp 431-442).]]
[8]
Fernandez, M. F. et al. (2000). Declarative Specification of Web Sites with Strudel. VLDB Journal, 9(1), 38-55.]]
[9]
Florescu, D. and Kossmann, D. (1999). Storing and Querying XML Data using and RDMBS. Data Engineeing Bulletin, 22(3), 27-34.]]
[10]
Gamma, E. et al. (1995). Design Patterns. Reading, MA: Addison-Wesley.]]
[11]
Garcia-Molina, H. et al. (1997). The TSIMMIS Approach to Mediation: Data Models and Languages. Journal of Intelligent Information Systems, 8(2), 117-132.]]
[12]
Gray, J. et al. (1997). Data Cube: A Relational Aggregation Operator Generalizing Group-by, Cross-Tab, and Sub Totals, Data Mining and Knowledge Discovery, 1(1), 29-53.]]
[13]
Hellerstein, J.M. et al. (1999). Independent, Open Enterprise Data Integration. Data Engineering Bulletin, 22(1), 43-49.]]
[14]
Hyperion Corporation. (2001). Hyperion Essbase OLAP 6. www.hyperion.com/essbaseolap.cfm]]
[15]
Jensen, M.R., Møller, T.H., and Pedersen T.B. (2001a). Converting XML Data to UML Diagrams For Conceptual Data Integration. In Proceedings of the First International Workshop on Data Integration Over The Web (pp. 17-31).]]
[16]
Jensen, M.R., Møller, T.H., and Pedersen, T.B. (2001b). Specifying OLAP Cubes on XML Data. Technical Report R-01-5003, Department of Computer Science, Aalborg University, 22 p.]]
[17]
Kimball, R. et al. (1998). The Data Warehouse Lifecycle Toolkit. New York: Wiley.]]
[18]
Kimball, R. (1996). The Data Warehouse Toolkit. New York: Wiley.]]
[19]
Lahiri, T. et al. (1999). Ozone: Integrating Structured and Semistructured Data. In Proceedings of the Seventh International Conference on Database Programming Languages (pp. 297-323).]]
[20]
Lenz, H. and Shoshani, A. (1997). Summarizability in OLAP and Statistical Databases. In Proceedings of the Ninth International Conference on Statistical and Scientific Database Management (pp. 39-48).]]
[21]
Melton, J. et al. (1995). Understanding the New SQL: A Complete Guide. San Mateo, CA: Morgan-Kaufmann.]]
[22]
Microsoft Corporation. (2001). SQL Server 2000 Analysis Services White Paper. www.microsoft.com/sql/ evaluation/compare/analysisservicesWP.asp]]
[23]
Object Management Group. (2001). OMG Unified Modeling Language Specification 1.3. www.rational.com/uml/ resources/documentation/index.jsp]]
[24]
Oracle Corporation. (2001). Oracle Express OLAP. www.oracle.com/ip/analyze/warehouse/bus_intell/index.html]]
[25]
Pedersen, T.B. et al. (1999). Extending Practical Pre-Aggregation in On-Line Analytical Processing. In Proceedings of the Twenty-Fifth International Conference on Very Large Databases (pp. 663-674).]]
[26]
Pedersen, T.B. et al. (2000). Extending OLAP Querying to External Object Databases. In Proceedings of the Ninth International Conference on Information and Knowledge Management (pp. 405-413).]]
[27]
Pinnock, J. et al. (2000). Professional XML. Chicago, IL: Wrox Press.]]
[28]
Rafanelli, M. et al. (1990). STORM: A Statistical Object Representation Model. In Proceedings of the Fifth Conference on Statistical and Scientific Database Management (pp. 14-29). Heidelberg, Germany: Springer Verlag, 1990.]]
[29]
Roth, M.T. et al. (1996). The Garlie Project. In Proceedings of ACM SIGMOD Conference (p. 557). New York, NY.]]
[30]
Shanmugasundaram, J. et al. (1999). Relational Databases for Querying XML Documents: Limitations and Opportunities. In Proceedings of the Twenty-Fifth International Conference on Very Large Databases (pp. 302-314).]]
[31]
Silicon Integration Initiative (SII). (2001). The Electronic Component Information Exchange QuickData Architecture . www.-si2.org/ecix/]]
[32]
Thomsen, E. et al. (1999). Microsoft OLAP Solutions. New York, NY: Wiley.]]
[33]
Thomsen, E. (1997). OLAP Solutions: Building Multidimensional Information System. New York, NY: Wiley.]]
[34]
World Wide Web Consortium (W3C) (2001a). Extensible Markup Language (XML) 1.0 (Second Edition), W3C Recommendation. www.w3.org/TR/2000/REC-xml-20001006.]]
[35]
World Wide Web Consortium (W3C) (2001b). XML Schema, W3C Candidate Recommendation. www.w3.org/ XML/Schema.html]]

Cited By

View all
  • (2017)XML OLAP Cube in the Cloud Towards the DWaaSInternational Journal of Cloud Applications and Computing10.4018/IJCAC.20170101037:1(47-56)Online publication date: 1-Jan-2017
  • (2016)Visual Spatial-OLAP for Vehicle Recorder Data on Micro-sized Electric VehiclesProceedings of the 20th International Database Engineering & Applications Symposium10.1145/2938503.2938532(358-363)Online publication date: 11-Jul-2016
  • (2013)Big dataProceedings of the 17th International Database Engineering & Applications Symposium10.1145/2513591.2527071(198-203)Online publication date: 9-Oct-2013
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Journal of Intelligent Information Systems
Journal of Intelligent Information Systems  Volume 17, Issue 2-3
December 2001
217 pages

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 02 December 2001

Author Tags

  1. OLAP
  2. XML
  3. data integration
  4. data warehousing
  5. multidimensional databases

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 17 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2017)XML OLAP Cube in the Cloud Towards the DWaaSInternational Journal of Cloud Applications and Computing10.4018/IJCAC.20170101037:1(47-56)Online publication date: 1-Jan-2017
  • (2016)Visual Spatial-OLAP for Vehicle Recorder Data on Micro-sized Electric VehiclesProceedings of the 20th International Database Engineering & Applications Symposium10.1145/2938503.2938532(358-363)Online publication date: 11-Jul-2016
  • (2013)Big dataProceedings of the 17th International Database Engineering & Applications Symposium10.1145/2513591.2527071(198-203)Online publication date: 9-Oct-2013
  • (2010)OLAP operators for complex object data cubesProceedings of the 14th east European conference on Advances in databases and information systems10.5555/1885872.1885884(103-116)Online publication date: 20-Sep-2010
  • (2010)A secure multiparty computation privacy preserving OLAP framework over distributed XML dataProceedings of the 2010 ACM Symposium on Applied Computing10.1145/1774088.1774447(1666-1673)Online publication date: 22-Mar-2010
  • (2009)An ETL process for OLAP using RDF/OWL ontologiesJournal on Data Semantics XIII10.5555/2172259.2172263(97-119)Online publication date: 1-Jan-2009
  • (2009)A relational data harmonization approach to XMLJournal of Information Science10.1177/016555150910423135:5(571-601)Online publication date: 1-Oct-2009
  • (2008)Expressing OLAP operators with the TAX XML algebraProceedings of the 2008 EDBT workshop on Database technologies for handling XML information on the web10.1145/1416691.1416703(61-66)Online publication date: 25-Mar-2008
  • (2008)Two approaches to the integration of heterogeneous data warehousesDistributed and Parallel Databases10.1007/s10619-007-7022-z23:1(69-97)Online publication date: 1-Feb-2008
  • (2007)Ix-cubesProceedings of the sixteenth ACM conference on Conference on information and knowledge management10.1145/1321440.1321572(905-908)Online publication date: 6-Nov-2007
  • Show More Cited By

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media