Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1031763.1031777acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

A new OLAP aggregation based on the AHC technique

Published: 12 November 2004 Publication History

Abstract

Nowadays, decision support systems are evolving in order to handle complex data. Some recent works have shown the interest of combining on-line analysis processing (OLAP) and data mining. We think that coupling OLAP and data mining would provide excellent solutions to treat complex data. To do that, we propose an enhanced OLAP operator based on the agglomerative hierarchical clustering (AHC). The here proposed operator, called <i>OpAC</i> (Operator for Aggregation by Clustering) is able to provide significant aggregates of facts refereed to complex objects. We complete this operator with a tool allowing the user to evaluate the best partition from the AHC results corresponding to the most interesting aggregates of facts.

References

[1]
S. Chaudhuri. Data Mining and Database Systems: Where is the Intersection? Data Engineering Bulletin, 21(1):4--8, 1998.]]
[2]
S. Chaudhuri and U. Dayal. An Overview of Data Warehousing and OLAP Technology. SIGMOD Record, 26(1):65--74, 1997.]]
[3]
S. Chaudhuri, U. Fayyad and J. Bernhardt. Scalable Classification over SQL Databases. In IEEE Computer Society, editor, ICDE-99, pages 470--479, Sydney, Australia, 1999.]]
[4]
M. Chen, Q. Zhu and Z. Chen. An integrated interactive environment for knowledge discovery from heterogeneous data resources. Information and Software Technology, 43:487--496, 2001.]]
[5]
Q. Chen, U. Dayal and M. Hsu. An OLAP-based Scalable Web Access Analysis Engine. In 2nd International Conference on Data Warehousing and Knowledge Discovery, London, UK, September 2000.]]
[6]
G. Dong, J. Han, J. M. W. Lam et al. Mining Multi-Dimensional Constrained Gradients in Data Cubes. In The VLDB Conference, pages 321--330, 2001.]]
[7]
E. Favero and J. Robin. Using OLAP and Data Mining for Content Planning in Natural Language Generation. Lecture Notes in Computer Science, 1959:164--175, 2001.]]
[8]
U. M. Fayyad, G. P. Shapiro and P. Smyth and R. Uthurusamy. Advances in Knowledge Discovery and Data Mining. AAAI/MIT Press, 1996.]]
[9]
S. Goil and A. Choudhary. High Performance Multidimensional Analysis and Data Mining. In High Performance Networking and Computing Conference, Orlando, USA, November 1998.]]
[10]
S. Goil and A. Choudhary. PARSIMONY: An Infrastructure for parallel Multidimensional Analysis and Data Mining. Journal of parallel and distributed computing, 61:285--321, 2001.]]
[11]
J. Han. Toward On-line Analytical Mining in Large Databases. SIGMOD Record, 27:97--107, 1998.]]
[12]
R.M. Haralick, K.Shanmugan and I. Dinstein. Texture features for image classification. Man and Cybernetics, 3:610--622, 1973.]]
[13]
I. Bartolini, E. Bertino, B. Catania et al. PAtterns for Next-generation DAtabase systems: preliminary results of the PANDA project. In Undicesimo Convegno Nazionale su Sistemi Evoluti Per Basi Di Dati, pages 293--300, Cetraro, Italy, 2003.]]
[14]
T. Imielinski, L. Khachiyan and A. Abdulghani. Cubegrades: Generalizing association rules. Data Mining and Knowledge Descovery, 6:219--258, 2000.]]
[15]
T. Imielinski and H. Mannila. A database perspective on knowledge discovery. Communication of the ACM, 39(11):58--64, 1996.]]
[16]
W. H. Inmon. Building the Data Warehouse. John Wiley & Sons, 1996.]]
[17]
R. Kimball. The Data Warehouse toolkit. John Wiley & Sons, 1996.]]
[18]
A. Laurent, B. Bouchon-Meunier, A. Doucet et al. Fuzzy Data Mining from Multidimensional Databases. In Springer-Verlag, editor, International Symposium on Computational Intelligence (ISCI'2000), pages 278--283, Kosice, Slovakia, 2000.]]
[19]
A. Maedche, A. Hotho and M. Wiese. Enhancing Preprocessing in Data-Intensive Domains using Online-Analytical Processing. In Springer-Verlag, editor, 2nd International Conference on Data Warehousing and Knowledge Discovery, pages 258--264, London, UK, 2000.]]
[20]
R. Meo, G. Psaila and S. Ceri. A New SQL-like Operator for Mining Association Rules. In 22nd VLDB conf., pages 122--133, Bombay, India, 1996.]]
[21]
R. B. Messaoud, S. Rabaséda, O. Boussaid et al. OpAC: Opérateur d'analyse en ligne basé sur une technique de fouille de données. In Revue des Nouvelles Technologies de l'Information (RNTI) (EGC 04), volume 2, pages 35--46, Clermont-Ferrand, France, January 2004.]]
[22]
G.W. Milligan and M.C. Cooper. An examination of procedures for detecting the number of clusters in a data set. Psychometrika, 50:159--179, 1985.]]
[23]
T. Palpanas. Knowledge Discovery in Data Warehouses. SIGMOD Record, 29:88--100, 2000.]]
[24]
K. Parsaye. OLAP and Data Mining: Bridging the Gap. Database Programming and Design, 10:30--37, 1997.]]
[25]
H. Pinto, J. Han, J. Pei et al. Multi-dimensional Sequential Pattern Mining. In Information and Knowledge Management (CIKM'01), Atlanta, USA, November 2001.]]
[26]
S. Sarawgi. iDiff: Informative summarization of differences in multidimensional aggregates. Data Mining And Knowledge Discovery, 5:213--246, 2001.]]
[27]
S. Sarawgi, R. Agrawal and N. Megiddo. Discovery-driven Exploration of OLAP Data Cubes. In The 6th Int. Conference on Extending Database Technology (EDBT), Valencia, Spain, March 2001.]]
[28]
D.A. Zighed, S. Lallich and S.Muhlenbach. A statistical approach for separability of classes. In Statistical Learning, Theory and Applications, Paris, France, 2002.]]

Cited By

View all
  • (2020)A Semi-Automatic Design Methodology for (Big) Data Warehouse Transforming Facts into DimensionsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2019.292562133:1(28-42)Online publication date: 7-Dec-2020
  • (2018)Integration of Data Mining and Statistical Methods for Constructing and Exploring Data CubesInformation Retrieval and Management10.4018/978-1-5225-5191-1.ch037(871-882)Online publication date: 2018
  • (2017)Multidimensional Model Design using Data MiningInternational Journal of Data Warehousing and Mining10.5555/3077757.307775813:1(1-35)Online publication date: 1-Jan-2017
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
DOLAP '04: Proceedings of the 7th ACM international workshop on Data warehousing and OLAP
November 2004
130 pages
ISBN:1581139772
DOI:10.1145/1031763
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 November 2004

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. AHC
  2. clustering
  3. complex data
  4. complex objects
  5. data mining
  6. on-line analysis processing
  7. semantic aggregation

Qualifiers

  • Article

Conference

CIKM04
Sponsor:
CIKM04: Conference on Information and Knowledge Management
November 12 - 13, 2004
DC, Washington, USA

Acceptance Rates

Overall Acceptance Rate 29 of 79 submissions, 37%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)1
Reflects downloads up to 29 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2020)A Semi-Automatic Design Methodology for (Big) Data Warehouse Transforming Facts into DimensionsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2019.292562133:1(28-42)Online publication date: 7-Dec-2020
  • (2018)Integration of Data Mining and Statistical Methods for Constructing and Exploring Data CubesInformation Retrieval and Management10.4018/978-1-5225-5191-1.ch037(871-882)Online publication date: 2018
  • (2017)Multidimensional Model Design using Data MiningInternational Journal of Data Warehousing and Mining10.5555/3077757.307775813:1(1-35)Online publication date: 1-Jan-2017
  • (2017)Multidimensional Model Design using Data MiningInternational Journal of Data Warehousing and Mining10.4018/IJDWM.201701010113:1(1-35)Online publication date: Jan-2017
  • (2015)Integration of Data Mining and Statistical Methods for Constructing and Exploring Data CubesImproving Knowledge Discovery through the Integration of Data Mining Techniques10.4018/978-1-4666-8513-0.ch001(1-12)Online publication date: 2015
  • (2010)Augmenting OLAP exploration with dynamic advanced analyticsProceedings of the 13th International Conference on Extending Database Technology10.1145/1739041.1739127(687-692)Online publication date: 22-Mar-2010
  • (2010)Data mining and automatic OLAP schema generation2010 Fifth International Conference on Digital Information Management (ICDIM)10.1109/ICDIM.2010.5664622(35-43)Online publication date: Jul-2010
  • (2010)Finding an application-appropriate model for XML data warehousesInformation Systems10.1016/j.is.2009.12.00235:6(662-687)Online publication date: 1-Sep-2010
  • (2009)Analyses and algorithms for exploring relational databasesProceedings of the WSEAES 13th international conference on Computers10.5555/1627695.1627768(386-393)Online publication date: 23-Jul-2009
  • (2009)A Conceptual Model for Combining Enhanced OLAP and Data Mining SystemsProceedings of the 2009 Fifth International Joint Conference on INC, IMS and IDC10.1109/NCM.2009.354(1958-1963)Online publication date: 25-Aug-2009
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media