Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/1927300.1927314guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Bridging conjunctive and disjunctive search spaces for mining a new concise and exact representation of correlated patterns

Published: 06 October 2010 Publication History

Abstract

In the literature, many works were interested in mining frequent patterns. Unfortunately, these patterns do not offer the whole information about the correlation rate amongst the items that constitute a given pattern since they are mainly interested in appearance frequency. In this situation, many correlation measures have been proposed in order to convey information on the dependencies within sets of items. In this work, we adopt the correlation measure bond, which provides several interesting properties. Motivated by the fact that the number of correlated patterns is often huge while many of them are redundant, we propose a new exact concise representation of frequent correlated patterns associated to this measure, through the definition of a new closure operator. The proposed representation allows not only to efficiently derive the correlation rate of a given pattern but also to exactly offer its conjunctive, disjunctive and negative supports. To prove the utility of our approach, we undertake an empirical study on several benchmark data sets that are commonly used within the data mining community.

References

[1]
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference on VLDB 1994, Santiago, Chile, pp. 487-499 (1994).
[2]
Ben Yahia, S., Hamrouni, T., Mephu Nguifo, E.: Frequent closed itemset based algorithms: A thorough structural and analytical survey. ACM-SIGKDD Explorations 8(1), 93-104 (2006).
[3]
Brin, S., Motwani, R., Silverstein, C.: Beyond market baskets: generalizing association rules to correlations. In: Proceedings of the ACM SIGMOD International Conference on SIGMOD 1997, Tucson, Arizona, USA, pp. 265-276 (1997).
[4]
Casali, A., Cicchetti, R., Lakhal, L.: Essential patterns: A perfect cover of frequent patterns. In: Proceedings of the 7th International Conference on DaWaK, Copenhagen, Denmark, pp. 428-437 (2005).
[5]
Galambos, J., Simonelli, I.: Bonferroni-type inequalities with applications. Springer, Heidelberg (2000).
[6]
Ganter, B., Wille, R.: Formal Concept Analysis. Springer, Heidelberg (1999).
[7]
Hamrouni, T.: Mining concise representations of frequent patterns through conjunctive and disjunctive search spaces. Ph.D. Thesis, University of Tunis El Manar (Tunisia) and University of Artois (France) (2009), http://tel.archives-ouvertes.fr/ tel-00465733.
[8]
Hamrouni, T., Ben Yahia, S., Mephu Nguifo, E.: Sweeping the disjunctive search space towards mining new exact concise representations of frequent itemsets. Data & Knowledge Engineering 68(10), 1091-1111 (2009).
[9]
Hamrouni, T., Ben Yahia, S., Mephu Nguifo, E.: Optimized mining of a concise representation for frequent patterns based on disjunctions rather than conjunctions. In: Proceedings of the 23rd International Florida Artificial Intelligence Research Society Conference (FLAIRS 2010), pp. 422-427. AAAI Press, Daytona Beach, Florida, USA (2010).
[10]
Jaccard, P.: Nouvelles recherches sur la distribution florale. Bulletin de la Société Vaudoise des Sciences Naturelles 44, 223-270 (1908).
[11]
Ke, Y., Cheng, J., Yu, J.X.: Efficient discovery of frequent correlated subgraph pairs. In: Proceedings of the 9th IEEE International Conference on Data Mining, Miami, Florida, USA, pp. 239-248 (2009).
[12]
Kim, W.Y., Lee, Y.K., Han, J.: CCMINE: Efficient mining of confidence-closed correlated patterns. In: Proceedings of the 8th International Pacific-Asia Conference on KDD, Sydney, Australia, pp. 569-579 (2004).
[13]
Kryszkiewicz, M.: Compressed disjunction-free pattern representation versus essential pattern representation. In: Corchado, E., Yin, H. (eds.) IDEAL 2009. LNCS, vol. 5788, pp. 350-358. Springer, Heidelberg (2009).
[14]
Le Bras, Y., Lenca, P., Lallich, S.: Mining interesting rules without support requirement: a general universal existential upward closure property. Annals of Information Systems 8, 75-98 (2010).
[15]
Lee, Y.K., Kim, W.Y., Cai, Y.D., Han, J.: CoMine: Efficient mining of correlated patterns. In: Proceedings of the 3rd IEEE International Conference on Data Mining, Melbourne, Florida, USA, pp. 581-584 (2003).
[16]
Lenca, P., Vaillant, B., Meyer, P., Lallich, S.: Association rule interestingness measures: Experimental and theoretical studies. In: Quality Measures in Data Mining, Studies in Computational Intelligence, vol. 43, pp. 51-76. Springer, Heidelberg (2007).
[17]
Mannila, H., Toivonen, H.: Levelwise search and borders of theories in knowledge discovery. Data Mining and Knowledge Discovery 1(3), 241-258 (1997).
[18]
Omiecinski, E.R.: Alternative interest measures for mining associations in databases. IEEE Transactions on Knowledge and Data Engineering 15(1), 57-69 (2003).
[19]
Pasquier, N., Bastide, Y., Taouil, R., Stumme, G., Lakhal, L.: Generating a condensed representation for association rules. Journal of Intelligent Information Systems 24(1), 25-60 (2005).
[20]
Soulet, A., Crémilleux, B.: Adequate condensed representations of patterns. Data Mining and Knowledge Discovery 17(1), 94-110 (2008).
[21]
Tan, P.N., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. In: Proceedings of the 8th ACM SIGKDD International Conference on KDD, Edmonton, Alberta, Canada, pp. 32-41 (2002).
[22]
Tanimoto, T.T.: An elementary mathematical theory of classification and prediction. Technical Report, I.B.M. Corporation Report (1958).
[23]
Wu, T., Chen, Y., Han, J.: Re-examination of interestingness measures in pattern mining: a unified framework. Data Mining and Knowledge Discovery (2010) 009-0161-2.
[24]
Xiong, H., Tan, P.N., Kumar, V.: Hyperclique pattern discovery. Data Mining and Knowledge Discovery 13(2), 219-242 (2006).

Cited By

View all
  • (2019)An insight into biological datamining based on rarity and correlation as constraintsProceedings of the 34th ACM/SIGAPP Symposium on Applied Computing10.1145/3297280.3297281(3-10)Online publication date: 8-Apr-2019
  • (2015)Key correlation mining by simultaneous monotone and anti-monotone constraints checkingProceedings of the 30th Annual ACM Symposium on Applied Computing10.1145/2695664.2695802(851-856)Online publication date: 13-Apr-2015
  • (2012)Top-N minimization approach for indicative correlation change miningProceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition10.1007/978-3-642-31537-4_9(102-116)Online publication date: 13-Jul-2012
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
DS'10: Proceedings of the 13th international conference on Discovery science
October 2010
382 pages
ISBN:3642161839
  • Editors:
  • Bernhard Pfahringer,
  • Geoff Holmes,
  • Achim Hoffmann

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 06 October 2010

Author Tags

  1. bond measure
  2. closure operator
  3. concise representation
  4. conjunctive support
  5. correlated pattern
  6. disjunctive support
  7. equivalence class

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 09 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2019)An insight into biological datamining based on rarity and correlation as constraintsProceedings of the 34th ACM/SIGAPP Symposium on Applied Computing10.1145/3297280.3297281(3-10)Online publication date: 8-Apr-2019
  • (2015)Key correlation mining by simultaneous monotone and anti-monotone constraints checkingProceedings of the 30th Annual ACM Symposium on Applied Computing10.1145/2695664.2695802(851-856)Online publication date: 13-Apr-2015
  • (2012)Top-N minimization approach for indicative correlation change miningProceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition10.1007/978-3-642-31537-4_9(102-116)Online publication date: 13-Jul-2012
  • (2011)Contrasting correlations by an efficient double-clique conditionProceedings of the 7th international conference on Machine learning and data mining in pattern recognition10.5555/2033831.2033871(469-483)Online publication date: 30-Aug-2011

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media