Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1109/ICDM.2006.1guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

\delta-Tolerance Closed Frequent Itemsets

Published: 18 December 2006 Publication History

Abstract

In this paper, we study an inherent problem of mining Frequent Itemsets (FIs): the number of FIs mined is often too large. The large number of FIs not only affects the mining performance, but also severely thwarts the application of FI mining. In the literature, Closed FIs (CFIs) and Maximal FIs (MFIs) are proposed as concise representations of FIs. However, the number of CFIs is still too large in many cases, while MFIs lose information about the frequency of the FIs. To address this problem, we relax the restrictive definition of CFIs and propose the \delta-Tolerance CFIs (\delta- TCFIs). Mining \delta-TCFIs recursively removes all subsets of a \delta-TCFI that fall within a frequency distance bounded by \delta. We propose two algorithms, CFI2TCFI and MineTCFI, to mine \delta-TCFIs. CFI2TCFI achieves very high accuracy on the estimated frequency of the recovered FIs but is less efficient when the number of CFIs is large, since it is based on CFI mining. MineTCFI is significantly faster and consumes less memory than the algorithms of the state-of-the-art concise representations of FIs, while the accuracy of MineTCFI is only slightly lower than that of CFI2TCFI.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
ICDM '06: Proceedings of the Sixth International Conference on Data Mining
December 2006
1209 pages
ISBN:0769527019

Publisher

IEEE Computer Society

United States

Publication History

Published: 18 December 2006

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 30 Aug 2024

Other Metrics

Citations

Cited By

View all
  • (2016)Mining approximate patterns with frequent locally optimal occurrencesDiscrete Applied Mathematics10.1016/j.dam.2015.07.002200:C(123-152)Online publication date: 19-Feb-2016
  • (2013)A prediction framework based on contextual data to support Mobile Personalized MarketingDecision Support Systems10.5555/2747904.274823056:C(234-246)Online publication date: 1-Dec-2013
  • (2013)YmalDBProceedings of the 16th International Conference on Extending Database Technology10.1145/2452376.2452464(725-728)Online publication date: 18-Mar-2013
  • (2012)Key roles of closed sets and minimal generators in concise representations of frequent patternsIntelligent Data Analysis10.5555/2595513.259551716:4(581-631)Online publication date: 1-Jul-2012
  • (2011)ReDRIVEProceedings of the 20th ACM international conference on Information and knowledge management10.1145/2063576.2063798(1547-1552)Online publication date: 24-Oct-2011
  • (2010)Margin-closed frequent sequential pattern miningProceedings of the ACM SIGKDD Workshop on Useful Patterns10.1145/1816112.1816119(45-54)Online publication date: 25-Jul-2010
  • (2010)Self-sufficient itemsetsACM Transactions on Knowledge Discovery from Data10.1145/1644873.16448764:1(1-20)Online publication date: 18-Jan-2010
  • (2009)JPMinerProceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 510.5555/1801874.1801900(114-118)Online publication date: 14-Aug-2009
  • (2009)CP-summaryProceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining10.1145/1557019.1557096(687-696)Online publication date: 28-Jun-2009
  • (2009)Efficient query processing on graph databasesACM Transactions on Database Systems10.1145/1508857.150885934:1(1-48)Online publication date: 23-Apr-2009
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media