Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1951365.1951370acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article

Novel techniques to reduce search space in multiple minimum supports-based frequent pattern mining algorithms

Published: 21 March 2011 Publication History

Abstract

Frequent patterns are an important class of regularities that exist in a transaction database. Certain frequent patterns with low minimum support (minsup) value can provide useful information in many real-world applications. However, extraction of these frequent patterns with single minsup-based frequent pattern mining algorithms such as Apriori and FP-growth leads to "rare item problem." That is, at high minsup value, the frequent patterns with low minsup are missed, and at low minsup value, the number of frequent patterns explodes. In the literature, "multiple minsups framework" was proposed to discover frequent patterns. Furthermore, frequent pattern mining techniques such as Multiple Support Apriori and Conditional Frequent Pattern-growth (CFP-growth) algorithms have been proposed. As the frequent patterns mined with this framework do not satisfy downward closure property, the algorithms follow different types of pruning techniques to reduce the search space. In this paper, we propose an efficient CFP-growth algorithm by proposing new pruning techniques. Experimental results show that the proposed pruning techniques are effective.

References

[1]
R. Agrawal, T. Imieliński, and A. Swami. Mining association rules between sets of items in large databases. SIGMOD Rec., 22:207--216, June 1993.
[2]
R. Agrawal and R. Srikant. Fast algorithms for mining association rules in large databases. In Proceedings of the 20th International Conference on Very Large Data Bases, VLDB '94, pages 487--499, 1994.
[3]
M. Hahsler. A model-based frequency constraint for mining associations from transaction data. Data Min. Knowl. Discov., 13(2):137--166, 2006.
[4]
J. Han, H. Cheng, D. Xin, and X. Yan. Frequent pattern mining: current status and future directions. Data Min. Knowl. Discov., 15(1):55--86, 2007.
[5]
J. Han, J. Pei, Y. Yin, and R. Mao. Mining frequent patterns without candidate generation: A frequent-pattern tree approach. Data Min. Knowl. Discov., 8(1):53--87, 2004.
[6]
Y.-H. Hu and Y.-L. Chen. Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism. Decis. Support Syst., 42(1):1--24, 2006.
[7]
R. U. Kiran and P. K. Reddy. An improved frequent pattern-growth approach to discover rare association rules. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval, pages 43--52, 2009.
[8]
R. U. Kiran and P. K. Reddy. An improved multiple minimum support based approach to mine rare association rules. In IEEE Symposium on Computational Intelligence and Data Mining, pages 340--347, 2009.
[9]
R. U. Kiran and P. K. Reddy. Mining rare association rules in the datasets with widely varying items' frequencies. In DASFAA (1), pages 49--62, 2010.
[10]
B. Liu, W. Hsu, and Y. Ma. Mining association rules with multiple minimum supports. In KDD '99: Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 337--341. ACM, 1999.
[11]
C. S. K. Selvi and A. Tamilarasi. Mining association rules with dynamic and collective support thresholds. International Journal on Open Problems Computational Mathematics, 2(3):427--438, 2009.
[12]
R. Uday Kiran and P. Krishna Reddy. Towards efficient mining of periodic-frequent patterns in transactional databases. In Database and Expert Systems Applications, volume 6262 of Lecture Notes in Computer Science, pages 194--208. Springer, 2010.
[13]
G. M. Weiss. Mining with rarity: a unifying framework. SIGKDD Explor. Newsl., 6(1):7--19, 2004.
[14]
H. Yun, D. Ha, B. Hwang, and K. H. Ryu. Mining association rules on significant rare data using relative support. J. Syst. Softw., 67:181--191, September 2003.
[15]
Z. Zheng, R. Kohavi, and L. Mason. Real world performance of association rule algorithms. In KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, pages 401--406. ACM, 2001.
[16]
L. Zhou and S. Yau. Association rule and quantitative association rule mining among infrequent items. In International Workshop on Multimedia Data Mining, 2007.

Cited By

View all
  • (2024)New approaches for mining high utility itemsets with multiple utility thresholdsApplied Intelligence10.1007/s10489-023-05145-854:1(767-790)Online publication date: 1-Jan-2024
  • (2024)IM-DISCO: Invariant Mining for Detecting IntrusionS in Critical OperationsComputer Security. ESORICS 2023 International Workshops10.1007/978-3-031-54129-2_3(42-58)Online publication date: 12-Mar-2024
  • (2023)Depth-First Uncertain Frequent Itemsets Mining based on Ensembled Conditional Item-Wise Supports2023 International Conference on Intelligent Supercomputing and BioPharma (ISBP)10.1109/ISBP57705.2023.10061307(121-128)Online publication date: 6-Jan-2023
  • Show More Cited By

Index Terms

  1. Novel techniques to reduce search space in multiple minimum supports-based frequent pattern mining algorithms

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    EDBT/ICDT '11: Proceedings of the 14th International Conference on Extending Database Technology
    March 2011
    587 pages
    ISBN:9781450305280
    DOI:10.1145/1951365
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    • Microsoft Research: Microsoft Research

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 March 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. data mining
    2. frequent patterns and multiple minimum supports
    3. knowledge discovery

    Qualifiers

    • Research-article

    Conference

    EDBT/ICDT '11
    Sponsor:
    • Microsoft Research
    EDBT/ICDT '11: EDBT/ICDT '11 joint conference
    March 21 - 24, 2011
    Uppsala, Sweden

    Acceptance Rates

    Overall Acceptance Rate 7 of 10 submissions, 70%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 25 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)New approaches for mining high utility itemsets with multiple utility thresholdsApplied Intelligence10.1007/s10489-023-05145-854:1(767-790)Online publication date: 1-Jan-2024
    • (2024)IM-DISCO: Invariant Mining for Detecting IntrusionS in Critical OperationsComputer Security. ESORICS 2023 International Workshops10.1007/978-3-031-54129-2_3(42-58)Online publication date: 12-Mar-2024
    • (2023)Depth-First Uncertain Frequent Itemsets Mining based on Ensembled Conditional Item-Wise Supports2023 International Conference on Intelligent Supercomputing and BioPharma (ISBP)10.1109/ISBP57705.2023.10061307(121-128)Online publication date: 6-Jan-2023
    • (2022)Assessing Model-free Anomaly Detection in Industrial Control Systems Against Generic Concealment AttacksProceedings of the 38th Annual Computer Security Applications Conference10.1145/3564625.3564633(412-426)Online publication date: 5-Dec-2022
    • (2022)Mining High Utility Itemset with Multiple Minimum Utility Thresholds Based on Utility Deviation2022 IEEE International Conference on Data Mining Workshops (ICDMW)10.1109/ICDMW58026.2022.00071(490-496)Online publication date: Nov-2022
    • (2022)High average-utility itemsets mining: a surveyApplied Intelligence10.1007/s10489-021-02611-z52:4(3901-3938)Online publication date: 1-Mar-2022
    • (2022)Mining Frequents Itemset and Association Rules in Diabetic DatasetBusiness Intelligence10.1007/978-3-031-06458-6_12(146-157)Online publication date: 13-May-2022
    • (2021)MaxRI: A method for discovering maximal rare itemsets2021 4th International Conference on Data Science and Information Technology10.1145/3478905.3478972(334-341)Online publication date: 23-Jul-2021
    • (2021)Constraint Programming for Itemset Mining with Multiple Minimum Supports2021 IEEE 33rd International Conference on Tools with Artificial Intelligence (ICTAI)10.1109/ICTAI52525.2021.00095(598-603)Online publication date: Nov-2021
    • (2021)FRI-miner: fuzzy rare itemset miningApplied Intelligence10.1007/s10489-021-02574-1Online publication date: 6-Jul-2021
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media