Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Mining frequent itemsets in large databases: The hierarchical partitioning approach

Published: 01 April 2013 Publication History

Abstract

Although many methods have been proposed to enhance the efficiencies of data mining, little research has been devoted to the issue of scalability - that is, the problem of mining frequent itemsets when the size of the database is very large. This study proposes a methodology, hierarchical partitioning, for mining frequent itemsets in large databases, based on a novel data structure called the Frequent Pattern List (FPL). One of the major features of the FPL is its ability to partition the database, and thus transform the database into a set of sub-databases of manageable sizes. As a result, a divide-and-conquer approach can be developed to perform the desired data-mining tasks. Experimental results show that hierarchical partitioning is capable of mining frequent itemsets and frequent closed itemsets in very large databases.

References

[1]
Mastering data mining: The art and science of customer relationship management. John-Wiley & Sons, Indianapolis, IN.
[2]
Data mining: An overview from database perspective. IEEE Transactions on Knowledge and Data Engineering. v8 i6. 866-883.
[3]
Han, J., Pei, J., & Yin, Y. (2000). Mining frequent itemsets without candidate generation. In: Proc. 2000 ACM SIGMOD int. conf. on management of data (SIGMOD'00) (pp. 1-12).
[4]
Data mining: Concepts and techniques. 2006. 2nd ed. Morgan Kaufmann, San Francisco, CA.
[5]
Data mining: Concepts and techniques. 3rd ed. Morgan Kaufmann, San Francisco, CA.
[6]
Mining frequent itemsets without candidate generation: A frequent-pattern tree approach. Data Mining and Knowledge Discovery. v8 i1. 53-87.
[7]
Pasquier, N., Bastide, Y., Taouil, R., & Lakhal, L. (1999). Discovering frequent closed itemsets for association rules. In: Proc. 7th int. conf. database theory (ICDT'99) (pp. 398-416).
[8]
Pei, J., Han, J., & Mao, R. (2000). CLOSET: An efficient algorithm for mining frequent closed itemsets. In: Proc. 2000 ACMSIGMOD int. workshop, data mining and knowledge discovery (DMKD00) (pp. 11-20).
[9]
Pei, J., Han, J., Lu, H., Nishio, S., Tang, S., & Yang, D. (2001). H-Mine: Hyper-structure mining of frequent itemsets in large databases. In: Proc. 2001 int. conf. on data mining (ICDM'01), San Jose, CA, Nov. 2001.
[10]
Savasere, A., Omiecinski, D., & Navathe, S. (1995). An efficient algorithm for mining association rules in large databases. In: Proc. int'l conf. very large data bases (VLDB 95), Sept. 1995 (pp. 432-443).
[11]
An adaptive approach to mining frequent itemsets efficiently. Expert Systems with Applications. v39 i18. 13166-13172.
[12]
Tseng, F. -C., Hsu, C. -C., & Chen, H. (2001). Mining frequent closed itemsets with the Frequent Pattern List. In: Proc. 2001 IEEE int'l conference on data mining (ICDM 2001) (pp. 653-654).
[13]
Generating frequent itemsets with the frequent pattern list. Lecture Notes in Artificial Intelligence. v2035. 376-386.
[14]
The frequent pattern list: Another framework for mining frequent itemsets. International Journal of Electronic Business Management. v3 i2. 104-115.
[15]
Efficiently mining frequent closed itemsets by eliminating data redundancies. Electronic Commerce Studies. v3 i1. 39-56.
[16]
To bid or to buy? Online shoppers' preferences for online purchasing channels. International Journal of E-Business Research. v6 i2. 1-20.
[17]
Segmenting online gamers by motivation. Expert Systems with Applications. v38 i6. 7693-7697.
[18]
DBV-miner: A dynamic bit-vector approach for fast mining frequent closed itemsets. Expert Systems with Applications. v39 i8. 7196-7206.
[19]
Wang, J., Han, J., & Pei, J. (2003). CLOSET+: Searching for the best strategies for mining frequent closed itemsets. In: Proc. 2003 ACM SIGKDD int. conf. on knowledge discovery and data mining (KDD'03), Washington, DC, Aug. 2003.
[20]
Efficient algorithms for mining closed itemsets and their lattice structure. IEEE Transactions on Knowledge and Data Engineering. v17 i4. 462-478.

Cited By

View all
  1. Mining frequent itemsets in large databases: The hierarchical partitioning approach

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image Expert Systems with Applications: An International Journal
      Expert Systems with Applications: An International Journal  Volume 40, Issue 5
      April, 2013
      505 pages

      Publisher

      Pergamon Press, Inc.

      United States

      Publication History

      Published: 01 April 2013

      Author Tags

      1. Data mining
      2. Frequent Pattern List (FPL)
      3. Frequent closed itemsets
      4. Frequent itemsets
      5. Hierarchical partitioning

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 09 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Bit Splicing Frequent Itemset Mining Algorithm Based on Dynamic GroupingWeb and Big Data10.1007/978-981-97-2387-4_28(417-432)Online publication date: 6-Oct-2023
      • (2018)OOIMASPExpert Systems with Applications: An International Journal10.1016/j.eswa.2017.10.01593:C(62-71)Online publication date: 1-Mar-2018
      • (2016)Fast algorithm for high utility pattern mining with the sum of item quantitiesIntelligent Data Analysis10.3233/IDA-16081120:2(395-415)Online publication date: 1-Jan-2016
      • (2016)High utility pattern mining over data streams with sliding window techniqueExpert Systems with Applications: An International Journal10.1016/j.eswa.2016.03.00157:C(214-231)Online publication date: 15-Sep-2016
      • (2015)Online Gamers' Preferences for Online Game Charging MechanismsInternational Journal of E-Business Research10.4018/ijebr.201501010211:1(23-34)Online publication date: 1-Jan-2015
      • (2015)Association rule mining with mostly associated sequential patternsExpert Systems with Applications: An International Journal10.1016/j.eswa.2014.10.04942:5(2582-2592)Online publication date: 1-Apr-2015

      View Options

      View options

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media