Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1133905.1133916acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

LCM ver.3: collaboration of array, bitmap and prefix tree for frequent itemset mining

Published: 21 August 2005 Publication History

Abstract

For a transaction database, a frequent itemset is an itemset included in at least a specified number of transactions. To find all the frequent itemsets, the heaviest task is the computation of frequency of each candidate itemset. In the previous studies, there are roughly three data structures and algorithms for the computation: bitmap, prefix tree, and array lists. Each of these has its own advantage and disadvantage with respect to the density of the input database. In this paper, we propose an efficient way to combine these three data structures so that in any case the combination gives the best performance.

References

[1]
A. Fiat and S. Shporer, "AIM2: Improved implementation of AIM" In Proc. IEEE ICDM'04 Workshop FIMI'04, 2004.
[2]
B. Racz, "nonordfp: An FP-growth variation without rebuilding the FP-tree," In Proc. IEEE ICDM'04 Workshop FIMI'04, 2004.
[3]
R. J. Bayardo Jr., "Efficiently Mining Long Patterns from Databases", In Proc. SIGMOD'98, pp. 85--93, 1998.
[4]
D. Burdick, M. Calimlim, J. Gehrke, "MAFIA: A Maximal Frequent Itemset Algorithm for Transactional Databases," In Proc. ICDE 2001, pp. 443--452, 2001.
[5]
D. Burdick, M. Calimlim, J. Flannick, J. Gehrke, and T. Yiu, "MAFIA: A Performance Study of Mining Maximal Frequent Itemsets," In Proc. IEEE ICDM'03 Workshop FIMI'03, 2003.
[6]
B. Goethals, the FIMI repository, http://fimi.cs.helsinki.fi/, 2003.
[7]
G. Grahne and J. Zhu, "Efficiently Using Prefix-trees in Mining Frequent Itemsets," In Proc. IEEE ICDM'03 Workshop FIMI'03, 2003.
[8]
J. Han, J. Pei, Y. Yin, "Mining Frequent Patterns without Candidate Generation," SIGMOD Conference 2000, pp. 1--12, 2000.
[9]
Guimei Liu, Hongjun Lu, Jeffrey Xu Yu, Wang Wei, and Xiangye Xiao, "AFOPT: An Efficient Implementation of Pattern Growth Approach," In Proc. IEEE ICDM'03 Workshop FIMI'03, 2003.
[10]
S. Orlando, C. Lucchese, P. Palmerini, R. Perego and F. Silvestri, "kDCI: a Multi-Strategy Algorithm for Mining Frequent Sets," In Proc. IEEE ICDM'03 Workshop FIMI'03, 2003.
[11]
C. Lucchese, S. Orlando and R. Perego, "DCI Closed: A Fast and Memory Efficient Algorithm to Mine Frequent Closed Itemsets," In Proc. IEEE ICDM'04 Workshop FIMI'04, 2004.
[12]
N. Pasquier, Y. Bastide, R. Taouil, L. Lakhal, Efficient Mining of Association Rules Using Closed Itemset Lattices, Inform. Syst., 24(1), 25--46, 1999.
[13]
A. Pietracaprina and D. Zandolin, "Mining Frequent Itemsets using Patricia Tries," In Proc. IEEE ICDM'03 Workshop FIMI'03, 2003.
[14]
T. Uno, T. Asai, Y. Uchida, H. Arimura, "LCM: An Efficient Algorithm for Enumerating Frequent Closed Item Sets," In Proc. IEEE ICDM'03 Workshop FIMI'03, 2003.
[15]
T. Uno, T. Asai, Y. Uchida, H. Arimura, "An Efficient Algorithm for Enumerating Closed Patterns in Transaction Databases," Lecture Notes in Artificial Intelligence 3245, pp. 16--31, 2004.
[16]
T. Uno, M. Kiyomi, H. Arimura, "LCM ver. 2: Efficient Mining Algorithms for Frequent/Closed/Maximal Itemsets", In Proc. IEEE ICDM'04 Workshop FIMI'04, 2004.

Cited By

View all
  • (2025)On the discovery of seasonal gradual patterns through periodic patterns miningInformation Systems10.1016/j.is.2024.102511129(102511)Online publication date: Mar-2025
  • (2024)Fraction-Score: A Generalized Support Measure for Weighted and Maximal Co-Location Pattern MiningIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.330436536:4(1582-1596)Online publication date: Apr-2024
  • (2024)Efficient Frequent Chronicle Mining Algorithms: Application to Sleep DisorderIEEE Access10.1109/ACCESS.2024.335713912(14580-14595)Online publication date: 2024
  • Show More Cited By
  1. LCM ver.3: collaboration of array, bitmap and prefix tree for frequent itemset mining

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    OSDM '05: Proceedings of the 1st international workshop on open source data mining: frequent pattern mining implementations
    August 2005
    92 pages
    ISBN:1595932100
    DOI:10.1145/1133905
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 August 2005

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Article

    Conference

    KDD05
    Sponsor:

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)17
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 15 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)On the discovery of seasonal gradual patterns through periodic patterns miningInformation Systems10.1016/j.is.2024.102511129(102511)Online publication date: Mar-2025
    • (2024)Fraction-Score: A Generalized Support Measure for Weighted and Maximal Co-Location Pattern MiningIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.330436536:4(1582-1596)Online publication date: Apr-2024
    • (2024)Efficient Frequent Chronicle Mining Algorithms: Application to Sleep DisorderIEEE Access10.1109/ACCESS.2024.335713912(14580-14595)Online publication date: 2024
    • (2024)A scalable, distributed framework for significant subgroup discoveryKnowledge-Based Systems10.1016/j.knosys.2023.111335284:COnline publication date: 25-Jan-2024
    • (2024)Data Heterogeneity's Impact on the Performance of Frequent Itemset Mining AlgorithmsInformation Sciences10.1016/j.ins.2024.120981(120981)Online publication date: Jun-2024
    • (2024)GMiner++: Boosting GPU-based frequent itemset mining by reducing redundant computationsExpert Systems with Applications10.1016/j.eswa.2024.123928250(123928)Online publication date: Sep-2024
    • (2024)SufRec, an algorithm for mining association rules: Recursivity and task parallelismExpert Systems with Applications10.1016/j.eswa.2023.121321236(121321)Online publication date: Feb-2024
    • (2024)$$(\alpha ,\beta )$$-Butterfly Computation on Bipartite GraphsArabian Journal for Science and Engineering10.1007/s13369-024-09632-zOnline publication date: 21-Oct-2024
    • (2024)Identifying similar-bicliques in bipartite graphsThe VLDB Journal10.1007/s00778-023-00834-933:3(703-726)Online publication date: 25-Jan-2024
    • (2024)What is the Intrinsic Dimension of Your Binary Data?—and How to Compute it QuicklyConceptual Knowledge Structures10.1007/978-3-031-67868-4_7(97-112)Online publication date: 1-Aug-2024
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media