Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Erasable itemset mining over incremental databases with weight conditions

Published: 01 June 2016 Publication History

Abstract

Erasable itemset mining is an approach for mining itemsets with low profits from large-scale product databases in order to solve financial crises of plants in manufacturing industries. Previous erasable itemset mining methods deal with static product databases only, and ignore any characteristics such as items' own values when they extract the erasable itemsets. Therefore, such approaches may fail to solve financial crises of plants because they have to iterate a significant number of mining processes in order to deal with real-time product data accumulated from plants in the real world. In this paper, we propose a new tree-based erasable itemset mining algorithm for dynamic databases, which finds erasable itemsets considering the weight conditions from incremental databases. The proposed algorithm uses new tree and list data structures for performing its mining operations more efficiently. Furthermore, the proposed algorithm is capable of reducing the number of mined erasable itemsets by considering the different weight information of items within product databases. We also compare the proposed approach with other tree-based state-of-the-art methods. By performing runtime, memory, pattern quality, and scalability comparisons with respect to various real and synthetic incremental datasets, we show that the proposed algorithm is outstanding in comparison to other previous methods.

References

[1]
Agrawal, R., Srikant, R., 1994. Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference on Very Large Data Bases, September, pp. 487-499.
[2]
Ahmed, C.F., Tanbeer, S.K., Jeong, B., 2009. Efficient mining of weighted frequent patterns over data streams. In: Proceedings of the 11th IEEE International Conference on High Performance Computing and Communications, June, pp. 400-406.
[3]
C.F. Ahmed, S.K. Tanbeer, B. Jeong, Y. Lee, Efficient tree structures for high utility pattern mining in incremental databases, IEEE Trans. Knowl. Data Eng., 21 (2009) 1708-1721.
[4]
C.F. Ahmed, S.K. Tanbeer, B. Jeong, Y. Lee, H. Choi, Single-pass incremental and interactive mining for weighted frequent patterns, Expert Syst. Appl., 39 (2012) 7976-7994.
[5]
M. Berlingerio, F. Pinelli, F. Calabrese, ABACUS: frequent pattern mining-based community discovery in multidimensional networks, Data Min. Knowl. Discov., 27 (2013) 294-320.
[6]
T.-P. Chang, Shih-Ying Chen, An efficient algorithm of frequent XML query pattern mining for ebXML applications in e-commerce, Expert Syst. Appl., 39 (2012) 2183-2193.
[7]
L. Chen, W. Liu, Frequent patterns mining in multiple biological sequences, Comput. Biol. Med., 43 (2013) 1444-1452.
[8]
Z. Deng, Mining top-rank-k erasable itemsets by PID_lists, Int. J.¿Intell. Syst., 28 (2013) 366-379.
[9]
Deng, Z., Fang, G., Wang, Z., 2009.¿Mining erasable itemsets. In: Proceedings of the 8th International Conference on Machine Learning and Cybernetics, vol. 1, July, pp. 67-73.
[10]
Z. Deng, X. Xu, An efficient algorithm for mining erasable itemsets, Adv. Data Min. Appl. (2010) 214-225.
[11]
Z. Deng, X. Xu, Fast mining erasable itemsets using NC_sets, Expert Syst. Appl., 39 (2012) 4453-4463.
[12]
A. Gionis, H. Mannila, T. Mielikäinen, P. Tsaparas, Assessing data mining results via swap randomization, ACM Trans. Knowl. Discov. Data, l (2007).
[13]
J. Han, J. Pei, Y. Yin, R. Mao, Mining frequent patterns without candidate generation: a frequent-pattern tree approach, Data Min. Knowl. Discov., 8 (2004) 53-87.
[14]
Hämäläinen, W.,¿Nykänen, Matti, 2008. Efficient discovery of statistically significant association rules. In: Proceedings of the IEEE International Conference on Data Mining (ICDM), pp. 203-212.
[15]
T. Le, B. Vo, MEI: an efficient algorithm for mining erasable itemsets, Eng. Appl. Artif. Intell., 27 (2014) 155-166.
[16]
Le, T., Vo, B., Coenen, F., 2013. An efficient algorithm for mining erasable itemsets using the difference of NC-sets. In: Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, October, pp. 2270-2274.
[17]
G. Lee, U. Yun, H. Ryang, Mining weighted erasable patterns by using underestimated constraint-based pruning technique, J. Intell. Fuzzy Syst., 28 (2014) 1145-1157.
[18]
G. Lee, U. Yun, K. Ryu, Sliding window based weighted maximal frequent pattern mining over data streams, Expert Syst. Appl., 41 (2014) 694-708.
[19]
Y.-S. Lee, S.-J. Yen, Incremental and interactive mining of web traversal patterns, Inf. Sci., 178 (2008) 287-306.
[20]
J. Lijffijt, P. Papapetrou, K. Puolamäki, A statistical significance testing approach to mining the most informative set of patterns, Data Min. Knowl. Discov., 28 (2014) 238-263.
[21]
C.-W. Lin, T.-P. Hong, G.-C. Lan, J.-W. Wong, W.-Y. Lin, Incrementally mining high utility patterns based on pre-large concept, Appl. Intell., 40 (2014) 343-357.
[22]
C.-W. Lin, G.-C. Lan, T.-P. Hong, An incremental mining algorithm for high utility itemsets, Expert Syst. Appl., 39 (2012) 7173-7180.
[23]
S. Lu, H. Hu, F. Li, Mining weighted association rules, Intell. Data Anal., 5 (2001) 211-225.
[24]
G. Nguyen, T. Le, B. Vo, B. Le, A new approach for mining top-rank-k erasable itemsets, Intell. Inf. Database Syst., 8397 (2014) 73-82.
[25]
P.N.E. Nohuddin, F. Coenen, R. Christley, C. Setzkorn, Y. Patel, S. Williams, Finding "interesting" trends in social networks using frequent pattern mining and self organizing maps, Knowl.-Based Syst., 29 (2012) 104-113.
[26]
R. Pears, Y.S. Koh, G. Dobbie, W.K. Yeap, Weighted association rule mining via a graph based connectivity model, Inf. Sci., 218 (2013) 61-84.
[27]
G. Pyun, U. Yun, Mining top-k frequent patterns with combination reducing techniques, Appl. Intell., 41 (2014) 76-98.
[28]
G. Pyun, U. Yun, K. Ryu, Efficient frequent pattern mining based on linear prefix tree, Knowl.-Based Syst., 55 (2014) 125-139.
[29]
H. Ryang, U. Yun, Fast algorithm for high utility pattern mining with the sum of item quantities, Intell. Data Anal., 20 (2016) 395-415.
[30]
H. Ryang, U. Yun, K. Ryu, Discovering high utility itemsets with multiple minimum supports, Intell. Data Anal., 18 (2014) 1027-1047.
[31]
S.K. Tanbeer, C.F. Ahmed, B.-S. Jeong, Y.-K. Lee, Efficient single-pass frequent pattern mining using a prefix-tree, Inf. Sci., 179 (2008) 559-583.
[32]
B. Vo, F. Coenen, B. Le, A new method for mining Frequent Weighted Itemsets based on WIT-trees, Expert Syst. Appl., 40 (2013) 1256-1264.
[33]
S.-J. Yen, Y.-S. Lee, C.-K. Wang, An efficient algorithm for incrementally mining frequent closed itemsets, Appl. Intell, 40 (2014) 649-668.
[34]
U. Yun, G. Lee, K. Ryu, Mining maximal frequent patterns by considering weight conditions over data streams, Knowl.-Based Syst., 55 (2014) 49-65.
[35]
U. Yun, G. Pyun, E. Yoon, Efficient mining of robust closed weighted sequential patterns without information loss, Int. J. Artif. Intell. Tools, 24 (2015).
[36]
U. Yun, H. Ryang, Incremental high utility pattern mining with static and dynamic databases, Appl. Intell., 42 (2015) 323-352.
[37]
U. Yun, H. Ryang, K. Ryu, High utility itemset mining with techniques for reducing overestimated utilities and pruning candidates, Expert Syst. Appl., 41 (2014) 3861-3878.
[38]
U. Yun, K. Ryu, Efficient mining of maximal correlated weight frequent patterns, Intell. Data Anal., 17 (2013) 917-939.
[39]
U. Yun, E. Yoon, An efficient approach for mining weighted approximate closed frequent patterns considering noise constraints, Int. J. Uncertain., Fuzziness¿Knowl.-Based Syst., 22 (2014) 879-912.

Cited By

View all
  • (2024)Mining Erasable Itemsets from a Product Database with Quantitative ComponentsProceedings of the 2024 11th Multidisciplinary International Social Networks Conference10.1145/3675669.3675698(62-66)Online publication date: 21-Aug-2024
  • (2023)Tree-Based Unified Temporal Erasable-Itemset MiningIntelligent Information and Database Systems10.1007/978-981-99-5834-4_18(224-233)Online publication date: 24-Jul-2023
  • (2020)Efficiently mining erasable stream patterns for intelligent systems over uncertain dataInternational Journal of Intelligent Systems10.1002/int.2226935:11(1699-1734)Online publication date: 28-Sep-2020
  • Show More Cited By

Index Terms

  1. Erasable itemset mining over incremental databases with weight conditions
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Engineering Applications of Artificial Intelligence
    Engineering Applications of Artificial Intelligence  Volume 52, Issue C
    June 2016
    248 pages

    Publisher

    Pergamon Press, Inc.

    United States

    Publication History

    Published: 01 June 2016

    Author Tags

    1. Data mining
    2. Erasable itemset
    3. Incremental database
    4. Itemset pruning
    5. Weight constraint

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 01 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Mining Erasable Itemsets from a Product Database with Quantitative ComponentsProceedings of the 2024 11th Multidisciplinary International Social Networks Conference10.1145/3675669.3675698(62-66)Online publication date: 21-Aug-2024
    • (2023)Tree-Based Unified Temporal Erasable-Itemset MiningIntelligent Information and Database Systems10.1007/978-981-99-5834-4_18(224-233)Online publication date: 24-Jul-2023
    • (2020)Efficiently mining erasable stream patterns for intelligent systems over uncertain dataInternational Journal of Intelligent Systems10.1002/int.2226935:11(1699-1734)Online publication date: 28-Sep-2020
    • (2019)SPPCApplied Intelligence10.1007/s10489-018-1280-549:2(478-495)Online publication date: 1-Feb-2019
    • (2018)An Efficient Method for Mining Erasable Itemsets Using Multicore Processor PlatformComplexity10.1155/2018/84876412018Online publication date: 1-Jan-2018
    • (2018)Maintenance of Erasable Itemsets for Product DeletionProceedings of the 5th Multidisciplinary International Social Networks Conference10.1145/3227696.3227718(1-4)Online publication date: 16-Jul-2018
    • (2018)Single-pass based efficient erasable pattern mining using list data structure on dynamic incremental databasesFuture Generation Computer Systems10.1016/j.future.2017.07.03580:C(12-28)Online publication date: 1-Mar-2018
    • (2018)Efficient algorithms for mining top-rank-k erasable patterns using pruning strategies and the subsume conceptEngineering Applications of Artificial Intelligence10.1016/j.engappai.2017.09.01068:C(1-9)Online publication date: 1-Feb-2018
    • (2017)An efficient algorithm for mining high utility patterns from incremental databases with one database scanKnowledge-Based Systems10.1016/j.knosys.2017.03.016124:C(188-206)Online publication date: 15-May-2017
    • (2017)Efficient algorithm for mining high average-utility itemsets in incremental transaction databasesApplied Intelligence10.1007/s10489-016-0890-z47:1(114-131)Online publication date: 1-Jul-2017

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media