Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
survey

FPGA/GPU-based Acceleration for Frequent Itemsets Mining: A Comprehensive Review

Published: 08 October 2021 Publication History

Abstract

In data mining, Frequent Itemsets Mining is a technique used in several domains with notable results. However, the large volume of data in modern datasets increases the processing time of Frequent Itemset Mining algorithms, making them unsuitable for many real-world applications. Accordingly, proposing new methods for Frequent Itemset Mining to obtain frequent itemsets in a realistic amount of time is still an open problem. A successful alternative is to employ hardware acceleration using Graphics Processing Units (GPU) and Field Programmable Gates Arrays (FPGA). In this article, a comprehensive review of the state of the art of Frequent Itemsets Mining hardware acceleration is presented. Several approaches (FPGA and GPU based) were contrasted to show their weaknesses and strengths. This survey gathers the most relevant and the latest research efforts for improving the performance of Frequent Itemsets Mining regarding algorithms advances and modern development platforms. Furthermore, this survey organizes the current research on Frequent Itemsets Mining from the hardware perspective considering the source of the data, the development platform, and the baseline algorithm.

References

[1]
Syed Hasan Adil and Sadaf Qamar. 2009. Implementation of association rule mining using CUDA. In Proceedings of the IEEE International Conference on Emerging Technologies (ICET’09). IEEE, 332–336.
[2]
Giuseppe Agapito, Pietro Hiram Guzzi, and Mario Cannataro. 2019. Parallel extraction of association rules from genomics data. Appl. Math. Comput. 350, 1 (2019), 434–446.
[3]
Charu C. Aggarwal. 2015. Data Mining: The Textbook. Springer, Cham, Switzerland.
[4]
Rakesh Agrawal, Tomasz Imieliundefinedski, and Arun Swami. 1993. Mining association rules between sets of items in large databases. SIGMOD Rec. 22, 2 (Jun. 1993), 207–216.
[5]
Rakesh Agrawal and Ramakrishnan Srikant. 1994. Fast algorithms for mining association rules in large databases. In Proceedings of the 20th International Conference on Very Large Data Bases (VLDB’94). Morgan Kaufmann, 487–499.
[6]
Apache Foundation. 2006. Apache Hadoop. (2006). https://hadoop.apache.org/.
[7]
Khedija Arour and Amani Belkahla. 2014. Frequent pattern-growth algorithm on multi-core CPU and GPU processors. J. Comp. Inf. Technol. 22, 3 (2014), 159–169.
[8]
Brian Babcock, Shivnath Babu, Mayur Datar, Rajeev Motwani, and Jennifer Widom. 2002. Models and issues in data stream systems. In Proceedings of the 21st ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS’02). ACM, 1–16.
[9]
Zachary Baker and Viktor Prasanna. 2005. Efficient hardware data mining with the apriori algorithm on FPGAs. In Proceedings of the 13th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM’05). IEEE Computer Society, 3–12.
[10]
Zachary Baker and Viktor Prasanna. 2006. An architecture for efficient hardware data mining using reconfigurable computing systems. In Proceedings of the 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM’06). IEEE Computer Society, 67–75.
[11]
Dharmesh Bhalodia and Chhaya Patel. 2014. Mining frequent patterns with optimized candidate representation on graphics processor. Int. J. Comp. Appl. 105, 7 (2014), 1–7.
[12]
Pan Bian, Bin Liang, Wenchang Shi, Jianjun Huang, and Yan Cai. 2018. NAR-miner: Discovering negative association rules from code for bug detection. In Proceedings of the 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE’18). Association for Computing Machinery, New York, NY, 411–422.
[13]
Christian Borgelt. 2003. Efficient implementations of apriori and eclat. In FIMI’03: Proceedings of the IEEE-ICDM Workshop on Frequent Itemset Mining Implementations (CEUR Workshop Proceedings), Vol. 90. CEUR-WS.org.
[14]
Lázaro Bustio, René Cumplido, Raudel Hernández, José M. Bande, and Claudia Feregrino. 2015. A hardware-based approach for frequent itemset mining in data streams. In Proceedings of the 4th Workshop on New Frontiers in Mining Complex Patterns (n’15) held in conjunction with PKDD2015. 14–26.
[15]
Lázaro Bustio, René Cumplido, Raudel Hernández, José M. Bande, and Claudia Feregrino. 2016. Frequent Itemsets Mining in Data Streams Using Reconfigurable Hardware. Springer, 32–45.
[16]
Lázaro Bustio-Martínez. 2017. Hardware Acceleration of Frequent Itemsets Mining on Data Streams. Ph.D. Dissertation. Computer Sciences Department.
[17]
Lázaro Bustio-Martínez, René Cumplido, Raudel Hernández-León, José M. Bande-Serrano, and Claudia Feregrino-Uribe. 2018. On the design of hardware-software architectures for frequent itemsets mining on data streams. J. Intell. Inf. Syst. 50, 3 (Jun. 2018), 415–440.
[18]
L. Bustio-Martínez, R. Cumplido, M. Letras-Luna, C. F. Uribe, R. Hernández-Léon, and J. M. Bande-Serrano. 2017. Approximate frequent itemsets mining on data streams using hashing and lexicographic order in hardware. In Proceedings of the IEEE 8th Latin American Symposium on Circuits Systems (LASCAS’17). 1–4.
[19]
Lázaro Bustio-Martínez, Martín Letras-Luna, René Cumplido, Raudel Hernández-León, Claudia Feregrino-Uribe, and José M. Bande-Serrano. 2019. Using hashing and lexicographic order for frequent itemsets mining on data streams. J. Parallel Distrib. Comput. 125, 1 (2019), 58–71.
[20]
M. Cafaro, I. Epicoco, G. Aloisio, and M. Pulimeno. 2017. CUDA based parallel implementations of space-saving on a GPU. In Proceedings of the Int. Conference on High Performance Computins and Simulation (HPCS’17). 707–714.
[21]
Luca Cagliero, Paolo Garza, and Elena Baralis. 2019. ELSA: A multilingual document summarization algorithm based on frequent itemsets and latent semantic analysis. ACM Trans. Inf. Syst. 37, 2, Article 21 (Jan. 2019), 33 pages.
[22]
Alberto Cano. 2018. A survey on graphic processing unit computing for large-scale data mining. WIREs Data Min. Knowl. Discov. 8, 1 (2018), e1232.
[23]
Joong Hyuk Chang and Won Suk Lee. 2003. Finding recent frequent itemsets adaptively over online data streams. In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’03). ACM, New York, NY, 487–492.
[24]
Joong Hyuk Chang and Won Suk Lee. 2005. estWin: Online data stream mining of recent frequent itemsets by sliding window method. J. Inf. Sci. 31, 2 (Apr. 2005), 76–90.
[25]
Chin-Hoong Chee, Jafreezal Jaafar, Izzatdin Abdul Aziz, Mohd Hilmi Hasan, and William Yeoh. 2018. Algorithms for frequent itemset mining: A literature review. Artif. Intell. Rev. 52, 4 (24 March 2018), 2603–2621.
[26]
James Cheng, Yiping Ke, and Wilfred Ng. 2007. A survey on algorithms for mining frequent itemsets over data streams. Knowl. nf. Syst. 16, 1 (2007), 1–27.
[27]
Hyeok-Jun Choi and Cheong Hee Park. 2019. Emerging topic detection in twitter stream based on high utility pattern mining. Expert Syst. Appl. 115, 1 (2019), 27–36.
[28]
Kang-Wook Chon, Sang-Hyun Hwang, and Min-Soo Kim. 2018. GMiner: A fast GPU-based frequent itemset mining method for large-scale data. Inf. Sci. 439-440, 1 (2018), 19–38.
[29]
Clarivate. Web of Science Database. Retrieved on 6 January, 2021 from https://mjl.clarivate.com/home.
[30]
Katherine Compton and Scott Hauck. 2002. Reconfigurable computing: A survey of systems and software. ACM Comput. Surv. 34, 2 (2002), 171–210.
[31]
Qingmin Cui and Xiaobo Guo. 2013. Research on parallel association rules mining on GPU. In Proceedings of the 2nd International Conference on Green Communications and Networks 2012 (GCN’12): Volume 2, Yuhang Yang and Maode Ma (Eds.). Springer, Berlin, 215–222.
[32]
Zhanqi Cui, Xiang Chen, Yongmin Mu, Zhihua Zhang, and Xu Ma. 2018. Mining function call sequence patterns across different versions of the project for defect detection. In Software Analysis, Testing, and Evolution, Lei Bu and Yingfei Xiong (Eds.). Springer International Publishing, 154–169.
[33]
V. Dang and K. Skadron. 2017. Acceleration of frequent itemset mining on FPGA using SDAccel and vivado HLS. In Proceedings of the IEEE 28th International Conference on Application-specific Systems, Architectures and Processors (ASAP’17). 195–200.
[34]
Erik D. Demaine, Alejandro López-Ortiz, and Ian J. Munro. 2002. Frequency estimation of internet packet streams with limited space. In Proceedings of the 10th Annual European Symposium on Algorithms (ESA’02). Springer-Verlag, 348–360.
[35]
Cambridge Dictionary. 2020. Plagiarism. https://dictionary.cambridge.org/dictionary/english/plagiarisml.
[36]
Youcef Djenouri, Asma Belhadi, Philippe Fournier-Viger, and Hamido Fujita. 2018. Mining diversified association rules in big datasets: A cluster/GPU/genetic approach. Inf. Sci. 459, 1 (2018), 117–134.
[37]
Youcef Djenouri, Asma Belhadi, Philippe Fournier-Viger, and Jerry Chun-Wei Lin. 2018. An hybrid multi-core/GPU-based mimetic algorithm for big association rule mining. In Genetic and Evolutionary Computing, Jerry Chun-Wei Lin, Jeng-Shyang Pan, Shu-Chuan Chu, and Chien-Ming Chen (Eds.). Springer, Singapore, 59–65.
[38]
Youcef Djenouri, Ahcene Bendjoudi, Zineb Habbas, Malika Mehdi, and Djamel Djenouri. 2017. Reducing thread divergence in GPU-based bees swarm optimization applied to association rule mining. Concurr. Comput.: Pract. Exp. 729, 9 (2017).
[39]
Youcef Djenouri, Ahcene Bendjoudi, Malika Mehdi, Nadia Nouali-Taboudjemat, and Zineb Habbas. 2015. GPU-based bees swarm optimization for association rules mining. J. Supercomput. 71, 4 (2015), 1318–1344.
[40]
Youcef Djenouri, Marco Comuzzi, and Djamel Djenouri. 2017. SS-FIM: Single scan for frequent itemsets mining in transactional databases. In Advances in Knowledge Discovery and Data Mining, Jinho Kim, Kyuseok Shim, Longbing Cao, Jae-Gil Lee, Xuemin Lin, and Yang-Sae Moon (Eds.). Springer International Publishing, Cham, 644–654.
[41]
Youcef Djenouri, Djamel Djenouri, Asma Belhadi, and Alberto Cano. 2019. Exploiting GPU and cluster parallelism in single scan frequent itemset mining. Inf. Sci. 496, 1 (2019), 363–377.
[42]
Youcef Djenouri, Djamel Djenouri, Asma Belhadi, Philippe Fournier-Viger, Jerry Chun-Wei Lin, and Ahcene Bendjoudi. 2019. Exploiting GPU parallelism in improving bees swarm optimization for mining big transactional databases. Inf. Sci. 496 (2019), 326–342.
[43]
Youcef Djenouri, Djamel Djenouri, and Zineb Habbas. 2018. Intelligent mapping between GPU and cluster computing for discovering big association rules. Appl. Soft Comput. 65, C (Apr. 2018), 387–399.
[44]
Youcef Djenouri and Habiba Drias. 2014. Parallel Bees Swarm Optimization for Association Rules Mining Using GPU Architecture. Springer International Publishing, 50–57.
[45]
Y. Djenouri, H. Drias, Z. Habbas, and H. Mosteghanemi. 2012. Bees swarm optimization for web association rule mining. In Proceedings of the IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology, Volume 03 (WI-IAT’12). IEEE Computer Society, 142–146.
[46]
Guozhu Dong and Jinyan Li. 1999. Efficient mining of emerging patterns: Discovering trends and differences. In Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’99). Association for Computing Machinery, New York, NY, 43–52.
[47]
Vladimir Dzyuba, Matthijs Leeuwen, and Luc Raedt. 2017. Flexible constrained sampling with guarantees for pattern mining. Data Min. Knowl. Discov. 31, 5 (Sep. 2017), 1266–1293.
[48]
A. Ebrahim and J. Khlaifat. 2020. An efficient hardware architecture for finding frequent items in data streams. In Proceedings of the IEEE 38th International Conference on Computer Design (ICCD’20). 113–119.
[49]
Ugo Erra and Bernardino Frola. 2012. Frequent items mining acceleration exploiting fast parallel sorting on the GPU.Proc. Comp. Sci. 9, 1 (2012), 86–95.
[50]
Wenbin Fang, Ka K. Lau, Mian Lu, Xiangye Xiao, Chi K. Lam, Philip Y. Yang, Bingsheng He, Qiong Luo, Pedro V. Sander, and Ke Yang. 2008. Parallel Data Mining on Graphics Processors. Technical Report. The Hong Kong University of Science and Technology.
[51]
Wenbin Fang, Mian Lu, Xiangye Xiao, Bingsheng He, and Qiong Luo. 2009. Frequent itemset mining on graphics processors. In Proceedings of the 5th International Workshop on Data Management on New Hardware (DaMoN’09). ACM, 34–42.
[52]
Philippe Fournier-Viger, Jerry Chun-Wei Lin, Bay Vo, Tin Truong Chi, Ji Zhang, and Hoai Bac Le. 2017. A survey of itemset mining. Data Min. Knowl. Discov. 7, 4 (2017), e1207.
[53]
Minos Garofalakis, Johannes Gehrke, and Rajeev Rastogi. 2016. Data Stream Management: Processing High-Speed Data Streams. Springer.
[54]
Liqiang Geng and Howard J. Hamilton. 2006. Interestingness measures for data mining: A survey. ACM Comput. Surv. 38, 3 (Sep. 2006), 9–41.
[55]
Chris Giannella, Jiawei Han, Jian Pei, Xifeng Yan, and Philip S Yu. 2003. Mining frequent patterns in data streams at multiple time granularities. Next Gener. Data Min. 212, 1 (2003), 191–212.
[56]
Lukasz Golab and M. Tamer Özsu. 2003. Issues in data dtream management. ACM SIGMOD Rec. 32, 2 (Jun. 2003), 5–14.
[57]
Xiaoqi Gu, Yongxin Zhu, Meikang Qiu, Shengyan Zhou, and Chaojun Wang. 2016. Effective FPGA-based enhancement of quantitative frequent itemset mining. In Proceedings of the 5th International Conference on Electrical Engineering and Automatic Control, Bo Huang and Yufeng Yao (Eds.). Springer, Berlin, 1225–1231.
[58]
Xiaoqi Gu, Yongxin Zhu, Shengyan Zhou, Chaojun Wang, Meikang Qiu, and Guoxing Wang. 2016. A real-time FPGA-based accelerator for ECG analysis and diagnosis using association-rule mining. ACM Trans. Embed. Comput. Syst. 15, 2, Article 25 (Feb. 2016), 25:1–25:23 pages.
[59]
M. Gueguen, O. Sentieys, and A. Termier. 2019. Accelerating itemset sampling using satisfiability constraints on FPGA. In Proceedings of the Design, Automation Test in Europe Conference Exhibition (DATE’19). 1046–1051.
[60]
R. Guidotti, G. Rossetti, L. Pappalardo, F. Giannotti, and D. Pedreschi. 2018. Personalized market basket prediction with temporal annotated recurring sequences. IEEE Trans. Knowl. Data Eng. 31, 11 (2019), 2151–2163.
[61]
Jiawei Han, Jian Pei, and Yiwen Yin. 2000. Mining frequent patterns without candidate generation. SIGMOD Rec. 29, 2 (May 2000), 1–12.
[62]
T. Hoang, X. Nguyen, H. Nguyen, N. Truong, D. Le, K. Inoue, and C. Pham. 2017. FPGA-based frequent items counting using matrix of equality comparators. In Proceedings of the IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS’17). 285–288.
[63]
Ya-Han Hu and Yen-Liang Chen. 2006. Mining association rules with multiple minimum supports: A new mining algorithm and a support tuning mechanism. Decis. Support Syst. 42, 1 (2006), 1–24.
[64]
Yuan-Shao Huang, Kun-Ming Yu, Li-Wei Zhou, Ching-Hsien Hsu, and Sheng-Hui Liu. 2013. Accelerating parallel frequent itemset mining on graphics processors with sorting. In Network and Parallel Computing, Ching-Hsien Hsu, Xiaoming Li, Xuanhua Shi, and Ran Zheng (Eds.). Springer, Berlin, 245–256.
[65]
Liheng Jian, Cheng Wang, Ying Liu, Shenshen Liang, Weidong Yi, and Yong Shi. 2013. Parallel data mining techniques on graphics processing unit with compute unified device architecture (CUDA). J. Supercomput. 64, 3 (2013), 942–967.
[66]
H. Jiang and H. Meng. 2017. A parallel FP-growth algorithm based on GPU. In Proceedings of the IEEE 14th International Conference on e-Business Engineering (ICEBE’17). 97–102.
[67]
Ruoming Jin and Gagan Agrawal. 2005. An algorithm for in-core frequent itemset mining on streaming data. In Proceedings of the 5th IEEE International Conference on Data Mining (ICDM’05). IEEE Computer Society, 210–217.
[68]
R. Uday Kiran and P. Krishna Reddy. 2011. Novel techniques to reduce search space in multiple minimum supports-based frequent pattern mining algorithms. In Proceedings of the 14th International Conference on Extending Database Technology (EDBT/ICDT’11). Association for Computing Machinery, New York, NY, 11–20.
[69]
Yusuke Kozawa, Toshiyuki Amagasa, and Hiroyuki Kitagawa. 2012. GPU acceleration of probabilistic frequent itemset mining from uncertain databases. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM’12). ACM, 892–901.
[70]
Yusuke Kozawa, Toshiyuki Amagasa, and Hiroyuki Kitagawa. 2014. Probabilistic frequent itemset mining on a GPU cluster. IEICE Trans. Inf. Syst. 97, 4 (2014), 779–789.
[71]
Gangin Lee, Unil Yun, and Heungmo Ryang. 2015. An uncertainty-based approach: Frequent itemset mining from uncertain data with different item importance. Knowl.-Bas. Syst. 90, 1 (2015), 239–256.
[72]
In Lee. 2017. Big data: Dimensions, evolution, impacts, and challenges. Bus. Horizons 60, 3 (2017), 293–303.
[73]
Victor E. Lee, Ruoming Jin, and Gagan Agrawal. 2014. Frequent Pattern Mining in Data Streams. Springer International Publishing, 199–224.
[74]
Martín Letras, Lázaro Bustio-Martínez, René Cumplido, Raudel Hernández-León, and Claudia Feregrino-Uribe. 2020. On the design of hardware architectures for parallel frequent itemsets mining. Expert Syst. Appl. 157, 1 (2020), 113440.
[75]
Martín Letras, Raudel Hernández-León, and René Cumplido. 2016. Hardware architectures for frequent itemset mining based on equivalence classes partitioning. In Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW’16). 289–294.
[76]
Yubin Li, Yuliang Sun, Guohao Dai, Qiang Xu, Yu Wang, and Huazhong Yang. 2016. Approximate frequent itemset mining for streaming data on FPGA. In Proceedings of the 26th International Conference on Field Programmable Logic and Applications (FPL’16). 1–4.
[77]
Y. Li, J. Xu, and L. Chen. 2015. A new closed frequent itemsets mining algorithm based on GPU. In Proceedings of the 3rd International Conference on Advanced Cloud and Big Data. 291–295.
[78]
Zhenmin Li and Yuanyuan Zhou. 2005. PR-miner: Automatically extracting implicit programming rules and detecting violations in large software code. SIGSOFT Softw. Eng. Notes 30, 5 (Sep. 2005), 306–315.
[79]
Jerry Chun-Wei Lin, Wensheng Gan, Philippe Fournier-Viger, Lu Yang, Qiankun Liu, Jaroslav Frnda, Lukas Sevcik, and Miroslav Voznak. 2016. High utility-itemset mining and privacy-preserving utility mining. Perspect. Sci. 7, 1 (2016), 74–80.
[80]
T. Y. Lin, Xiaohua Hu, and E. Louie. 2003. A fast association rule algorithm based on bitmap and granular computing. In Proceedings of the 12th IEEE International Conference on Fuzzy Systems (FUZZ’03), Vol. 1. 678–683 vol. 1.
[81]
Bing Liu, Wynne Hsu, and Yiming Ma. 1999. Mining association rules with multiple minimum supports. In Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’99). Association for Computing Machinery, New York, NY, 337–341.
[82]
Wei-Chuan Liu, Ken-Hao Liu, and Ming-Syan Chen. 2006. Hardware enhanced mining for association rules. In Proceedings of the 10th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining (PAKDD’06). 729–738.
[83]
O. Loyola-González. 2019. Black-box vs. white-box: Understanding their advantages and weaknesses from a practical point of view. IEEE Access 7, 1 (2019), 154096–154113.
[84]
José María Luna, Philippe Fournier-Viger, and Sebastián Ventura. 2019. Frequent itemset mining: A 25 years review. WIREs Data Min. Knowl. Discov. 9, 6 (2019), e1329.
[85]
Gurmeet Singh Manku and Rajeev Motwani. 2002. Approximate frequency counts over data streams. In Proceedings of the 28th International Conference on Very Large Data Bases (VLDB’02). VLDB Endowment, 346–357.
[86]
Dejan Markovikj, Sonja Gievska, Michal Kosinski, and David Stillwell. 2013. Mining facebook data for predictive personality modeling. Proc. Int. AAAI Conf. Web Soc. Media 7, 1 (Jun. 2013), 1–31.
[87]
Alejandro Mesa, Claudia Feregrino-Uribe, René Cumplido, and José Hernández-Palancar. 2010. A highly parallel algorithm for frequent itemset mining. In Advances in Pattern Recognition, José Francisco Martínez-Trinidad, Jesús Ariel Carrasco-Ochoa, and Josef Kittler (Eds.), Lecture Notes in Computer Science, Vol. 6256. Springer, Berlin, 291–300.
[88]
Ahmed Metwally, Divyakant Agrawal, and Amr El Abbadi. 2006. An integrated efficient solution for computing frequent and top-k elements in data streams. ACM Trans. Database Syst. 31, 3 (Sep. 2006), 1095–1133.
[89]
Ariel Monteserin and Marcelo G. Armentano. 2018. Influence-based approach to market basket analysis. Inf. Syst. 78 (2018), 214–224.
[90]
Stefan Naulaerts, Pieter Meysman, Wout Bittremieux, Trung Nghia Vu, Wim Vanden Berghe, Bart Goethals, and Kris Laukens. 2013. A primer to frequent itemset mining for bioinformatics. Brief. Bioinf. 16, 2 (10 2013), 216–231.
[91]
Victor Nikam and Bal Meshram. 2014. Scalable frequent itemset mining using heterogeneous computing: ParApriori. Int. J. Dist. Parallel Syst. 5, 5 (Sep. 2014), 14.
[92]
Edward R. Omiecinski. 2003. Alternative interest measures for mining associations in databases. IEEE Trans. Knowl. Data Eng. 15, 1 (Jan. 2003), 57–69.
[93]
John D. Owens, Mike Houston, David Luebke, Simon Green, John E. Stone, and James C. Phillips. 2008. GPU computing. Proc. IEEE 96, 5 (2008), 879–899.
[94]
Adrien Prost-Boucle, Frédéric Pétrot, Vincent Leroy, and Hande Alemdar. 2017. Efficient and versatile FPGA acceleration of support counting for stream mining of sequences and frequent itemsets. ACM Trans. Reconfig. Technol. Syst. 10, 3 (May 2017), 21:1–21:25.
[95]
Jiadong Ren Qian Wang, Darryl N. Davis. 016. Mining frequent biological sequences based on bitmap without candidate sequence generation. Comput. Biol. Med. 69, 1 (016), 152–157.
[96]
Sheetal Rathi and C. A. Dhote. 2015. Parallel Implementation of FP-Growth Algorithm on XML Data Using Multiple GPU. Springer India, 581–589.
[97]
Valentina Richthammer, Tobias Scheinert, and Michael Glaß. 2020. Data mining in system-level design space exploration of embedded systems. In Embedded Computer Systems: Architectures, Modeling, and Simulation, Alex Orailoglu, Matthias Jung, and Marc Reichenbach (Eds.). Springer International Publishing, 52–66.
[98]
Oussama Rouane, Hacene Belhadef, and Mustapha Bouakkaz. 2019. Combine clustering and frequent itemsets mining to enhance biomedical text summarization. Expert Syst. Appl. 135, 1 (2019), 362–373.
[99]
Grace L Samson, Joan Lu, and Aminat A Showole. 2014. Mining complex spatial patterns: Issues and techniques. J. Inf. Knowl. Manage. 13, 02 (2014), 1450019.
[100]
Grace L. Samson, Joan Lu, Mistura M. Usman, and Qiang Xu. 2017. Spatial databases: An overview. Ontol. Big Data Consid. Effect. Intell. 1, 1 (2017), 111–149.
[101]
J. Schroeder. 2001. COPLINK: Database Integration and Access for a Law Enforcement Intranet, Final Report. Technical Report 190988. Department of Justice. 132 pages.
[102]
Shaobo Shi, Yue Qi, and Qin Wang. 2013. FPGA acceleration for intersection computation in frequent itemset mining. In Proceedings of the International Conference on Cyber-Enabled Distributed Computers and Knowledge Discovery (CyberC’13). IEEE, 514–519.
[103]
Otis Smart and Lauren Burrell. 2015. Genetic programming and frequent itemset mining to identify feature selection patterns of iEEG and fMRI epilepsy data. Eng. Appl. Artif. Intell. 39, 1 (2015), 198–214.
[104]
Mohammad Karim Sohrabi and Hossein Azgomi. 2019. Finding similar documents using frequent pattern mining methods. Int. J. Uncert. Fuzz. Knowl.-Bas. Syst. 27, 01 (2019), 73–96.
[105]
Liwen Sun, Reynold Cheng, David W. Cheung, and Jiefeng Cheng. 2010. Mining uncertain data with probabilistic guarantees. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’10). ACM, 273–282.
[106]
Song Sun, Michael Steffen, and Joseph Zambreno. 2008. A reconfigurable platform for frequent pattern mining. In Proceedings of the International Conference on Reconfigurable Computing and FPGAs (RECONFIG’08). IEEE Computer Society, 55–60.
[107]
Song Sun and Joseph Zambreno. 2008. Mining association rules with systolic trees. In Proceedings of the International Conference on Field Programmable Logic and Applications (FPL’08). IEEE, 143–148.
[108]
Song Sun and Joseph Zambreno. 2011. Design and analysis of a reconfigurable platform for frequent pattern mining. IEEE Trans. Parallel Distrib. Syst. 22, 1 (2011), 1497–1505.
[109]
Yuliang Sun, Zilong Wang, Sitao Huang, Lanjun Wang, Yu Wang, Rong Luo, and Huazhong Yang. 2014. Accelerating frequent item counting with FPGA. In Proceedings of the ACM/SIGDA International Symposium on Field-programmable Gate Arrays (FPGA’14). ACM, 109–112.
[110]
Experts Systems. 2019. Data mining and criminal intelligence: A new era in crime prevention. https://www.expert.ai/blog/data-mining-and-criminal-intelligence/.
[111]
A. Tehreem, S. G. Khawaja, M. U. Akram, S. A. Khan, and M. Ali. 2017. Parallel architecture for implementation of frequent itemset mining using FP-growth. In Proceedings of the International Conference on Signals and Systems (ICSigSys’17). 92–98.
[112]
Rajkumar Tekchandani, Rajesh Bhatia, and Maninder Singh. 2018. Semantic code clone detection for internet of things applications using reaching definition and liveness analysis. J. Supercomput. 74, 9 (01 Sept. 2018), 4199–4226.
[113]
Jens Teubner, René Müller, and Gustavo Alonso. 2010. FPGA acceleration for the frequent item problem. In Proceedings of the IEEE 26th International Conference on Data Engineering (ICDE’10). IEEE, 669–680.
[114]
Jens Teubner, Rudolf Muller, and Gustavo Alonso. 2011. Frequent item computation on a chip. IEEE Trans. Knowl. Data Eng. 23, 8 (2011), 1169–1181.
[115]
David W. Thöni and Alfred Strey. 2009. Novel strategies for hardware acceleration of frequent itemset mining with the apriori algorithm. In Proceedings of the 19th International Conference on Field Programmable Logic and Appslications (FPL’09). IEEE, 489–492.
[116]
Mayank Tiwary, Abhaya Kumar Sahoo, and Rachita Misra. 2014. Efficient implementation of apriori algorithm on hdfs using GPU. In Proceedings of the International Conference on High Performance Computing and Applications (ICHPCA’14). IEEE, 1–7.
[117]
Da Tong and Viktor Prasanna. 2013. Online heavy hitter detector on FPGA. In Proceedings of the International Conference on Reconfigurable Computing and FPGAs (RECONFIG’13). IEEE, 1–6.
[118]
R. L. Uy and N. Marcos. 2016. Fast 1-itemset frequency count using CUDA. In Proceedings of the IEEE Region 10 Conference (TENCON’16). 210–213.
[119]
Lan Vu and Gita Alaghband. 2015. A self-adaptive method for frequent pattern mining using a CPU-GPU hybrid model. In Proceedings of the Symposium on High Performance Computing (HPC’15). Society for Computer Simulation International, 192–201.
[120]
Chao Wang, Wenqi Lou, Lei Gong, Lihui Jin, Luchao Tan, Yahui Hu, Xi Li, and Xuehai Zhou. 2017. Reconfigurable hardware accelerators: Opportunities, trends, and challenges. arXiv:cs.AR/1712.04771. Retrieved from https://arxiv.org/abs/cs.AR/1712.04771.
[121]
Fei Wang and Bo Yuan. 2014. Parallel frequent pattern mining without candidate generation on GPUs. In Proceedings of the IEEE International Conference on Data Mining Workshop (ICDMW’14). IEEE, 1046–1052.
[122]
Ke Wang, Liu Tang, Jiawei Han, and Junqiang Liu. 2002. Top Down FP-Growth for Association Rule Mining. Springer, Berlin, 334–340.
[123]
Eric W. Weisstein. 2017. Lexicographic Order. From MathWorld—A Wolfram Web Resource. https://mathworld.wolfram.com/LexicographicOrder.html.
[124]
Ying-Hsiang Wen, Jen-Wei Huang, and Ming-Syan Chen. 2008. Hardware-enhanced association rule mining with hashing and pipelining. IEEE Trans. Knowl. Data Eng. 20, 6 (Jun. 2008), 784–795.
[125]
T. Xie, S. Thummalapenta, D. Lo, and C. Liu. 2009. Data mining for software engineering. Computer 42, 8 (Aug. 2009), 55–62.
[126]
H. Xiong, P. Tan, and Vipin Kumar. 2003. Mining strong affinity association patterns in data sets with skewed support distribution. In Proceedings of the 3rd IEEE International Conference on Data Mining. 387–394.
[127]
Liming Yao, Zhongwen Xu, Xiaoyang Zhou, and Benjamin Lev. 2019. Synergies between Association Rules and Collaborative Filtering in Recommender System: An Application to Auto Industry. Springer International Publishing, Cham, 65–80.
[128]
Zhou Yong, Guo He, and Han Jun. 2012. Mining Frequent Itemsets over Recent Data Stream Based on Genetic Algorithm. INTECH Open Access Publisher.
[129]
Mohammed J. Zaki. 2000. Scalable algorithms for association mining. IEEE Trans. Knowl. Data Eng. 12, 3 (2000), 372–390.
[130]
Fan Zhang, Yan Zhang, and Jason Bakos. 2011. GPApriori: GPU-Accelerated frequent itemset mining. In Proceedings of the IEEE International Conference on Cluster Computing (CLUSTER’11). IEEE Computer Society, 590–594.
[131]
Fan Zhang, Yan Zhang, and Jason Bakos. 2013. Accelerating frequent itemset mining on graphics processing units. The J. of Supercomput. 66, 1 (2013), 94–117.
[132]
Yan Zhang, Fan Zhang, Zheming Jin, and Jason D. Bakos. 2013. An FPGA-based accelerator for frequent itemset mining. ACM Trans. Reconf. Technol. Syst. 6, 1, Article 2 (May 2013), 2:1–2:17 pages.
[133]
Jiayi Zhou, Kun-Ming Yu, and Bin-Chang Wu. 2010. Parallel frequent patterns mining algorithm on GPU. In Proceedings of the IEEE International Conference on Systems Man and Cybernetics (SMC’10). IEEE, 435–440.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Computing Surveys
ACM Computing Surveys  Volume 54, Issue 9
December 2022
800 pages
ISSN:0360-0300
EISSN:1557-7341
DOI:10.1145/3485140
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 October 2021
Accepted: 01 June 2021
Revised: 01 February 2021
Received: 01 June 2020
Published in CSUR Volume 54, Issue 9

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Frequent itemsets mining
  2. data streams
  3. GPU/FPGA custom hardware architectures

Qualifiers

  • Survey
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)172
  • Downloads (Last 6 weeks)5
Reflects downloads up to 27 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Development and Implementation of an FPGA-Embedded Multimedia Remote Monitoring System for Information Technology Server Room ManagementInternational Journal of Digital Multimedia Broadcasting10.1155/2024/44205782024Online publication date: 7-Mar-2024
  • (2024)GMiner++: Boosting GPU-based frequent itemset mining by reducing redundant computationsExpert Systems with Applications10.1016/j.eswa.2024.123928250(123928)Online publication date: Sep-2024
  • (2024)Data MiningEncyclopedia of Optimization10.1007/978-3-030-54621-2_108-1(1-7)Online publication date: 11-Aug-2024
  • (2023)Boosting precision crop protection towards agriculture 5.0 via machine learning and emerging technologies: A contextual reviewFrontiers in Plant Science10.3389/fpls.2023.114332614Online publication date: 22-Mar-2023
  • (2022)High-Level Design Optimizations for Implementing Data Stream Sketch Frequency Estimators on FPGAsElectronics10.3390/electronics1115239911:15(2399)Online publication date: 31-Jul-2022
  • (2022)Construction of a Smart Supply Chain for Sand Factory Using the Edge-Computing-Based Deep Learning AlgorithmScientific Programming10.1155/2022/96077552022Online publication date: 1-Jan-2022
  • (2022)Parallel frequent itemsets mining using distributed graphic processing unitsMultimedia Tools and Applications10.1007/s11042-022-13225-z81:30(43873-43895)Online publication date: 1-Dec-2022

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media