Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2554688.2554766acmconferencesArticle/Chapter ViewAbstractPublication PagesfpgaConference Proceedingsconference-collections
research-article

Accelerating frequent item counting with FPGA

Published: 26 February 2014 Publication History

Abstract

Frequent item counting is one of the most important operations in time series data mining algorithms, and the space saving algorithm is a widely used approach to solving this problem. With the rapid rising of data input speeds, the most challenging problem in frequent item counting is to meet the requirement of wire-speed processing. In this paper, we propose a streaming oriented PE-ring framework on FPGA for counting frequent items. Compared with the best existing FPGA implementation, our basic PE-ring framework saves 50% lookup table resources cost and achieves the same throughput in a more scalable way. Furthermore, we adopt SIMD-like cascaded filter for further performance improvements, which outperforms the previous work by up to 3.24 times in some data distributions.

References

[1]
Liu, J., Pan, Y., Wang, K., et al. Mining frequent item sets by opportunistic projection. Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2002: 229--238.
[2]
Beil, F., Ester, M., Xu, X. Frequent term-based text clustering. Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2002: 436--442.
[3]
Cormode, G., Hadjieleftheriou, M. Finding frequent items in data streams. Proceedings of the VLDB Endowment, 2008, 1(2): 1530--1541.
[4]
Chakrabarti, A., Cormode, G., McGregor, A. A near-optimal algorithm for computing the entropy of a stream. Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms. Society for Industrial and Applied Mathematics, 2007: 328--335.
[5]
Roy, P., Teubner, J., Alonso, G. Efficient frequent item counting in multi-core hardware Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2012: 1451--1459.
[6]
Teubner, J., Muller, R., Alonso, G. Frequent item computation on a chip. Knowledge and Data Engineering, IEEE Transactions on, 2011, 23(8): 1169--1181.
[7]
Metwally, A., Agrawal, D., Abbadi, A. E. An integrated efficient solution for computing frequent and top-k elements in data streams. ACM Transactions on Database Systems (TODS), 2006, 31(3): 1095--1133.
[8]
Wang, Z., Huang, S., Wang, L., et al. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. Proceedings of the ACM/SIGDA international symposium on Field programmable gate arrays. ACM, 2013: 53--62.
[9]
Montemurro, M. A. Beyond the Zipf-Mandelbrot law in quantitative linguistics. Physica A: Statistical Mechanics and its Applications, 2001, 300(3): 567--578.
[10]
Gabaix, X. Zipf's Law and the Growth of Cities. The American Economic Review, 1999, 89(2): 129--132.

Cited By

View all
  • (2023)Finding the Top-K Heavy Hitters in Data Streams: A Reconfigurable Accelerator Based on an FPGA-Optimized AlgorithmElectronics10.3390/electronics1211237612:11(2376)Online publication date: 24-May-2023
  • (2023)Fast approximation of the top‐k items in data streams using FPGAsIET Computers & Digital Techniques10.1049/cdt2.1205317:2(60-73)Online publication date: 19-Feb-2023
  • (2022)High-Level Design Optimizations for Implementing Data Stream Sketch Frequency Estimators on FPGAsElectronics10.3390/electronics1115239911:15(2399)Online publication date: 31-Jul-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
FPGA '14: Proceedings of the 2014 ACM/SIGDA international symposium on Field-programmable gate arrays
February 2014
272 pages
ISBN:9781450326711
DOI:10.1145/2554688
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 February 2014

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. fpga
  2. frequent item counting
  3. time series

Qualifiers

  • Research-article

Conference

FPGA'14
Sponsor:

Acceptance Rates

FPGA '14 Paper Acceptance Rate 30 of 110 submissions, 27%;
Overall Acceptance Rate 125 of 627 submissions, 20%

Upcoming Conference

FPGA '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)13
  • Downloads (Last 6 weeks)0
Reflects downloads up to 27 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Finding the Top-K Heavy Hitters in Data Streams: A Reconfigurable Accelerator Based on an FPGA-Optimized AlgorithmElectronics10.3390/electronics1211237612:11(2376)Online publication date: 24-May-2023
  • (2023)Fast approximation of the top‐k items in data streams using FPGAsIET Computers & Digital Techniques10.1049/cdt2.1205317:2(60-73)Online publication date: 19-Feb-2023
  • (2022)High-Level Design Optimizations for Implementing Data Stream Sketch Frequency Estimators on FPGAsElectronics10.3390/electronics1115239911:15(2399)Online publication date: 31-Jul-2022
  • (2021)FPGA/GPU-based Acceleration for Frequent Itemsets Mining: A Comprehensive ReviewACM Computing Surveys10.1145/347228954:9(1-35)Online publication date: 8-Oct-2021
  • (2021)Convolution without multiplication: A general speed up strategy for CNNsScience China Technological Sciences10.1007/s11431-021-1936-2Online publication date: 11-Nov-2021
  • (2021)Fast Approximation of the Top-k Items in Data Streams Using a Reconfigurable AcceleratorApplied Reconfigurable Computing. Architectures, Tools, and Applications10.1007/978-3-030-79025-7_1(3-17)Online publication date: 23-Jun-2021
  • (2020)An Efficient Hardware Architecture for Finding Frequent Items in Data Streams2020 IEEE 38th International Conference on Computer Design (ICCD)10.1109/ICCD50377.2020.00034(113-119)Online publication date: Oct-2020
  • (2019)Hardware Accelerators for Data SearchFPGA-BASED Hardware Accelerators10.1007/978-3-030-20721-2_3(69-103)Online publication date: 31-May-2019
  • (2017)CaribouProceedings of the VLDB Endowment10.14778/3137628.313763210:11(1202-1213)Online publication date: 1-Aug-2017
  • (2017)On the design of hardware-software architectures for frequent itemsets mining on data streamsJournal of Intelligent Information Systems10.1007/s10844-017-0461-850:3(415-440)Online publication date: 16-May-2017
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media