Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1457150.1457152acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

CAM conscious integrated answering of frequent elements and top-k queries over data streams

Published: 13 June 2008 Publication History

Abstract

Frequent elements and top-k queries constitute an important class of queries for data stream analysis applications. Certain applications require answers for both frequent elements and top-k queries on the same stream. In addition, the ever increasing data rates call for providing fast answers to the queries, and researchers have been looking towards exploiting specialized hardware for this purpose. Content Addressable Memory(CAM) provides an efficient way of looking up elements and hence are well suited for the class of algorithms that involve lookups. In this paper, we present a fast and efficient CAM conscious integrated solution for answering both frequent elements and top-k queries on the same stream. We call our scheme CAM conscious Space Saving with Stream Summary (CSSwSS), and it can efficiently answer continuous queries. We provide an implementation of the proposed scheme using commodity CAM chips, and the experimental evaluation demonstrates that not only does the proposed scheme outperforms existing CAM conscious techniques by an order of magnitude at query loads of about 10%, but the proposed scheme can also efficiently answer continuous queries.

References

[1]
N. Bandi, A. Metwally, D. Agrawal, and A. E. Abbadi. Fast data stream algorithms using associative memories. In SIGMOD, pages 247--256, Beijing, China, 2007.
[2]
N. Bandi, S. Schnieder, D. Agrawal, and A. E. Abbadi. Hardware Acceleration of Database operations using Content Addressable Memories. In DaMoN, 2005.
[3]
B. Bhattacharjee, N. Abe, K. Goldman, B. Zadrozny, V. R. Chillakuru, M. del Carpio, and C. Apte. Using secure coprocessors for privacy preserving collaborative data mining and analysis. In DaMoN '06, 2006.
[4]
M. Charikar, K. Chen, and M. Farach-Colton. Finding frequent items in data streams. In ICALP'02, pages 693--703, 2002.
[5]
J. Cieslewicz and K. A. Ross. Adaptive Aggregation on Chip Multiprocessors. In VLDB 2007, pages 339--350, 2007.
[6]
G. Das, D. Gunopulos, N. Koudas, and N. Sarkas. Ad-hoc top-k query answering for data streams. In VLDB, pages 183--194, 2007.
[7]
E. D. Demaine, A. López-Ortiz, and J. I. Munro. Frequency estimation of internet packet streams with limited space. In ESA, volume 2461, pages 348--360, 2002.
[8]
R. Fang, B. He, M. Lu, K. Yang, N. K. Govindaraju, Q. Luo, and P. V. Sander. Gpuqp: query co-processing using graphics processors. In SIGMOD '07, pages 1061--1063, 2007.
[9]
M. Fischer and S. Salzberg. Finding a majority among n votes: Solution to problem 81--5. In Journal of Algorithms 3(4), pages 376--379, 1982.
[10]
B. T. Gold, A. Ailamaki, L. Huston, and B. Falsai. Accelerating database operators using a network processor. In DAMON '05, 2005.
[11]
M. Greenwald and S. Khanna. Space-efficient online computation of quantile summaries. In SIGMOD, 2001.
[12]
N. Hardavellas, I. Pandis, R. Johnson, N. G. Mancheril, A. Ailamaki, and B. Falsafi. Database Servers on Chip Multiprocessors: Limitations and Opportunities. In CIDR, pages 79--87, 2007.
[13]
Integrated Devices Technologies, Integrated IP Co-processor IDT 75K62134. http://idt.com/?genID=75K62134&source=products_genericPart_75K62134, 2006.
[14]
Intel Internet Exchange Architecture for Network Processors. Technical report, Intel Corp., 2002.
[15]
Intel IXP 2800 Network Processor Product Brief. http://www.intel.com/design/network/prodbrf/279054.htm, 2006.
[16]
G. S. Manku and R. Motwani. Approximate frequency counts over data streams. In VLDB, 2002.
[17]
A. Metwally, D. Agrawal, and A. E. Abbadi. An integrated efficient solution for computing frequent and top-k elements in data streams. ACM Trans. Database Syst., 31(3):1095--1133, 2006.
[18]
K. Mouratidis, S. Bakiras, and D. Papadias. Continuous monitoring of top-k queries over sliding windows. In SIGMOD, pages 635--646, 2006.
[19]
R. Panigrahy and D. Thomas. Finding Frequent Elements in Non-Bursty Streams. In ESA, pages 53--62, October 2007.
[20]
D. Shah and P. Gupta. Fast updating algorithms for tcams. IEEE Micro, 21(1):36--47, 2001.
[21]
Teja networking systems and teja np application development platform. http://www.teja.com, 2006.
[22]
S. Venkataraman, D. Song, P. Gibbons, and A. Blum. New streaming algorithms for fast detection of superspreaders. In NDSS, 2005.
[23]
J. Zhou, J. Cieslewicz, K. A. Ross, and M. Shah. Improving database performance on simultaneous multithreading processors. In VLDB '05, pages 49--60, 2005.
[24]
G. K. Zipf. Human Behavior and The Principle of Least Effort. Addison-Wesley, Cambridge, MA, 1949.
[25]
M. Zukowski, S. Héman, and P. Boncz. Architecture-conscious hashing. In DaMoN '06, 2006.

Cited By

View all
  • (2019)Parallelizing weighted frequency counting in high-speed network monitoringComputer Communications10.1016/j.comcom.2010.04.02634:4(536-547)Online publication date: 4-Jan-2019
  • (2018)Frequent items counter based on binary decodersIEICE Electronics Express10.1587/elex.15.2018080815:20(20180808-20180808)Online publication date: 2018
  • (2018)VLSI Design of Frequent Items Counting Using Binary Decoders Applied to 8-bit per Item Case-study2018 14th Conference on Ph.D. Research in Microelectronics and Electronics (PRIME)10.1109/PRIME.2018.8430308(161-164)Online publication date: Jul-2018
  • Show More Cited By

Index Terms

  1. CAM conscious integrated answering of frequent elements and top-k queries over data streams

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    DaMoN '08: Proceedings of the 4th international workshop on Data management on new hardware
    June 2008
    57 pages
    ISBN:9781605581842
    DOI:10.1145/1457150
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 June 2008

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. content addressable memory
    2. data streams
    3. frequent elements queries
    4. network processor
    5. stream algorithms
    6. top-k queries

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    DaMoN '08
    Sponsor:
    DaMoN '08: Data Management on New Hardware
    June 13, 2008
    Vancouver, Canada

    Acceptance Rates

    Overall Acceptance Rate 94 of 127 submissions, 74%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 19 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2019)Parallelizing weighted frequency counting in high-speed network monitoringComputer Communications10.1016/j.comcom.2010.04.02634:4(536-547)Online publication date: 4-Jan-2019
    • (2018)Frequent items counter based on binary decodersIEICE Electronics Express10.1587/elex.15.2018080815:20(20180808-20180808)Online publication date: 2018
    • (2018)VLSI Design of Frequent Items Counting Using Binary Decoders Applied to 8-bit per Item Case-study2018 14th Conference on Ph.D. Research in Microelectronics and Electronics (PRIME)10.1109/PRIME.2018.8430308(161-164)Online publication date: Jul-2018
    • (2017)FPGA-based frequent items counting using matrix of equality comparators2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS)10.1109/MWSCAS.2017.8052916(285-288)Online publication date: Aug-2017
    • (2013)An efficient framework for parallel and continuous frequent item monitoringConcurrency and Computation: Practice and Experience10.1002/cpe.318226:18(2856-2879)Online publication date: 5-Dec-2013
    • (2012)Parallelizing the Weighted Lossy Counting Algorithm in High-speed Network MonitoringProceedings of the 2012 Second International Conference on Instrumentation, Measurement, Computer, Communication and Control10.1109/IMCCC.2012.183(757-761)Online publication date: 8-Dec-2012
    • (2009)CoTSProceedings of the 2009 IEEE International Conference on Data Engineering10.1109/ICDE.2009.231(1323-1326)Online publication date: 29-Mar-2009

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media