Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3448016.3457318acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

MuSE Graphs for Flexible Distribution of Event Stream Processing in Networks

Published: 18 June 2021 Publication History

Abstract

Complex event processing (CEP) enables reactive and predictive applications through the continuous evaluation of queries over streams of event data. In a network of event sources, efficient query evaluation is achieved through distribution: Queries are split into operators (query decomposition), which are then assigned to some of the nodes (operator placement). Yet, existing solutions limit the decomposition to the operator hierarchy of a query, ignoring possible rewritings of it, and place each operator at exactly one node in the network. That neglects optimizations based on pattern composition through multiple queries as results are always gathered at a single sink node.
In this paper, we propose a new evaluation model for CEP, coined Multi-Sink Evaluation (MuSE) graphs. It incorporates arbitrary projections of queries for distribution and assigns them to potentially many nodes. We prove correctness of query evaluation with MuSE graphs and provide a cost model to assess its efficiency. Since the construction of cost-optimal MuSE graphs is intractable, we present an approximation algorithm and several pruning trategies. Our evaluation results show that MuSE graphs reduce network transmission costs by up to three orders of magnitude over baseline strategies.

Supplementary Material

MP4 File (3448016.3457318.mp4)
Complex event processing (CEP) enables reactive and predictive applications through the continuous evaluation of queries over streamsof event data. In a network of event sources, efficient query evaluation is achieved through distribution: Queries are split into operators(query decomposition), which are then assigned to some of the nodes(operator placement). Yet, existing solutions show two fundamentalissues: They limit the decomposition to the operator hierarchy of aquery, ignoring possible rewritings of it, and place each operator atexactly one node in the network. That neglects potential optimizations related to the composition of patterns through multiple queriesas results are always gathered at a single sink node.In this paper, we propose a new evaluation model for CEP, coinedMulti-Sink Evaluation (MuSE) graphs. It incorporates arbitrary projections of queries for distribution and assigns them to potentiallymany nodes. We prove correctness of query evaluation with MuSEgraphs and provide a cost model to assess its efficiency. Since theconstruction of cost-optimal MuSE graphs is intractable, we presentan approximation algorithm and several pruning strategies. Our evaluation results show that MuSE graphs reduce network transmissioncosts by up to three orders of magnitude over baseline strategies,while an implementation in a state-of-the-art framework for dishu-berlin.detributed computing highlights significant latency improvements.

References

[1]
Jagrati Agrawal, Yanlei Diao, Daniel Gyllstrom, and Neil Immerman. 2008. Efficient pattern matching over event streams. In Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2008, Vancouver, BC, Canada, June 10--12, 2008, Jason Tsong-Li Wang (Ed.). ACM, 147--160. https://doi.org/10.1145/1376616.1376634
[2]
Mert Akdere and Nesime Tatbul. 2008. Plan-based complex event detection across distributed sources. Proceedings of the VLDB Endowment, Vol. 1, 1 (2008), 66--77.
[3]
Arvind Arasu, Brian Babcock, Shivnath Babu, John Cieslewicz, Mayur Datar, Keith Ito, Rajeev Motwani, Utkarsh Srivastava, and Jennifer Widom. 2016. STREAM: The Stanford Data Stream Management System. In Data Stream Management - Processing High-Speed Data Streams, Minos N. Garofalakis, Johannes Gehrke, and Rajeev Rastogi (Eds.). Springer, 317--336. https://doi.org/10.1007/978--3--540--28608-0_16
[4]
Alexander Artikis, Alessandro Margara, Martin Ugarte, Stijn Vansummeren, and Matthias Weidlich. 2017. Complex Event Recognition Languages: Tutorial. In Proceedings of the 11th ACM International Conference on Distributed and Event-based Systems, DEBS 2017, Barcelona, Spain, June 19--23, 2017. ACM, 7--10. https://doi.org/10.1145/3093742.3095106
[5]
Alexander Artikis, Matthias Weidlich, Francc ois Schnitzler, Ioannis Boutsis, Thomas Liebig, Nico Piatkowski, Christian Bockermann, Katharina Morik, Vana Kalogeraki, Jakub Marecek, Avigdor Gal, Shie Mannor, Dimitrios Gunopulos, and Dermot Kinane. 2014. Heterogeneous Stream Processing and Crowdsourcing for Urban Traffic Management. In Proceedings of the 17th International Conference on Extending Database Technology, EDBT 2014, Athens, Greece, March 24--28, 2014, Sihem Amer-Yahia, Vassilis Christophides, Anastasios Kementsietsidis, Minos N. Garofalakis, Stratos Idreos, and Vincent Leroy (Eds.). OpenProceedings.org, 712--723. https://doi.org/10.5441/002/edbt.2014.77
[6]
Roger S. Barga, Jonathan Goldstein, Mohamed H. Ali, and Mingsheng Hong. 2007. Consistent Streaming Through Time: A Vision for Event Stream Processing. In CIDR 2007, Third Biennial Conference on Innovative Data Systems Research, Asilomar, CA, USA, January 7--10, 2007, Online Proceedings. www.cidrdb.org, 363--374. http://cidrdb.org/cidr2007/papers/cidr07p42.pdf
[7]
Shahid H. Bokhari. 1981. A Shortest Tree Algorithm for Optimal Assignments Across Space and Time in a Distributed Processor System. IEEE Trans. Software Eng., Vol. 7, 6 (1981), 583--589. https://doi.org/10.1109/TSE.1981.226469
[8]
Georgios Chatzimilioudis, Alfredo Cuzzocrea, Dimitrios Gunopulos, and Nikos Mamoulis. 2013. A novel distributed framework for optimizing query routing trees in wireless sensor networks via optimal operator placement. J. Comput. System Sci., Vol. 79, 3 (2013), 349--368.
[9]
Jianxia Chen, Lakshmish Ramaswamy, David K. Lowenthal, and Shivkumar Kalyanaraman. 2012a. Comet: Decentralized Complex Event Detection in Mobile Delay Tolerant Networks. In 13th IEEE International Conference on Mobile Data Management, MDM 2012, Bengaluru, India, July 23--26, 2012, Karl Aberer, Anupam Joshi, Sougata Mukherjea, Dipanjan Chakraborty, Hua Lu, Nalini Venkatasubramanian, and Salil S. Kanhere (Eds.). IEEE Computer Society, 131--136. https://doi.org/10.1109/MDM.2012.18
[10]
Jianxia Chen, Lakshmish Ramaswamy, David K Lowenthal, and Shivkumar Kalyanaraman. 2012b. Comet: Decentralized complex event detection in mobile delay tolerant networks. In 2012 IEEE 13th International Conference on Mobile Data Management. IEEE, 131--136.
[11]
Gianpaolo Cugola and Alessandro Margara. 2013. Deployment strategies for distributed complex event processing. Computing, Vol. 95, 2 (2013), 129--156.
[12]
Raul Castro Fernandez, Matthias Weidlich, Peter R. Pietzuch, and Avigdor Gal. 2014. Scalable stateful stream processing for smart grids. In The 8th ACM International Conference on Distributed Event-Based Systems, DEBS '14, Mumbai, India, May 26--29, 2014, Umesh Bellur and Ravi Kothari (Eds.). ACM, 276--281. https://doi.org/10.1145/2611286.2611326
[13]
Ioannis Flouris, Nikos Giatrakos, Antonios Deligiannakis, and Minos N. Garofalakis. 2020. Network-wide complex event processing over geographically distributed data sources. Inf. Syst., Vol. 88 (2020). https://doi.org/10.1016/j.is.2019.101442
[14]
Nikos Giatrakos, Elias Alevizos, Alexander Artikis, Antonios Deligiannakis, and Minos N. Garofalakis. 2020. Complex event recognition in the Big Data era: a survey. VLDB J., Vol. 29, 1 (2020), 313--352. https://doi.org/10.1007/s00778-019-00557-w
[15]
Jonathan Goldstein, Ahmed S. Abdelhamid, Mike Barnett, Sebastian Burckhardt, Badrish Chandramouli, Darren Gehring, Niel Lebeck, Christopher Meiklejohn, Umar Farooq Minhas, Ryan Newton, Rahee Peshawaria, Tal Zaccai, and Irene Zhang. 2020. A.M.B.R.O.S.I.A: Providing Performant Virtual Resiliency for Distributed Applications. Proc. VLDB Endow., Vol. 13, 5 (2020), 588--601. http://www.vldb.org/pvldb/vol13/p588-goldstein.pdf
[16]
InSystems. 2021. proANT Transport Robots. http://www.insystems.de/en/produkte/proant-transport-roboter/.
[17]
Matteo Nardelli, Valeria Cardellini, Vincenzo Grassi, and Francesco LO PRESTI. 2019. Efficient Operator Placement for Distributed Data Stream Processing Applications. IEEE Transactions on Parallel and Distributed Systems (2019).
[18]
Peter R. Pietzuch, Jonathan Ledlie, Jeffrey Shneidman, Mema Roussopoulos, Matt Welsh, and Margo I. Seltzer. 2006. Network-Aware Operator Placement for Stream-Processing Systems. In Proceedings of the 22nd International Conference on Data Engineering, ICDE 2006, 3--8 April 2006, Atlanta, GA, USA, Ling Liu, Andreas Reuter, Kyu-Young Whang, and Jianjun Zhang (Eds.). IEEE Computer Society, 49. https://doi.org/10.1109/ICDE.2006.105
[19]
Medhabi Ray, Chuan Lei, and Elke A. Rundensteiner. 2016. Scalable Pattern Sharing on Event Streams. In Proceedings of the 2016 International Conference on Management of Data, SIGMOD Conference 2016, San Francisco, CA, USA, June 26 - July 01, 2016, Fatma Ö zcan, Georgia Koutrika, and Sam Madden (Eds.). ACM, 495--510. https://doi.org/10.1145/2882903.2882947
[20]
Samira Akili and Matthias Weidlich. 2021. MuSE Graphs for Flexible Distribution of Event Stream Processing in Networks -- Technical Report. https://github.com/samieze/aMuSE .
[21]
Nicholas Poul Schultz-Møller, Matteo Migliavacca, and Peter R. Pietzuch. 2009. Distributed complex event processing with query rewriting. In Proceedings of the Third ACM International Conference on Distributed Event-Based Systems, DEBS 2009, Nashville, Tennessee, USA, July 6--9, 2009, Aniruddha S. Gokhale and Douglas C. Schmidt (Eds.). ACM. https://doi.org/10.1145/1619258.1619264
[22]
Utkarsh Srivastava, Kamesh Munagala, and Jennifer Widom. 2005. Operator placement for in-network stream query processing. In Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART sym posium on Principles of database systems. ACM, 250--258.
[23]
Fabrice Starks, Vera Goebel, Stein Kristiansen, and Thomas Plagemann. 2018. Mobile Distributed Complex Event Processing - Ubi Sumus? Quo Vadimus? In Mobile Big Data, A Roadmap from Models to Technologies, Georgios Skourletopoulos, George Mastorakis, Constandinos X. Mavromoustakis, Ciprian Dobre, and Evangelos Pallis (Eds.). Lecture Notes on Data Engineering and Communications Technologies, Vol. 10. Springer, 147--180. https://doi.org/10.1007/978--3--319--67925--9_7
[24]
Kia Teymourian, Malte Rohde, and Adrian Paschke. 2012. Knowledge-based processing of complex stock market events. In 15th International Conference on Extending Database Technology, EDBT '12, Berlin, Germany, March 27--30, 2012, Proceedings, Elke A. Rundensteiner, Volker Markl, Ioana Manolescu, Sihem Amer-Yahia, Felix Naumann, and Ismail Ari (Eds.). ACM, 594--597. https://doi.org/10.1145/2247596.2247674
[25]
John Wilkes. 2020. Yet more Google compute cluster trace data. Google research blog. Posted at https://ai.googleblog.com/2020/04/yet-more-google-compute-cluster-trace.html.
[26]
Lei Ying, Zhen Liu, Don Towsley, and Cathy H Xia. 2008. Distributed operator placement and data caching in large-scale sensor networks. In IEEE INFOCOM 2008-The 27th Conference on Computer Communications. IEEE, 977--985.
[27]
Haopeng Zhang, Yanlei Diao, and Neil Immerman. 2014. On complexity and optimization of expensive queries in complex event processing. In International Conference on Management of Data, SIGMOD 2014, Snowbird, UT, USA, June 22--27, 2014, Curtis E. Dyreson, Feifei Li, and M. TamerÖzsu (Eds.). ACM, 217--228. https://doi.org/10.1145/2588555.2593671

Cited By

View all
  • (2024)On-Demand Pattern Aggregation in Event NetworksProceedings of the International Workshop on Big Data in Emergent Distributed Environments10.1145/3663741.3664781(1-6)Online publication date: 9-Jun-2024
  • (2024)DecoPa: Query Decomposition for Parallel Complex Event ProcessingProceedings of the ACM on Management of Data10.1145/36549352:3(1-26)Online publication date: 30-May-2024
  • (2024)Efficient multi-query evaluation for distributed CEP through predicate-based push–pull plansInformation Systems10.1016/j.is.2023.102250120:COnline publication date: 4-Mar-2024
  • Show More Cited By

Index Terms

  1. MuSE Graphs for Flexible Distribution of Event Stream Processing in Networks

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGMOD '21: Proceedings of the 2021 International Conference on Management of Data
    June 2021
    2969 pages
    ISBN:9781450383431
    DOI:10.1145/3448016
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 18 June 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. complex event processing
    2. operator placement
    3. query distribution

    Qualifiers

    • Research-article

    Conference

    SIGMOD/PODS '21
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)58
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 03 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)On-Demand Pattern Aggregation in Event NetworksProceedings of the International Workshop on Big Data in Emergent Distributed Environments10.1145/3663741.3664781(1-6)Online publication date: 9-Jun-2024
    • (2024)DecoPa: Query Decomposition for Parallel Complex Event ProcessingProceedings of the ACM on Management of Data10.1145/36549352:3(1-26)Online publication date: 30-May-2024
    • (2024)Efficient multi-query evaluation for distributed CEP through predicate-based push–pull plansInformation Systems10.1016/j.is.2023.102250120:COnline publication date: 4-Mar-2024
    • (2023)INEv: In-Network Evaluation for Event Stream ProcessingProceedings of the ACM on Management of Data10.1145/35889551:1(1-26)Online publication date: 30-May-2023
    • (2022)Dynamic Performance Analysis of STEP System in Internet of Vehicles Based on Queuing TheoryComputational Intelligence and Neuroscience10.1155/2022/83220292022Online publication date: 10-Apr-2022

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media