
DualDB: An Efficient LSM-based Publish/Subscribe Storage System

Published: 27 June 2017

Abstract

Publish/Subscribe systems allow subscribers to monitor for events of interest generated by publishers. Current publish/subscribe query systems are efficient when the subscriptions (queries) are relatively static (for instance, the set of followers in Twitter) or can fit in memory. However, an increasing number of applications in this era of Big Data and the Internet of Things (IoT) are based on a highly dynamic query paradigm, where continuous queries number in the millions and are created and expire at a rate comparable to, or even higher than, that of the data (event) entries. For instance, moving objects such as airplanes, cars, or sensors may continuously generate measurement data such as air pressure or traffic, which are consumed by other moving objects.
In this paper we propose and evaluate a novel publish/subscribe storage architecture, DualDB, based on the popular NoSQL Log-Structured Merge Tree (LSM) storage paradigm, to support high-throughput and dynamic publish/subscribe systems. Our method naturally supports queries on both past and future data and generates instant notifications, desirable properties missing from many previous systems. We implemented and experimentally evaluated our methods on the popular LSM-based LevelDB system, using real datasets. Our results show that we can achieve significantly higher throughput than state-of-the-art baselines.
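
To make the "past and future data" property concrete, the following is a minimal in-memory sketch of a store that keeps both events and subscriptions. It is not the DualDB implementation and does not use an LSM tree or LevelDB; the class name `DualStoreSketch`, its methods, and the assumption that a subscription is an inclusive key range are all hypothetical and chosen only for illustration.

```python
from bisect import bisect_left, bisect_right


class DualStoreSketch:
    """Toy stand-in (hypothetical): one store for events, one for subscriptions."""

    def __init__(self):
        self.keys = []            # event keys, kept sorted
        self.values = []          # event values, parallel to self.keys
        self.subscriptions = {}   # sub_id -> (low, high), inclusive key range

    def subscribe(self, sub_id, low, high):
        """Register a range query and return the past events that match it."""
        self.subscriptions[sub_id] = (low, high)
        start = bisect_left(self.keys, low)
        end = bisect_right(self.keys, high)
        return list(zip(self.keys[start:end], self.values[start:end]))

    def unsubscribe(self, sub_id):
        """Expire a subscription; unknown ids are ignored."""
        self.subscriptions.pop(sub_id, None)

    def publish(self, key, value):
        """Store an event and return the ids of subscriptions to notify."""
        idx = bisect_right(self.keys, key)
        self.keys.insert(idx, key)
        self.values.insert(idx, value)
        # Linear scan for clarity; a real system would index the subscriptions.
        return [sub_id for sub_id, (low, high) in self.subscriptions.items()
                if low <= key <= high]


store = DualStoreSketch()
store.publish(10, "a")                      # stored; no subscribers yet
past = store.subscribe("q1", 5, 15)         # matches the past event (10, "a")
notified = store.publish(12, "c")           # future event notifies "q1"
```

In this sketch, `subscribe` answers the query over data already stored, while `publish` checks each new event against the live subscriptions, so a notification is produced at write time rather than by polling.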

Published In

SSDBM '17: Proceedings of the 29th International Conference on Scientific and Statistical Database Management
June 2017
373 pages
ISBN: 9781450352826
DOI: 10.1145/3085504
In-Cooperation

  • Northwestern University

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. Big Data
  2. Continuous Query
  3. Instant Notification
  4. LevelDB
  5. Log-Structured Merge Tree
  6. NoSQL
  7. Publish/Subscribe
  8. Triggers

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

SSDBM '17

Acceptance Rates

Overall Acceptance Rate 56 of 146 submissions, 38%

Article Metrics

  • Downloads (Last 12 months): 5
  • Downloads (Last 6 weeks): 1
Reflects downloads up to 09 Nov 2024

Cited By

  • (2021) Applications of geospatial big data in the Internet of Things. Transactions in GIS 26:1 (41-71). DOI: 10.1111/tgis.12846. Online publication date: 24-Sep-2021
  • (2020) SineKV: Decoupled Secondary Indexing for LSM-based Key-Value Stores. 2020 IEEE 40th International Conference on Distributed Computing Systems (ICDCS) (1112-1122). DOI: 10.1109/ICDCS47774.2020.00071. Online publication date: Nov-2020
  • (2020) BAD to the bone: Big Active Data at its core. The VLDB Journal. DOI: 10.1007/s00778-020-00616-7. Online publication date: 23-May-2020
  • (2019) Real-Time Notifications. Smart Marketing With the Internet of Things (114-122). DOI: 10.4018/978-1-5225-5763-0.ch006. Online publication date: 2019
  • (2019) High-throughput publish/subscribe on top of LSM-based storage. Distributed and Parallel Databases 37:1 (101-132). DOI: 10.1007/s10619-018-7236-2. Online publication date: 1-Mar-2019
  • (2018) A Comparative Study of Secondary Indexing Techniques in LSM-based NoSQL Databases. Proceedings of the 2018 International Conference on Management of Data (551-566). DOI: 10.1145/3183713.3196900. Online publication date: 27-May-2018
