End-to-end support for joins in large-scale publish/subscribe systems

Published: 01 August 2008

Abstract

We address the problem of supporting a large number of select-join subscriptions for wide-area publish/subscribe. Subscriptions are joins over different tables, with varying interests expressed as range selection conditions over table attributes. Naive schemes, such as computing and sending join results from a server, are inefficient because they produce redundant data, and are unable to share dissemination costs across subscribers and events. We propose a novel, scalable scheme that group-processes and disseminates a general mix of multi-way select-join subscriptions. We also propose a simple and application-agnostic extension to content-driven networks (CN), which further improves sharing of dissemination costs. Experimental evaluations show that our schemes can generate orders of magnitude lower network traffic at very low processing cost. Our extension to CN can further reduce traffic by another order of magnitude, with almost no increase in notification latency.
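The redundancy of the naive scheme can be illustrated with a minimal sketch. It is not the scheme proposed in the paper. It only assumes a two-way equi-join between hypothetical tables `R` and `S`, with each subscription expressed as closed ranges over the hypothetical attributes `R.a` and `S.b`. All names, rows, and subscribers below are invented for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Subscription:
    """Hypothetical select-join subscription: R JOIN S ON R.key = S.key,
    with a closed range condition on one attribute of each table."""
    name: str
    r_range: tuple  # (lo, hi) bounds on R.a
    s_range: tuple  # (lo, hi) bounds on S.b


def in_range(value, bounds):
    lo, hi = bounds
    return lo <= value <= hi


def naive_notify(subs, r_rows, new_s_row):
    """Naive server-side scheme: for a new S event, compute each
    subscription's join results separately and send them to that subscriber."""
    messages = []
    for sub in subs:
        if not in_range(new_s_row["b"], sub.s_range):
            continue
        for r in r_rows:
            if r["key"] == new_s_row["key"] and in_range(r["a"], sub.r_range):
                messages.append((sub.name, (r["key"], r["a"], new_s_row["b"])))
    return messages


# Illustrative data only (assumed, not taken from the paper).
r_rows = [{"key": 1, "a": 10}, {"key": 1, "a": 20}, {"key": 2, "a": 15}]
subs = [
    Subscription("alice", (0, 25), (0, 100)),
    Subscription("bob", (5, 30), (40, 60)),
    Subscription("carol", (15, 25), (0, 50)),
]

messages = naive_notify(subs, r_rows, {"key": 1, "b": 50})
distinct_results = {result for _, result in messages}

print(len(messages), "messages sent for", len(distinct_results), "distinct join results")
```

In this toy setting the server sends more messages than there are distinct join results, because subscribers with overlapping ranges each receive their own copy. This is the kind of redundancy that sharing dissemination costs across subscribers is meant to avoid.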

Published In

Proceedings of the VLDB Endowment, Volume 1, Issue 1
August 2008
1216 pages

Publisher

VLDB Endowment
