DOI:10.1145/2335484.2335537
tutorial

A tutorial on reliability in publish/subscribe services

Published: 16 July 2012

Abstract

Publish/subscribe services are required by several long-running industrial projects that aim to rethink software systems radically by integrating existing legacy systems into large-scale federated architectures. Such federations consist of a constellation of systems that cooperate through the event notification that publish/subscribe services provide over wide-area networks. These services have been adopted enthusiastically for implementing large-scale federations because their intrinsic decoupling properties improve scalability. However, a key requirement of such federations is that the adopted publish/subscribe service tolerate faults in the network and/or in the computing nodes that make up the federation, without degrading event notification. It is therefore crucial that publish/subscribe services be equipped with proper methods to support reliable event notification.
In this paper, we present the topic of reliable event notification by introducing its definition, a model of the faults to be tolerated, the available methods for recovering from such faults, and how current publish/subscribe products deal with reliability.

Cited By

  • (2022) Open-Source Publish-Subscribe Systems: A Comparative Study. Advanced Information Networking and Applications, pages 105-115. DOI:10.1007/978-3-030-99584-3_10. Online publication date: 31-Mar-2022
  • (2015) Message Races in Data Distribution Service Programs. Proceedings of the 2015 8th International Conference on Database Theory and Application (DTA), pages 33-36. DOI:10.1109/DTA.2015.17. Online publication date: 25-Nov-2015
  • (2013) Enabling Cloud Connectivity Using SDN and NFV Technologies. Mobile Networks and Management, pages 245-258. DOI:10.1007/978-3-319-04277-0_19. Online publication date: 2013

Published In

DEBS '12: Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems
July 2012
410 pages
ISBN:9781450313155
DOI:10.1145/2335484

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. fault model
  2. reliable event notification

Qualifiers

  • Tutorial

Conference

DEBS '12

Acceptance Rates

Overall Acceptance Rate 145 of 583 submissions, 25%
