
Resource aggregation for fault tolerance in integrated services networks

Published: 01 April 1998

Abstract

For several real-time applications, it is critical that the failure of a network component does not lead to unexpected termination or long disruption of service. In this paper, we propose a scheme called RAFT (Resource Aggregation for Fault Tolerance) that guarantees recovery in a timely and resource-efficient manner. RAFT is presented in the framework of the Reliable Backbone (RBone), a virtual network layered on top of an integrated services network. Applications can request fault tolerance against RBone link and node failures. The basic idea of RAFT is to set up, for every fault-tolerant flow, a secondary path that serves as a backup in case the primary path fails. The secondary path resource reservations are aggregated whenever possible to reduce the overhead of providing fault tolerance. We show that the RSVP resource reservation protocol can support RAFT with simple extensions.
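The abstract does not spell out how secondary-path reservations are aggregated, so the following is only an illustrative sketch, not the paper's algorithm. It assumes a single-link-failure model: two flows whose primary paths share no link never need their backups at the same time, so on a shared backup link it is enough to reserve the largest demand any one failure can create. The function name `backup_reservations`, the link labels, and the flow tuple layout are all hypothetical.

```python
from collections import defaultdict


def backup_reservations(flows):
    """Compare aggregated and dedicated backup reservations per link.

    flows: iterable of (bandwidth, primary_links, backup_links).
    Assumption (not from the paper): at most one link fails at a time.
    Returns (aggregated, dedicated), each a dict mapping link -> bandwidth.
    """
    # demand[backup_link][failed_link] is the bandwidth that would be
    # rerouted onto backup_link if failed_link went down.
    demand = defaultdict(lambda: defaultdict(float))
    # dedicated[backup_link] is the reservation without any sharing.
    dedicated = defaultdict(float)

    for bandwidth, primary_links, backup_links in flows:
        for link in backup_links:
            dedicated[link] += bandwidth
            for failed in primary_links:
                demand[link][failed] += bandwidth

    # With a single failure, a backup link only has to carry the worst case
    # over all failure scenarios, not the sum over all flows.
    aggregated = {
        link: max(by_failure.values()) for link, by_failure in demand.items()
    }
    return aggregated, dict(dedicated)


# Two hypothetical flows with disjoint primary paths and a shared backup link.
example_flows = [
    (10.0, ["A-B"], ["A-C", "C-B"]),
    (5.0, ["D-B"], ["D-C", "C-B"]),
]

if __name__ == "__main__":
    aggregated, dedicated = backup_reservations(example_flows)
    for link in sorted(dedicated):
        print(link, "aggregated:", aggregated[link], "dedicated:", dedicated[link])
```

In this example the two primaries cannot fail together under the stated assumption, so the shared link `C-B` needs only the larger of the two backup demands rather than their sum. Protecting against node failures or multiple simultaneous failures would require widening the set of failure scenarios considered.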


      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published in SIGCOMM-CCR Volume 28, Issue 2
