Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3555050.3569140acmconferencesArticle/Chapter ViewAbstractPublication PagesconextConference Proceedingsconference-collections
research-article
Open access

R-MPLS: recursive protection for highly dependable MPLS networks

Published: 30 November 2022 Publication History

Abstract

Most modern communication networks feature fast rerouting mechanisms in the data plane. However, design and configuration of such mechanisms even under multiple failures is known to be difficult. In order to increase the resilience of the widely deployed MPLS networks, we propose R-MPLS, an alternative link protection mechanism for MPLS networks that uses recursive protection and can route around multiple simultaneously failed links. Our new R-MPLS approach comes with strong theoretical underpinnings, is implementable in a fully distributed way and executable on existing MPLS hardware, and formally guarantees that no forwarding loops are introduced. We implement our R-MPLS protection in an automated tool which overcomes the complexity of configuring such resilient network data planes, and report on the benefits of recursive protection in realistic network topologies. We find that R-MPLS significantly increases network robustness against multiple failures, with only moderate increase in the number of forwarding rules and communication overhead (both comparable to industry-standards like RSVP-TE FRR).

References

[1]
Anubhavnidhi Abhashkumar, Aaron Gember-Jacobson, and Aditya Akella. 2020. Tiramisu: Fast Multilayer Network Verification. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI). USENIX, 201--219.
[2]
L. Andersson, I. Minei, and B. Thomas. 2007. Multiprotocol Label Switching Architecture. RFC 5036. RFC Editor. 1--135 pages.
[3]
David Applegate and Edith Cohen. 2003. Making Intra-domain Routing Robust to Changing and Uncertain Traffic Demands: Understanding Fundamental Tradeoffs. In Proceedings of the 2003 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications (Karlsruhe, Germany) (SIGCOMM'03). ACM, New York, NY, USA, 313--324.
[4]
Jasbir Singh Arora. 2017. Introduction to Optimum Design (fourth ed.). Academic Press, Boston.
[5]
Alia K Atlas and Alex Zinin. 2008. Basic specification for IP fast-reroute: loop-free alternates. IETF RFC 5286.
[6]
R. Beckett, A. Gupta, R. Mahajan, and D. Walker. 2017. A General Approach to Network Configuration Verification. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication (SIGCOMM'17). ACM, 155--168.
[7]
R. Beckett, R. Mahajan, T. Millstein, J. Padhye, and D. Walker. 2016. Don't Mind the Gap: Bridging Network-Wide Objectives and Device-Level Configurations. In Proc. ACM SIGCOMM. ACM, 328--341.
[8]
S. Bryant, C. Filsfils, S. Previdi, M. Shand, and N. So. 2015. Remote Loop-Free Alternate (LFA) Fast Reroute (FRR). RFC 7490. RFC Editor.
[9]
S. Bryant, S. Previdi, and M. Shand. 2013. A Framework for IP and MPLS Fast Reroute Using Not-Via Addresses. RFC 6981. RFC Editor.
[10]
Marco Chiesa, Andrzej Kamisinski, Jacek Rak, Gabor Retvari, and Stefan Schmid. 2021. A Survey of Fast-Recovery Mechanisms in Packet-Switched Networks. IEEE Communications Surveys and Tutorials (COMST) (2021).
[11]
Marco Chiesa, Ilya Nikolaevskiy, Slobodan Mitrović, Andrei Gurtov, Aleksander Madry, Michael Schapira, and Scott Shenker. 2016. On the resiliency of static forwarding tables. IEEE/ACM Transactions on Networking 25, 2 (2016), 1133--1146.
[12]
Marco Chiesa, Ilya Nikolaevskiy, Slobodan Mitrović, Aurojit Panda, Andrei Gurtov, Aleksander Maidry, Michael Schapira, and Scott Shenker. 2016. The quest for resilient (static) forwarding tables. In Proc. IEEE INFOCOM.
[13]
Marco Chiesa, Roshan Sedar, Gianni Antichi, Michael Borokhovich, Andrzej Kamisiński, Georgios Nikolaidis, and Stefan Schmid. 2019. Purr: A primitive for reconfigurable fast reroute: Hope for the best and program for the worst. In Proceedings of the 15th International Conference on Emerging Networking Experiments And Technologies. 1--14.
[14]
Richard Chirgwin. 2017. Google routing blunder sent Japan's Internet dark on Friday. In https://www.theregister.co.uk/2017/08/27/google_routing_blunder_sent_japans_internet_dark/.
[15]
Duluth News Tribune. 2018. Human error to blame in Minnesota 911 outage. In https://www.ems1.com/911/articles/389343048-Officials-Human-error-to-blame-in-Minn-911-outage/.
[16]
Theodore Elhourani, Abishek Gopalan, and Srinivasan Ramasubramanian. 2014. IP fast rerouting for multi-link failures. In Proc. IEEE INFOCOM. ACM, 2148--2156.
[17]
EthernetAlliance.org. 2009. Ethernet Jumbo Frames. http://www.ethernetalliance.org/wp-content/uploads/2011/10/EA-Ethernet-Jumbo-Frames-v0-1.pdf
[18]
Joan Feigenbaum, Brighten Godfrey, Aurojit Panda, Michael Schapira, Scott Shenker, and Ankit Singla. 2012. Brief announcement: On the resilience of routing tables. In Proceedings of the 2012 ACM symposium on Principles of distributed computing. 237--238.
[19]
Clarence Filsfils, Nagendra Kumar Nainar, Carlos Pignataro, Juan Camilo Cardona, and Pierre Francois. 2015. The segment routing architecture. In 2015 IEEE Global Communications Conference (GLOBECOM). IEEE, 1--6.
[20]
Klaus-Tycho Foerster, Juho Hirvonen, Yvonne-Anne Pignolet, Stefan Schmid, and Gilles Tredan. 2021. On the Feasibility of Perfect Resilience with Local Fast Failover. In Proc. SIAM Symposium on Algorithmic Principles of Computer Systems (APOCS).
[21]
Ari Fogel, Stanley Fung, Luis Pedrosa, Meg Walraed-Sullivan, Ramesh Govindan, Ratul Mahajan, and Todd Millstein. 2015. A General Approach to Network Configuration Analysis. In 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI). USENIX Association, 469--483.
[22]
Aaron Gember-Jacobson, Raajay Viswanathan, Aditya Akella, and Ratul Mahajan. 2016. Fast Control Plane Analysis Using an Abstract Representation. In Proceedings of the 2016 ACM SIGCOMM Conference. ACM, 300--313.
[23]
Phillipa Gill, Navendu Jain, and Nachiappan Nagappan. 2011. Understanding network failures in data centers: measurement, analysis, and implications. In ACM SIGCOMM Computer Communication Review, Vol. 41 (4). 350--361.
[24]
Thomas Holterbach, Edgar Costa Molero, Maria Apostolaki, Alberto Dainotti, Stefano Vissicchio, and Laurent Vanbever. 2019. Blink: Fast connectivity recovery entirely in the data plane. In 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI). 161--176.
[25]
Jesper Stenbjerg Jensen, Troels Beck Krøgh, Jonas Sand Madsen, Stefan Schmid, Jiří Srba, and Marc Tom Thorgersen. 2018. P-Rex: Fast Verification of MPLS Networks with Multiple Link Failures. In Proceedings of the 14th International Conference on Emerging Networking EXperiments and Technologies (CoNEXT). ACM, 217--227.
[26]
Peter Gjøl Jensen, Dan Kristiansen, Stefan Schmid, Morten Konggaard Schou, Bernhard Clemens Schrenk, and Jiri Srba. 2020. AalWiNes: A Fast and Quantitative What-If Analysis Tool for MPLS Networks. In Proceedings of the 16th International Conference on Emerging Networking EXperiments and Technologies (CoNEXT). ACM, 474--481.
[27]
Simon Knight, Hung X. Nguyen, Nickolas Falkner, Rhys Bowden, and Matthew Roughan. 2011. The Internet Topology Zoo. IEEE Journal on Selected Areas in Communications 29, 9 (2011), 1765--1775.
[28]
Craig Labovitz, G Robert Malan, and Farnam Jahanian. 1998. Internet routing instability. IEEE/ACM transactions on Networking 6, 5 (1998), 515--528.
[29]
Karthik Lakshminarayanan, Matthew Caesar, Murali Rangan, Tom Anderson, Scott Shenker, and Ion Stoica. 2007. Achieving Convergence-Free Routing Using Failure-Carrying Packets. In Proceedings of the 2007 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications (Kyoto, Japan) (SIGCOMM '07). Association for Computing Machinery, New York, NY, USA, 241--252.
[30]
O. Lemeshko and K. Arous. 2014. Fast ReRoute model for different backup schemes in MPLS-network. In 2014 First International Scientific-Practical Conference Problems of Infocommunications Science and Technology. 39--41.
[31]
O. Lemeshko, A. Romanyuk, and H. Kozlova. 2013. Design schemes for MPLS Fast ReRoute. In 2013 12th International Conference on the Experience of Designing and Application of CAD Systems in Microelectronics (CADSM). 202--203.
[32]
O. Lemeshko and O. Yeremenko. 2018. Linear optimization model of MPLS Traffic Engineering Fast ReRoute for link, node, and bandwidth protection. In 2018 14th International Conference on Advanced Trends in Radioelecrtronics, Telecommunications and Computer Engineering (TCSET). 1009--1013.
[33]
Tony Li and Henk Smit. 2008. IS-IS Extensions for Traffic Engineering. Technical Report 5305.
[34]
Stephane Litkowski, Pierre Francois, Ahmed Bashandy, Clarence Filsfils, and Bruno Decraene. 2018. RFC draft: Topology Independent Fast Reroute using Segment Routing. Technical Report. https://tools.ietf.org/html/draft-bashandy-rtgwg-segment-routing-ti-lfa-02
[35]
J. Liu, A. Panda, A. Singla, B. Godfrey, M. Schapira, and S. Shenker. 2013. Ensuring connectivity via data plane mechanisms. In 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI). USENIX Association, 113--126.
[36]
Junda Liu, Aurojit Panda, Ankit Singla, Brighten Godfrey, Michael Schapira, and Scott Shenker. 2013. Ensuring Connectivity via Data Plane Mechanisms. In 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI 13). USENIX Association, 113--126.
[37]
Michael Menth, Michael Duelli, Ruediger Martin, and Jens Milbrandt. 2009. Resilience analysis of packet-witched communication networks. IEEE/ACM transactions on Networking (ToN) 17, 6 (2009), 1950--1963.
[38]
P. Pan, G. Swallow, and A. Atlas. 2005. Fast Reroute Extensions to RSVP-TE for LSP Tunnels. RFC 4090. RFC Editor. 1--38 pages.
[39]
Santhosh Prabhu, Kuan Yen Chou, Ali Kheradmand, Brighten Godfrey, and Matthew Caesar. 2020. Plankton: Scalable network configuration verification through model checking. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI). USENIX Association, 953--967.
[40]
E. Rosen, A. Viswanathan, and R. Callon. 2001. Multiprotocol Label Switching Architecture. RFC 3031. RFC Editor. 1--61 pages.
[41]
Stefan Schmid, Morten Konggaard Schou, Jiri Srba, and Juan Vanerio. 2022. Artifact for "R-MPLS: Recursive Protection for Highly Dependable MPLS Networks".
[42]
Stefan Schmid and Jiří Srba. 2018. Polynomial-Time What-If Analysis for Prefix-Manipulating MPLS Networks. In IEEE International Conference on Computer Communications (INFOCOM). IEEE, 1--9.
[43]
Nick Shelly, Brendan Tschaen, Klaus-Tycho Förster, Michael Chang, Theophilus Benson, and Laurent Vanbever. 2015. Destroying networks for fun (and profit). In Proceedings of the 14th ACM Workshop on Hot Topics in Networks. 1--7.
[44]
Steve Smith. 2003. Introduction to MPLS. https://www.cisco.com/c/dam/global/fr_ca/training-events/pdfs/Intro_to_mpls.pdf. Visited: 19/05/2020.
[45]
Brent Stephens, Alan L Cox, and Scott Rixner. 2016. Scalable multi-failure fast failover via forwarding table compression. In Proceedings of the Symposium on SDN Research. 1--12.
[46]
Juan Vanerio, Stefan Schmid, Morten K Schou, and Jiri Srba. 2022. MPLS-Kit: An MPLS Data Plane Toolkit. In 2022 Global Internet (GI) Symposium (GI 2022). IEEE, Paris, France. (To appear).
[47]
Hao Wang, Haiyong Xie, Lili Qiu, Yang Richard Yang, Yin Zhang, and Albert Greenberg. 2006. COPE: Traffic Engineering in Dynamic Networks. In Proceedings of the 2006 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications (Pisa, Italy) (SIGCOMM '06). ACM, 99--110.
[48]
Ye Wang, Hao Wang, Ajay Mahimkar, Richard Alimi, Yin Zhang, Lili Qiu, and Yang Richard Yang. 2010. R3: Resilient Routing Reconfiguration. ACM SIGCOMM Computer Communication Review 40, 4 (Aug. 2010), 291--302.
[49]
Dahai Xu, Yizhi Xiong, Chunming Qiao, and Guangzhi Li. 2004. Failure protection in layered networks with shared risk link groups. IEEE Network 18, 3 (2004), 36--41.
[50]
O. S. Yeremenko, O. V. Lemeshko, and N. Tariki. 2017. Fast ReRoute scalable solution with protection schemes of network elements. In 2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON). 783--788.
[51]
Derek M. Yeung, Dave Katz, and Kireeti Kompella. 2003. Traffic Engineering (TE) Extensions to OSPF Version 2. Technical Report 3630.
[52]
Lixia Zhang, Robert T. Braden, Andreas Terzis, and Subramaniam Vincent. 2000. RSVP Diagnostic Messages. RFC 2745. RFC Editor.

Cited By

View all
  • (2024)SyPer: Synthesis of Perfectly Resilient Local Fast Re-Routing Rules for Highly Dependable NetworksIEEE INFOCOM 2024 - IEEE Conference on Computer Communications10.1109/INFOCOM52122.2024.10621323(2398-2407)Online publication date: 20-May-2024
  • (2024)SyRep: Efficient Synthesis and Repair of Fast Re-Route Forwarding Tables for Resilient Networks2024 54th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN)10.1109/DSN58291.2024.00053(483-494)Online publication date: 24-Jun-2024

Index Terms

  1. R-MPLS: recursive protection for highly dependable MPLS networks

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CoNEXT '22: Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies
      November 2022
      431 pages
      ISBN:9781450395083
      DOI:10.1145/3555050
      This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives International 4.0 License.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 30 November 2022

      Check for updates

      Badges

      Author Tags

      1. MPLS
      2. fast reroute
      3. network algorithms
      4. network reliability

      Qualifiers

      • Research-article

      Conference

      CoNEXT '22
      Sponsor:

      Acceptance Rates

      CoNEXT '22 Paper Acceptance Rate 28 of 151 submissions, 19%;
      Overall Acceptance Rate 198 of 789 submissions, 25%

      Upcoming Conference

      CoNEXT '24

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)137
      • Downloads (Last 6 weeks)15
      Reflects downloads up to 14 Oct 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)SyPer: Synthesis of Perfectly Resilient Local Fast Re-Routing Rules for Highly Dependable NetworksIEEE INFOCOM 2024 - IEEE Conference on Computer Communications10.1109/INFOCOM52122.2024.10621323(2398-2407)Online publication date: 20-May-2024
      • (2024)SyRep: Efficient Synthesis and Repair of Fast Re-Route Forwarding Tables for Resilient Networks2024 54th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN)10.1109/DSN58291.2024.00053(483-494)Online publication date: 24-Jun-2024

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media