Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3326285.3329036acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiwqosConference Proceedingsconference-collections
research-article
Public Access

RetroFlow: maintaining control resiliency and flow programmability for software-defined WANs

Published: 24 June 2019 Publication History

Abstract

Providing resilient network control is a critical concern for deploying Software-Defined Networking (SDN) into Wide-Area Networks (WANs). For performance reasons, a Software-Defined WAN is divided into multiple domains controlled by multiple controllers with a logically centralized view. Under controller failures, we need to remap the control of offline switches from failed controllers to other active controllers. Existing solutions could either overload active controllers to interrupt their normal operations or degrade network performance because of increasing the controller-switch communication overhead. In this paper, we propose RetroFlow to achieve low communication overhead without interrupting the normal processing of active controllers during controller failures. By intelligently configuring a set of selected offline switches working under the legacy routing mode, RetroFlow relieves the active controllers from controlling the selected offline switches while maintaining the flow programmability (e.g., the ability to change paths of flows) of SDN. RetroFlow also smartly transfers the control of offline switches with the SDN routing mode to active controllers to minimize the communication overhead from these offline switches to the active controllers. Simulation results show that compared with the baseline algorithm, RetroFlow can reduce the communication overhead up to 52.6% during a moderate controller failure by recovering 100% flows from offline switches and can reduce the communication overhead up to 61.2% during a serious controller failure by setting to recover 90% of flows from offline switches.

References

[1]
"Software-defined infrastructure at uber," https://www.linuxfoundation.org/blog/2018/06/software-defined-infrastructure-at-uber/.
[2]
S. Jain, A. Kumar, S. Mandal, L. Poutievski, A. Singh, S. Venkata, J. Wanderer, J. Zhou, and M. Zhu, "B4: Experience with a globally-deployed software defined WAN," in ACM SIGCOMM'13.
[3]
"First in the u.s. to mobile 5g âĂŞ whatâĂŹs next? defining at&tâĂŹs network path in 2019 and beyond," https://about.att.com/story/2019/2019_and_beyond.html.
[4]
T. Hu, Z. Guo, P. Yi, T. Baker, and J. Lan, "Multi-controller based software-defined networking: A survey," IEEE Access, vol. 6, pp. 15 980-15 996, 2018.
[5]
Z. Guo, M. Su, Y. Xu, Z. Duan, L. Wang, S. Hui, and H. J. Chao, "Improving the performance of load balancing in software-defined networks through load variance-based synchronization," Computer Networks, vol. 68, pp. 95--109, 2014.
[6]
D. Hock, M. Hartmann, S. Gebert, M. Jarschel, T. Zinner, and P. Tran-Gia, "Pareto-optimal resilient controller placement in sdn-based core networks," in IEEE ITC'13.
[7]
M. Tanha, D. Sajjadi, and J. Pan, "Enduring node failures through resilient controller placement for software defined networks," in IEEE GLOBECOM'16.
[8]
B. P. R. Killi and S. V. Rao, "Capacitated next controller placement in software defined networks," IEEE Transactions on Network and Service Management, vol. 14, no. 3, pp. 514--527, 2017.
[9]
T. Hu, Z. Guo, J. Zhang, and J. Lan, "Adaptive slave controller assignment for fault-tolerant control plane in software-defined networking," in IEEE ICC'18.
[10]
G. Yao, J. Bi, and L. Guo, "On the cascading failures of multi-controllers in software defined networks," in IEEE ICNP'13.
[11]
T. Hu, P. Yi, Z. Guo, J. Lan, and Y. Hu, "Dynamic slave controller assignment for enhancing control plane robustness in software-defined networks," Future Generation Computer Systems, vol. 95, pp. 681--693, 2019.
[12]
"Brocade mlx-8 pe," https://www.dataswitchworks.com/datasheets/MLX_Series_DS.pdf.
[13]
N. L. Van Adrichem, C. Doerr, and F. A. Kuipers, "Opennetmon: Network monitoring in openflow software-defined networks," in IEEE NOMS'14.
[14]
A. Tootoonchian, M. Ghobadi, and Y. Ganjali, "Opentm: traffic matrix estimator for openflow networks," in Springer PAM'10, pp. 201--210.
[15]
"Openflow switch specification 1.3," https://www.opennetworking.org/wp-content/uploads/2014/10/openflow-spec-v1.3.0.pdf.
[16]
J. Xie, D. Guo, X. Li, Y. Shen, and X. Jiang, "Cutting long-tail latency of routing response in software defined networks," IEEE Journal on Selected Areas in Communications, vol. 36, no. 3, pp. 384--396, 2018.
[17]
B. Heller, R. Sherwood, and N. McKeown, "The controller placement problem," in ACM HotSDN'12.
[18]
G. Yao, J. Bi, Y. Li, and L. Guo, "On the capacitated controller placement problem in software defined networks," IEEE Communications Letters, vol. 18, no. 8, pp. 1339--1342, 2014.
[19]
S. Agarwal, M. Kodialam, and T. Lakshman, "Traffic engineering in software defined networks," in IEEE INFOCOM'13.
[20]
K. Poularakis, G. Iosifidis, G. Smaragdakis, and L. Tassiulas, "One step at a time: Optimizing SDN upgrades in ISP networks," in IEEE INFOCOM'17.
[21]
Z. Guo, W. Chen, Y. Liu, Y. Xu, and Z. Zhang, "Joint switch upgrade and controller deployment in hybrid software-defined networks," IEEE Journal on Selected Areas in Communications, vol. 37, no. 5, pp. 1012--1028, May 2019.
[22]
T. Wang, F. Liu, J. Guo, and H. Xu, "Dynamic SDN controller assignment in data center networks: Stable matching with transfers," in IEEE INFOCOM'16.
[23]
H. Xu, Z. Yu, C. Qian, X.-Y. Li, and Z. Liu, "Minimizing flow statistics collection cost of sdn using wildcard requests," in IEEE INFOCOM'17.
[24]
M. R. Gary and D. S. Johnson, "Computers and intractability: A guide to the theory of np-completeness," 1979.
[25]
S. Knight, H. X. Nguyen, N. Falkner, R. Bowden, and M. Roughan, "The internet topology zoo," IEEE Journal on Selected Areas in Communications, vol. 29, no. 9, pp. 1765 -- 1775, 2011.
[26]
C. C. Robusto, "The cosine-haversine formula," The American Mathematical Monthly, vol. 64, no. 1, pp. 38--40, 1957.
[27]
"Speed, rates, times, delays: Data link parameters for cse 461," https://courses.cs.washington.edu/courses/cse461/99wi/issues/definitions.html.
[28]
"Gurobi optimization," http://www.gurobi.com.
[29]
N. Perrot and T. Reynaud, "Optimal placement of controllers in a resilient sdn architecture," in IEEE DRCN'16.
[30]
A. Alshamrani, S. Guha, S. Pisharody, A. Chowdhary, and D. Huang, "Fault tolerant controller placement in distributed sdn environments," in IEEE ICC'18.

Cited By

View all
  • (2024)ARES: Predictable Traffic Engineering under Controller Failures in SD-WANsProceedings of the ACM Web Conference 202410.1145/3589334.3645321(2703-2712)Online publication date: 13-May-2024
  • (2024)Maintaining the Network Performance of Software-Defined WANs With Efficient Critical RoutingIEEE Transactions on Network and Service Management10.1109/TNSM.2023.333540421:2(2240-2252)Online publication date: Apr-2024
  • (2024)Maintaining Control Resiliency for Traffic Engineering in SD-WANsIEEE/ACM Transactions on Networking10.1109/TNET.2024.339384132:4(3485-3498)Online publication date: Aug-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
IWQoS '19: Proceedings of the International Symposium on Quality of Service
June 2019
420 pages
ISBN:9781450367783
DOI:10.1145/3326285
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 June 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. control plane
  2. hybrid routing
  3. programmability
  4. resiliency
  5. software-defined networking
  6. wide area networks

Qualifiers

  • Research-article

Funding Sources

Conference

IWQoS '19

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)161
  • Downloads (Last 6 weeks)49
Reflects downloads up to 08 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)ARES: Predictable Traffic Engineering under Controller Failures in SD-WANsProceedings of the ACM Web Conference 202410.1145/3589334.3645321(2703-2712)Online publication date: 13-May-2024
  • (2024)Maintaining the Network Performance of Software-Defined WANs With Efficient Critical RoutingIEEE Transactions on Network and Service Management10.1109/TNSM.2023.333540421:2(2240-2252)Online publication date: Apr-2024
  • (2024)Maintaining Control Resiliency for Traffic Engineering in SD-WANsIEEE/ACM Transactions on Networking10.1109/TNET.2024.339384132:4(3485-3498)Online publication date: Aug-2024
  • (2024)Toward Improved Path Programmability Recovery for Software-Defined WANs Under Multiple Controller FailuresIEEE/ACM Transactions on Networking10.1109/TNET.2023.328645632:1(143-158)Online publication date: Feb-2024
  • (2024)Path Programmability Recovery under Controller Failures for SD-WANs: Recent Advances and Future Research ChallengesIEEE Communications Magazine10.1109/MCOM.001.230045962:11(100-106)Online publication date: Nov-2024
  • (2024)Mitigating the impact of controller failures on QoS robustness for software-defined wide area networksComputer Networks10.1016/j.comnet.2023.110096238(110096)Online publication date: Jan-2024
  • (2023)An End-Host-Importance-Aware Secure Service-Enabled Hybrid SDN DeploymentIEEE Transactions on Network and Service Management10.1109/TNSM.2022.320869520:2(2056-2070)Online publication date: Jun-2023
  • (2023)Toward Flexible and Predictable Path Programmability Recovery Under Multiple Controller Failures in Software-Defined WANsIEEE/ACM Transactions on Networking10.1109/TNET.2022.322742331:5(1965-1980)Online publication date: Oct-2023
  • (2022)Maintaining Control Resiliency and Flow Programmability in Software-Defined WANs During Controller FailuresIEEE/ACM Transactions on Networking10.1109/TNET.2021.312877130:3(969-984)Online publication date: Jun-2022
  • (2021)Main and Secondary Controller Assignment With Optimal Priority Policy Against Multiple FailuresIEEE Transactions on Network and Service Management10.1109/TNSM.2021.306464618:4(4391-4405)Online publication date: Dec-2021
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media