SAFE: Service Availability via Failure Elimination Through VNF Scaling

Published: 05 August 2019

Abstract

Virtualized network functions (VNFs) enable software applications to replace traditional middleboxes, making network service provisioning more flexible and scalable. This paper focuses on ensuring Service Availability via Failure Elimination (SAFE) using VNF scaling: given the resource requirements of VNF instances, find an optimal and robust instance consolidation strategy that can recover quickly from a single instance failure. To address this problem, we present a framework based on rounding and dynamic programming. First, we discretize the range of resource requirements for VNF instance deployment into several sub-ranges, so that the number of instance types becomes a constant. Second, we further reduce the number of instance types by gathering several small instances into a bigger one. Third, we propose a dynamic programming algorithm to solve the instance consolidation problem with a limited number of instance types. We set up a testbed to profile the functional relationship between resources and throughput for different types of VNF instances, and conduct simulations based on the profiling results to validate our theoretical results. The simulation results show that our algorithm outperforms the standby deployment model by 27.33% on average in terms of the number of servers required. Furthermore, SAFE incurs only marginal overhead, around 7.22%, compared with an instance consolidation strategy that does not consider VNF backup.
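
The rounding and dynamic programming steps described in the abstract can be illustrated with a textbook-style sketch. This is not the authors' algorithm: it omits the backup capacity needed for failure recovery and the step that gathers small instances into bigger ones. The function names `round_up` and `min_servers`, the integer resource units, and the equal-width sub-ranges are all assumptions made for illustration.

```python
from functools import lru_cache
from itertools import product


def round_up(demands, capacity, k):
    """Round each demand up to the top of one of k equal sub-ranges.

    Assumes integer demands in (0, capacity] and that k divides capacity.
    Returns the k rounded sizes and how many instances fall in each.
    """
    if capacity % k != 0:
        raise ValueError("k must divide capacity in this sketch")
    step = capacity // k
    counts = [0] * k
    for d in demands:
        if not 0 < d <= capacity:
            raise ValueError(f"demand {d} does not fit on one server")
        counts[-(-d // step) - 1] += 1  # integer ceiling of d / step, minus 1
    sizes = [step * (i + 1) for i in range(k)]
    return sizes, tuple(counts)


def min_servers(sizes, counts, capacity):
    """Fewest servers needed to host counts[i] instances of size sizes[i].

    Dynamic programming over count vectors: with a constant number of
    instance types, the number of distinct states is polynomial in the
    number of instances.
    """
    # Every non-empty mix of instances that fits on a single server.
    configs = [
        c
        for c in product(*(range(n + 1) for n in counts))
        if any(c) and sum(x * s for x, s in zip(c, sizes)) <= capacity
    ]

    @lru_cache(maxsize=None)
    def best(remaining):
        if not any(remaining):
            return 0
        # Fill one server with a feasible mix, then solve the rest.
        return min(
            1 + best(tuple(r - x for r, x in zip(remaining, c)))
            for c in configs
            if all(x <= r for x, r in zip(c, remaining))
        )

    return best(tuple(counts))


if __name__ == "__main__":
    sizes, counts = round_up([2, 3, 5, 7, 7, 11], capacity=12, k=4)
    print(sizes, counts, min_servers(sizes, counts, capacity=12))
```

In this hypothetical run the six demands are rounded up to sizes 3, 6, 9, and 12, so the dynamic program only has to track four counters rather than six individual instances. Rounding up can waste some capacity, which is the price paid for keeping the number of instance types constant.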

Published In

ICPP '19: Proceedings of the 48th International Conference on Parallel Processing
August 2019
1107 pages
ISBN:9781450362955
DOI:10.1145/3337821
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • University of Tsukuba

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • National Key R&D Program
  • Fundamental Research Funds for the Central Universities
  • Natural Science Foundation of Jiangsu Province
  • National Natural Science Foundation of China

Conference

ICPP 2019

Acceptance Rates

Overall Acceptance Rate 91 of 313 submissions, 29%

Article Metrics

  • Downloads (last 12 months): 17
  • Downloads (last 6 weeks): 1

Reflects downloads up to 15 Oct 2024

Cited By

  • (2024) Service Recovery in NFV-Enabled Networks: Algorithm Design and Analysis. IEEE Transactions on Cloud Computing 12:2 (800-813). DOI: 10.1109/TCC.2024.3402185. Online publication date: Apr-2024
  • (2023) Stable Service Caching in MECs of Hierarchical Service Markets With Uncertain Request Rates. IEEE Transactions on Mobile Computing 22:7 (4279-4296). DOI: 10.1109/TMC.2022.3149870. Online publication date: 1-Jul-2023
  • (2023) Steam computing paradigm: Cross‐layer solutions over cloud, fog, and edge computing. IET Wireless Sensor Systems 14:5 (157-180). DOI: 10.1049/wss2.12051. Online publication date: 20-Feb-2023
  • (2022) Near Optimal Learning-Driven Mechanisms for Stable NFV Markets in Multitier Cloud Networks. IEEE/ACM Transactions on Networking 30:6 (2601-2615). DOI: 10.1109/TNET.2022.3179295. Online publication date: Dec-2022
  • (2022) Parallel Deployment of VNFs in Service Function Chain: Benefit or Not? 2022 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom) (628-635). DOI: 10.1109/ISPA-BDCloud-SocialCom-SustainCom57177.2022.00086. Online publication date: Dec-2022
  • (2021) Latency-Efficient VNF Deployment and Path Routing for Reliable Service Chain. IEEE Transactions on Network Science and Engineering 8:1 (651-661). DOI: 10.1109/TNSE.2020.3048033. Online publication date: 1-Jan-2021
  • (2021) Near Optimal and Dynamic Mechanisms Towards a Stable NFV Market in Multi-Tier Cloud Networks. IEEE INFOCOM 2021 - IEEE Conference on Computer Communications (1-10). DOI: 10.1109/INFOCOM42981.2021.9488819. Online publication date: 10-May-2021
  • (2020) Service Function Chain Deployment with Guaranteed Resilience. 2020 IEEE 17th International Conference on Mobile Ad Hoc and Sensor Systems (MASS) (585-593). DOI: 10.1109/MASS50613.2020.00077. Online publication date: Dec-2020
  • (2020) Packet-in Request Redirection for Minimizing Control Plane Response Time. 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS) (926-935). DOI: 10.1109/IPDPS47924.2020.00099. Online publication date: May-2020
  • (2020) To Cache or Not to Cache: Stable Service Caching in Mobile Edge-Clouds of a Service Market. 2020 IEEE 40th International Conference on Distributed Computing Systems (ICDCS) (421-431). DOI: 10.1109/ICDCS47774.2020.00051. Online publication date: Nov-2020
