Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3387514.3405868acmconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article
Open access

Contention-Aware Performance Prediction For Virtualized Network Functions

Published: 30 July 2020 Publication History

Abstract

At the core of Network Functions Virtualization lie Network Functions (NFs) that run co-resident on the same server, contend over its hardware resources and, thus, might suffer from reduced performance relative to running alone on the same hardware. Therefore, to efficiently manage resources and meet performance SLAs, NFV orchestrators need mechanisms to predict contention-induced performance degradation. In this work, we find that prior performance prediction frameworks suffer from poor accuracy on modern architectures and NFs because they treat memory as a monolithic whole. In addition, we show that, in practice, there exist multiple components of the memory subsystem that can separately induce contention. By precisely characterizing (1) the pressure each NF applies on the server's shared hardware resources (contentiousness) and (2) how susceptible each NF is to performance drop due to competing contentiousness (sensitivity), we develop SLOMO, a multivariable performance prediction framework for Network Functions. We show that relative to prior work SLOMO reduces prediction error by 2-5x and enables 6-14% more efficient cluster utilization. SLOMO's codebase can be found at https://github.com/cmu-snap/SLOMO.

Supplementary Material

MP4 File (3387514.3405868.mp4)
At the core of Network Functions Virtualization lie Network Functions (NFs) that run co-resident on the same server, contend over its hardware resources and, thus, might suffer from reduced performance relative to running alone on the same hardware. Therefore, to manage resources and meet performance SLAs, NFV orchestrators need mechanisms to predict contention-induced performance degradation. In this work, we find that prior performance prediction frameworks suffer from poor accuracy on modern architectures andNFs because they treat memory as a monolithic whole. In addition, we show that, in practice, there exist multiple components of the memory subsystem that can separately induce contention. By precisely characterizing (1) the pressure each NF applies on the server?s shared hardware resources (contentiousness) and (2) how susceptible each NF is to performance drop due to competing contentiousness (sensitivity), we develop SLOMO, a multivariable performance prediction framework for Network Functions.

References

[1]
Aggregate PCM Metrics. https://software.intel.com/en-us/forums/software-tuning-performance-optimization-platform-monitoring/topic/277497.
[2]
AT&T. Domain2. https://www.att.com/Common/about_us/pdf/AT&T%20Domain%202.0%20Vision%20White%20Paper.pdf.
[3]
AT&T. ECOMP. https://policyforum.att.com/wp-content/uploads/2017/03/ecomp-architecture-whitepaper-att.pdf.
[4]
Dpdk-performance tuning guide. https://doc.dpdk.org/guides-16.11/linux_gsg/nic_perf_intel_platform.html.
[5]
ETSI. Network Functions Virtualization. https://portal.etsi.org/NFV/NFV_White_Paper.pdf.
[6]
Intel cache allocation technology. https://www.intel.com/content/www/us/en/communications/cache-monitoring-cache-allocation-technologies.html.
[7]
Intel direct data i/o technology. https://www.intel.com/content/www/us/en/io/data-direct-i-o-technology.html.
[8]
Intel PCM. https://github.com/opcm/pcm.
[9]
Intel skylake-x review: Core i9 7900x, i7 7820x and i7 7800x tested. https://www.anandtech.com/show/11550/the-intel-skylakex-review-core-i9-7900x-i7-7820x-and-i7-7800x-tested/4.
[10]
Linux Foundation. OPNFV. https://www.opnfv.org/.
[11]
Mellanox-performance tuning guide. https://community.mellanox.com/s/article/performance-tuning-for-mellanox-adapters.
[12]
Reducing os jitter due to per-cpu kthreads. https://www.kernel.org/doc/html/latest/admin-guide/kernel-per-CPU-kthreads.html.
[13]
Snort: Network Intrusion Detection & Detection System. https://www.snort.org.
[14]
SR-IOV. https://github.com/intel/sriov-network-device-plugin.
[15]
Suricata: Open source ids, ips, nsm ensgine. https://suricata-ids.org.
[16]
A. Abel, F. Benz, J. Doerfert, B. Dörr, S. Hahn, F. Haupenthal, M.Jacobs, A. H. Moin, J. Reineke, B. Schommer, et al. Impact of resource sharing on performance and performance prediction: A survey. In International Conference on Concurrency Theory, pages 25--43. Springer, 2013.
[17]
L. A. Barroso, J. Clidaras, and U. Hölzle. The datacenter as a computer: An introduction to the design of warehouse-scale machines. Synthesis lectures on computer architecture, 8(3):1--154, 2013.
[18]
J. Benesty, J. Chen, Y. Huang, and I. Cohen. Pearson correlation coefficient. In Noise reduction in speech processing, pages 1--4. Springer, 2009.
[19]
C. M. Buechler and J. Pingle. pfsense: The definitive guide. Reed Media Services, 2009.
[20]
C. Delimitrou and C. Kozyrakis. Paragon: Qos-aware scheduling for heterogeneous datacenters. In ACM SIGPLAN Notices, volume 48, pages 77--88. ACM, 2013.
[21]
C. Delimitrou and C. Kozyrakis. Quasar: resource-efficient and qos-aware cluster management. ACM SIGPLAN Notices, 49(4):127--144, 2014.
[22]
T. G. Dietterich. Ensemble methods in machine learning. In International workshop on multiple classifier systems, pages 1--15. Springer, 2000.
[23]
I. D. Direct. I/o technology (intel ddio) a primer, 2012.
[24]
M. Dobrescu, K. Argyraki, and S. Ratnasamy. Toward predictable performance in software packet-processing platforms. In Proc. NSDI 12, pages 141--154, San Jose, CA, 2012. USENIX.
[25]
M. Dobrescu, N. Egi, K. Argyraki, B. Chun, K. Fall, G. Iannaccone, A. Knies, M. Manesh, and S. Ratnasamy. Routebricks: Exploiting parallelism to scale software routers. In Proc. SOSP 2009, SOSP '09, pages 15--28, New York, NY, USA, 2009. ACM.
[26]
E. Ebrahimi, C. J. Lee, O. Mutlu, and Y. N. Patt. Fairness via source throttling: a configurable and high-performance fairness substrate for multi-core memory systems. In ACM Sigplan Notices, volume 45, pages 335--346. ACM, 2010.
[27]
J. H. Friedman. Greedy function approximation: a gradient boosting machine. Annals of statistics, pages 1189--1232, 2001.
[28]
A. Gember-Jacobson, R. Viswanathan, C. Prakash, R. Grandl, J. Khalid, S. Das, and A. Akella. Opennf: Enabling innovation in network function control. In ACM SIGCOMM Computer Communication Review, volume 44, pages 163--174. ACM, 2014.
[29]
D. M. Hawkins. The problem of overfitting. Journal of chemical information and computer sciences, 44(1):1--12, 2004.
[30]
R. Iyer, L. Pedrosa, A. Zaostrovnykh, S. Pirelli, K. Argyraki, and G. Candea. Performance contracts for software network functions. In 16th { USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 19), pages 517--530, 2019.
[31]
M. Kablan, A. Alsudais, E. Keller, and F. Le. Stateless network functions: Breaking the tight coupling of state and processing. In NSDI, pages 97--112, 2017.
[32]
E. Kohler, R. Morris, B. Chen, J. Jannotti, and F. Kaashoek. The click modular router. In Proc. TOCS 2000, volume 18, pages 263--297. ACM, 2000.
[33]
C. Kozyrakis, A. Kansal, S. Sankar, and K. Vaid. Server engineering insights for large-scale online services. IEEE micro, 30(4):8--19, 2010.
[34]
M. Kurth, B. Gras, D. Andriesse, C. Giuffrida, H. Bos, and K. Razavi. Netcat: Practical cache attacks from the network, 2020.
[35]
J. Li, N. K. Sharma, D. R. K. Ports, and S. D. Gribble. Tales of the tail: Hardware, os, and application-level sources of tail latency. In Proceedings of the ACM Symposium on Cloud Computing, SOCC '14, page 1--14, New York, NY, USA, 2014. Association for Computing Machinery.
[36]
Y. Li and M. Chen. Software-defined network function virtualization: A survey. IEEE Access, 3:2542--2553, 2015.
[37]
E. C. man Jr, M. Garey, and D. Johnson. Approximation algorithms for bin packing: A survey. Approximation algorithms for NP-hard problems, pages 46--93, 1996.
[38]
J. Mars, L. Tang, and R. Hundt. Heterogeneity in "homogeneous" warehouse-scale computers: A performance opportunity. IEEE Computer Architecture Letters, 10(2):29--32, 2011.
[39]
J. Mars, L. Tang, R. Hundt, K. Skadron, and M. L. Soffa. Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations. In Proceedings of the 44th annual IEEE/ACM International Symposium on Microarchitecture, pages 248--259. ACM, 2011.
[40]
L. A. Mauricio, M. G. Rubinstein, and O. C. Duarte. Proposing and evaluating the performance of a firewall implemented as a virtualized network function. In 2016 7th International Conference on the Network of the Future (NOF), pages 1--3. IEEE, 2016.
[41]
S. Palkar, C. Lan, S.Han, K. Jang, A. Panda, S. Ratnasamy, L. Rizzo, and S. Shenker. E2: A framework for nfv applications. In Proc. SOSP 2015, SOSP '15, pages 121--136, New York, NY, USA, 2015. ACM.
[42]
A. Panda, S. Han, K. Jang, M. Walls, S. Ratnasamy, and S. Shenker. Netbricks: Taking the v out of nfv. In OSDI, pages 203--216, 2016.
[43]
L. Pedrosa, R. Iyer, A. Zaostrovnykh, J. Fietz, and K. Argyraki. Automated synthesis of adversarial workloads for network functions. In Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication, SIGCOMM '18, pages 372--385, New York, NY, USA, 2018. ACM.
[44]
V. Sekar, N. Egi, S. Ratnasamy, M. Reiter, and G. Shi. Design and implementation of a consolidated middlebox architecture. In Proc. of NSDI 2012, NSDI'12, pages 24--24, Berkeley, CA, USA, 2012. USENIX Association.
[45]
J. Sherry, S. Hasan, C. Scott, A. Krishnamurthy, S. Ratnasamy, and V. Sekar. Making middleboxes someone else's problem: network processing as a cloud service. ACM SIGCOMM Computer Communication Review, 42(4):13--24, 2012.
[46]
B. Stephens, A. Akella, and M. Swift. Loom: Flexible and efficient {NIC } packet scheduling. In 16th {USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 19), pages 33--46, 2019.
[47]
L. Subramanian, V. Seshadri, A. Ghosh, S. Khan, and O. Mutlu. The application slowdown model: Quantifying and controlling the impact of inter-application interference at shared caches and main memory. In Proc. ACM MICRO 2015, pages 62--75. ACM, 2015.
[48]
L. Subramanian, V. Seshadri, Y. Kim, B. Jaiyen, and O. Mutlu. Mise: Providing performance predictability and improving fairness in shared main memory systems. In High Performance Computer Architecture (HPCA2013), 2013 IEEE 19th International Symposium on, pages 639--650. IEEE, 2013.
[49]
L. Tang, J. Mars, and M. L. Soffa. Contentiousness vs. sensitivity: Improving contention aware runtime systems on multicore architectures. In Proceedings of the 1st International Workshop on Adaptive Self-Tuning Computing Systems for the Exaflop Era, EXADAPT '11, pages 12--21, New York, NY, USA, 2011. ACM.
[50]
L. Tang, J. Mars, N. Vachharajani, R. Hundt, and M. L. Soffa. The impact of memory subsystem resource sharing on datacenter applications. In ACM SIGARCH Computer Architecture News, volume 39, pages 283--294. ACM, 2011.
[51]
A. Tootoonchian, A. Panda, C. Lan, M. Walls, K. Argyraki, S. Ratnasamy, and S. Shenker. Resq: Enabling slos in network function virtualization. In 15th USENIX Symposium on Networked Systems Design and Implementation NSDI 18. USENIX, 2018.
[52]
Y. Yang and J. O. Pedersen. A comparative study on feature selection in text categorization. In Icml, volume 97, page 35, 1997.
[53]
A. Zaostrovnykh, S. Pirelli, R. Iyer, M. Rizzo, L. Pedrosa, K. Argyraki, and G. Candea. Verifying software network functions with no verification expertise. In Proceedings of the 27th ACM Symposium on Operating Systems Principles, pages 275--290, 2019.
[54]
Z.-H. Zhou. Ensemble methods: foundations and algorithms. Chapman and Hall/CRC, 2012.

Cited By

View all
  • (2024)Programmable Real-Time Scheduling of Disaggregated Network Functions2024 IFIP Networking Conference (IFIP Networking)10.23919/IFIPNetworking62109.2024.10619906(1-7)Online publication date: 3-Jun-2024
  • (2024)Prudentia: Findings of an Internet Fairness WatchdogProceedings of the ACM SIGCOMM 2024 Conference10.1145/3651890.3672229(506-520)Online publication date: 4-Aug-2024
  • (2024)AdaptChain: Adaptive Data Sharing and Synchronization for NFV Systems on Heterogeneous ArchitecturesIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2024.340059435:7(1281-1292)Online publication date: Jul-2024
  • Show More Cited By

Index Terms

  1. Contention-Aware Performance Prediction For Virtualized Network Functions

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGCOMM '20: Proceedings of the Annual conference of the ACM Special Interest Group on Data Communication on the applications, technologies, architectures, and protocols for computer communication
    July 2020
    814 pages
    ISBN:9781450379557
    DOI:10.1145/3387514
    This work is licensed under a Creative Commons Attribution International 4.0 License.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 30 July 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Network Functions Performance
    2. Packet Processing Software

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Funding Sources

    Conference

    SIGCOMM '20
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 462 of 3,389 submissions, 14%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)333
    • Downloads (Last 6 weeks)36
    Reflects downloads up to 13 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Programmable Real-Time Scheduling of Disaggregated Network Functions2024 IFIP Networking Conference (IFIP Networking)10.23919/IFIPNetworking62109.2024.10619906(1-7)Online publication date: 3-Jun-2024
    • (2024)Prudentia: Findings of an Internet Fairness WatchdogProceedings of the ACM SIGCOMM 2024 Conference10.1145/3651890.3672229(506-520)Online publication date: 4-Aug-2024
    • (2024)AdaptChain: Adaptive Data Sharing and Synchronization for NFV Systems on Heterogeneous ArchitecturesIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2024.340059435:7(1281-1292)Online publication date: Jul-2024
    • (2024)Morpheus: A Run Time Compiler and Optimizer for Software Data PlanesIEEE/ACM Transactions on Networking10.1109/TNET.2023.334628632:3(2269-2284)Online publication date: 1-Jun-2024
    • (2024)AIRIC: Orchestration of Virtualized Radio Access Networks With Noisy NeighboursIEEE Journal on Selected Areas in Communications10.1109/JSAC.2023.333974942:2(432-445)Online publication date: Feb-2024
    • (2024)ATHENA: Machine Learning and Reasoning for Radio Resources Scheduling in vRAN SystemsIEEE Journal on Selected Areas in Communications10.1109/JSAC.2023.333615542:2(263-279)Online publication date: Feb-2024
    • (2024)Intel Accelerators Ecosystem: An SoC-Oriented Perspective : Industry Product2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA)10.1109/ISCA59077.2024.00066(848-862)Online publication date: 29-Jun-2024
    • (2024)Non-invasive performance prediction of high-speed softwarized network services with limited knowledgeIEEE INFOCOM 2024 - IEEE Conference on Computer Communications10.1109/INFOCOM52122.2024.10621097(2328-2337)Online publication date: 20-May-2024
    • (2024)Syscall Analysis for Resource Stress Identification for Container Network Functions2024 IEEE 17th International Conference on Cloud Computing (CLOUD)10.1109/CLOUD62652.2024.00037(256-266)Online publication date: 7-Jul-2024
    • (2024)Energy efficient and delay aware deployment of parallelized service function chains in NFV-based networksComputer Networks10.1016/j.comnet.2024.110289(110289)Online publication date: Feb-2024
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media