Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2815675.2815698acmconferencesArticle/Chapter ViewAbstractPublication PagesimcConference Proceedingsconference-collections
research-article

PerfSight: Performance Diagnosis for Software Dataplanes

Published: 28 October 2015 Publication History

Abstract

The advent of network functions virtualization (NFV) means that data planes are no longer simply composed of routers and switches. Instead they are very complex and involve a variety of sophisticated packet processing elements that reside on the OSes and software running on compute servers where network functions (NFs) are hosted. In this paper, we argue that these new "software data planes" are susceptible to at least three new classes of performance problems. To diagnose such problems, we design, implement and evaluate, PerfSight, a ground-up system that works by extracting comprehensive low-level information regarding packet processing and I/O performance of the various elements in the software data plane. Name then analyzes the information gathered in various dimensions (e.g., across all VMs on a machine, or all VMs deployed by a tenant). By looking across aggregates, we show that it becomes possible to detect and diagnose key performance problems. Experimental results show that our framework can result in accurate detection of the root causes of key performance problems in software data planes, and it imposes very little overhead.

References

[1]
Balance: the open source load-balancer and tcp proxy. http://www.inlab.de/balance.html.
[2]
Cherryproxy: a filtering http proxy extensible in python. http://www.decalage.info/python/cherryproxy.
[3]
http://rightscale.com.
[4]
http://tools.ietf.org/html/rfc3954.html.
[5]
Path mtu discovery. http://tools.ietf.org/html/rfc1191.
[6]
Snort: Open source network intrusion prevention. http://www.snort.org.
[7]
www.openvswitch.org.
[8]
www.sflow.org.
[9]
www.tcpdump.org.
[10]
A. Anand, V. Sekar, and A. Akella. Smartre: an architecture for coordinated network-wide redundancy elimination. In ACM SIGCOMM Computer Communication Review, volume 39, pages 87--98. ACM, 2009.
[11]
P. Bahl, R. Chandra, A. Greenberg, and S. Kandula. Towards highly reliable enterprise network services via interence of multi-level dependencies. In SIGCOMM, 2007.
[12]
F. Bellard. Qemu, a fast and portable dynamic translator. In USENIX Annual Technical Conference, FREENIX Track, pages 41--46, 2005.
[13]
T. Bu, N. Duffield, F. L. Presti, and D. Towsley. Network tomography on general topologies. In Proceedings of the 2002 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS '02, pages 21--30, New York, NY, USA, 2002. ACM.
[14]
M. Canini, D. Venzano, P. Peresini, D. Kostic, and J. Rexford. A nice way to test openflow applications. In NSDI, 2012.
[15]
M. Chiosi, D. Clarke, P. Willis, A. Reid, J. Feger, M. Bugenhagen, W. Khan, M. Fargano, C. Cui, H. Deng, et al. Network functions virtualisation-introductory white paper. In SDN and OpenFlow World Congress, 2012.
[16]
A. Dhamdhere, R. Teixeira, C. Dovrolis, and C. Diot. Netdiagnoser: Troubleshooting network unreachabilities using end-to-end probes and routing data. In CoNEXT, 2007.
[17]
R. Fonseca, G. Porter, R. H. Katz, S. Shenker, and I. Stocia. X-trace: A pervasive network tracing framework. In NSDI, 2007.
[18]
A. Gember, A. Akella, T. Benson, and R. Grandl. Stratos: Virtual middleboxes as first-class entities. In UW-Madison, Technical Report, 2012.
[19]
A. Goel, C. Krasic, K. Li, and J. Walpole. Supporting low latency tcp-based media streams. In Quality of Service, 2002. Tenth IEEE International Workshop on, pages 193--203. IEEE, 2002.
[20]
N. Handigol, B. Heller, V. Jeyakumar, D. Mazieres, and N. McKeown. I know what your packet did last hop: Using packet histories to troubleshoot networks. In NSDI, 2014.
[21]
S. Kandula, D. Katabi, and J.-P. Vasseur. Shrink: A tool for failure diagnosis in ip networks. In MineNet workshop, 2005.
[22]
S. Kandula, R. Mahajan, P. Verkaik, S. Agarwal, J. Padhye, and P. Bahl. Detailed diagnosis in enterprise networks. In SIGCOMM, 2009.
[23]
P. Kazemian, M. Chang, H. Zeng, G. Varghese, N. McKeown, and S. Whyte. Real time network policy checking using header space analysis. In NSDI, 2014.
[24]
P. Kazemian, G. Varghese, and N. McKeown. Header space analysis: Static checking for networks. In NSDI, 2012.
[25]
A. Khurshid, W. Zhou, M. Caesar, and P. B. Godfrey. Veriflow: Verifying network-wide invariants in real time. In HotSDN, 2012.
[26]
S. Kliger, S. Yemini, Y. Yemini, D. Ohsie, and S. Stolfo. A coding approach to event correlation. In IM, 1995.
[27]
E. Kohler, R. Morris, B. Chen, J. Jannotti, and M. F. Kaashoek. The click modular router. ACM Transactions on Computer Systems (TOCS), 2000.
[28]
R. R. Kompella, J. Yates, A. Greenberg, and A. C. Snoeren. Ip fault localization via risk modeling. In NSDI, 2005.
[29]
R. R. Kompella, J. Yates, A. Greenberg, and A. C. Snoeren. Detection and localization of network black holes. In INFOCOM, 2007.
[30]
T. Koponen, K. Amidon, P. Balland, M. Casado, A. Chanda, B. Fulton, I. Ganichev, J. Gross, N. Gude, P. Ingram, et al. Network virtualization in multi-tenant datacenters. In NSDI, 2014.
[31]
H. Mai, A. Khurshid, R. Agarwal, M. Caesar, P. B. Godfrey, and S. T. King. Debugging the data plane with anteater. In SIGCOMM, 2011.
[32]
J. N. McCann. Automating performance diagnosis in networked systems. In PHD thesis, 2012.
[33]
R. Potharaju and N. Jain. Demystifying the dark side of the middle: A field study of middlebox failures in datacenters. In IMC, 2013.
[34]
C. D. P. Ramanathan and D. Moore. Packet dispersion techniques and capacity estimation.
[35]
W. Wu, G. Wang, A. Akella, and A. Shaikh. Virtual network diagnosis as a service. In SoCC, 2013.
[36]
M. Yu, A. Greenberg, D. Maltz, J. Rexford, and L. Yuan. Profiling network performance for multi-tier data center applications. In NSDI, 2011.
[37]
H. Zeng, S. Zhang, F. Ye, V. Jeyakumar, M. Ju, J. Liu, N. McKeown, and A. Vahdat. Libra: Divide and conquer to verify forwarding tables in huge networks. In NSDI, 2014

Cited By

View all
  • (2024)Non-invasive performance prediction of high-speed softwarized network services with limited knowledgeIEEE INFOCOM 2024 - IEEE Conference on Computer Communications10.1109/INFOCOM52122.2024.10621097(2328-2337)Online publication date: 20-May-2024
  • (2024)Graph neural network based robust anomaly detection at service level in SDN driven microservice systemComputer Networks: The International Journal of Computer and Telecommunications Networking10.1016/j.comnet.2023.110135239:COnline publication date: 1-Feb-2024
  • (2023)Performance analysis of DPDK-based applications through tracingJournal of Parallel and Distributed Computing10.1016/j.jpdc.2022.10.012173:C(1-19)Online publication date: 1-Mar-2023
  • Show More Cited By

Index Terms

  1. PerfSight: Performance Diagnosis for Software Dataplanes

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    IMC '15: Proceedings of the 2015 Internet Measurement Conference
    October 2015
    550 pages
    ISBN:9781450338486
    DOI:10.1145/2815675
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 October 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. cloud networks
    2. performance
    3. software data plane
    4. troubleshooting

    Qualifiers

    • Research-article

    Funding Sources

    • NSF

    Conference

    IMC '15
    Sponsor:
    IMC '15: Internet Measurement Conference
    October 28 - 30, 2015
    Tokyo, Japan

    Acceptance Rates

    IMC '15 Paper Acceptance Rate 31 of 96 submissions, 32%;
    Overall Acceptance Rate 277 of 1,083 submissions, 26%

    Upcoming Conference

    IMC '24
    ACM Internet Measurement Conference
    November 4 - 6, 2024
    Madrid , AA , Spain

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 26 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Non-invasive performance prediction of high-speed softwarized network services with limited knowledgeIEEE INFOCOM 2024 - IEEE Conference on Computer Communications10.1109/INFOCOM52122.2024.10621097(2328-2337)Online publication date: 20-May-2024
    • (2024)Graph neural network based robust anomaly detection at service level in SDN driven microservice systemComputer Networks: The International Journal of Computer and Telecommunications Networking10.1016/j.comnet.2023.110135239:COnline publication date: 1-Feb-2024
    • (2023)Performance analysis of DPDK-based applications through tracingJournal of Parallel and Distributed Computing10.1016/j.jpdc.2022.10.012173:C(1-19)Online publication date: 1-Mar-2023
    • (2021)NFV Platforms: Taxonomy, Design Choices and Future ChallengesIEEE Transactions on Network and Service Management10.1109/TNSM.2020.304538118:1(30-48)Online publication date: Mar-2021
    • (2021)Towards a Network Queuing Assessment for Elasticity Management of Virtualized Services2021 IEEE 18th Annual Consumer Communications & Networking Conference (CCNC)10.1109/CCNC49032.2021.9369609(1-6)Online publication date: 9-Jan-2021
    • (2020)MicroscopeProceedings of the Annual conference of the ACM Special Interest Group on Data Communication on the applications, technologies, architectures, and protocols for computer communication10.1145/3387514.3405876(390-403)Online publication date: 30-Jul-2020
    • (2020)NFV Data Centers: A Systematic ReviewIEEE Access10.1109/ACCESS.2020.29735688(51713-51735)Online publication date: 2020
    • (2019)Performance contracts for software network functionsProceedings of the 16th USENIX Conference on Networked Systems Design and Implementation10.5555/3323234.3323277(517-530)Online publication date: 26-Feb-2019
    • (2019)Fault Diagnosis for the Virtualized Network in the Cloud Environment using Reinforcement Learning2019 IEEE International Conference on Smart Cloud (SmartCloud)10.1109/SmartCloud.2019.00047(231-236)Online publication date: Dec-2019
    • (2019)Towards Verifiable Performance Measurement over In-the-Cloud MiddleboxesIEEE INFOCOM 2019 - IEEE Conference on Computer Communications10.1109/INFOCOM.2019.8737435(1162-1170)Online publication date: Apr-2019
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media