Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Planck: millisecond-scale monitoring and control for commodity networks

Published: 17 August 2014 Publication History

Abstract

Software-defined networking introduces the possibility of building self-tuning networks that constantly monitor network conditions and react rapidly to important events such as congestion. Unfortunately, state-of-the-art monitoring mechanisms for conventional networks require hundreds of milliseconds to seconds to extract global network state, like link utilization or the identity of "elephant" flows. Such latencies are adequate for responding to persistent issues, e.g., link failures or long-lasting congestion, but are inadequate for responding to transient problems, e.g., congestion induced by bursty workloads sharing a link. In this paper, we present Planck, a novel network measurement architecture that employs oversubscribed port mirroring to extract network information at 280 µs--7 ms timescales on a 1 Gbps commodity switch and 275 µs--4 ms timescales on a 10 Gbps commodity switch,over 11x and 18x faster than recent approaches, respectively (and up to 291x if switch firmware allowed buffering to be disabled on some ports). To demonstrate the value of Planck's speed and accuracy, we use it to drive a traffic engineering application that can reroute congested flows in milliseconds. On a 10 Gbps commodity switch, Planck-driven traffic engineering achieves aggregate throughput within 1--4% of optimal for most workloads we evaluated, even with flows as small as 50 MiB, an improvement of up to 53% over previous schemes.

References

[1]
M. Al-Fares, A. Loukissas, and A. Vahdat. A Scalable, Commodity Data Center Network Architecture. In SIGCOMM, 2008.
[2]
M. Al-fares, S. Radhakrishnan, B. Raghavan, N. Huang, and A. Vahdat. Hedera: Dynamic Flow Scheduling for Data Center Networks. In NSDI, 2010.
[3]
B. Claise, Ed. Cisco Systems NetFlow Services Export Version 9. RFC 3954. http://www.ietf.org/rfc/rfc3954.txt, October 2004.
[4]
T. Benson, A. Anand, A. Akella, and M. Zhang. MicroTE: Fine Grained Traffic Engineering for Data Centers. In CoNEXT, 2011.
[5]
A. Curtis, W. Kim, and P. Yalagandula. Mahout: Low-Overhead Datacenter Traffic Management using End-Host-Based Elephant Detection. In INFOCOM, 2011.
[6]
A. R. Curtis, J. C. Mogul, J. Tourrilhes, P. Yalagandula, P. Sharma, and S. Banerjee. DevoFlow: Scaling Flow Management for High-Performance Networks. In SIGCOMM, 2011.
[7]
N. Duffield, C. Lund, and M. Thorup. Learn More, Sample Less: Control of Volume and Variance in Network Measurement. IEEE Transactions on Information Theory, 51(5):1756--1775, 2005.
[8]
C. Estan, K. Keys, D. Moore, and G. Varghese. Building a Better NetFlow. In SIGCOMM, 2004.
[9]
C. Estan and G. Varghese. New Directions in Traffic Measurement and Accounting. In SIGCOMM, 2002.
[10]
N. Farrington, G. Porter, S. Radhakrishnan, H. H. Bazzaz, V. Subramanya, Y. Fainman, G. Papen, and A. Vahdat. Helios: A Hybrid Electrical/Optical Switch Architecture for Modular Data Centers. In SIGCOMM, 2010.
[11]
A. D. Ferguson, A. Guha, C. Liang, R. Fonseca, and S. Krishnamurthi. Participatory Networking: An API for Application Control of SDNs. In SIGCOMM, 2013.
[12]
Floodlight OpenFlow Controller. http://floodlight.openflowhub.org/.
[13]
A. Greenberg, J. R. Hamilton, N. Jain, S. Kandula, C. Kim, P. Lahiri, D. A. Maltz, P. Patel, and S. Sengupta. VL2: A scalable and flexible data center network. In SIGCOMM, 2009.
[14]
D. Halperin, S. Kandula, J. Padhye, P. Bahl, and D. Wetherall. Augmenting Data Center Networks with Multi-Gigabit Wireless Links. In SIGCOMM, 2011.
[15]
C.-Y. Hong, S. Kandula, R. Mahajan, M. Zhang, V. Gill, M. Nanduri, and R. Wattenhofer. Achieving High Utilization with Software-driven WAN. In SIGCOMM, 2013.
[16]
IBM BNT RackSwitch G8264. http://www.redbooks.ibm.com/abstracts/tips0815.html.
[17]
Intel® Data Plane Development Kit (Intel® DPDK) Overview Packet Processing on Intel® Architecture. http://goo.gl/qdg3rZ, December 2012.
[18]
Intel® DPDK: Data Plane Development Kit. http://www.dpdk.org.
[19]
V. Jacobson. Van Jacobson's Network Channels. http://lwn.net/Articles/169961/, January 2006.
[20]
S. Jain, A. Kumar, S. Mandal, J. Ong, L. Poutievski, A. Singh, S. Venkata, J. Wanderer, J. Zhou, M. Zhu, J. Zolla, U. Hölzle, S. Stuart, and A. Vahdat. B4: Experience with a Globally-deployed Software Defined WAN. In SIGCOMM, 2013.
[21]
V. Jeyakumar, M. Alizadeh, D. Mazières, B. Prabhakar, C. Kim, and A. Greenberg. EyeQ: Practical Network Performance Isolation at the Edge. In NSDI, 2013.
[22]
S. Kandula, S. Sengupta, A. Greenberg, and P. Patel. The Nature of Datacenter Traffic: Measurements and Analysis. In IMC, 2009.
[23]
R. Kapoor, A. C. Snoeren, G. M. Voelker, and G. Porter. Bullet Trains: A Study of NIC Burst Behavior at Microsecond Timescales. In CoNEXT, 2013.
[24]
E. Kohler, R. Morris, B. Chen, J. Jannotti, and M. F. Kaashoek. The Click Modular Router. ToCS, 18(3):263--297, August 2000.
[25]
J. C. Mogul and P. Congdon. Hey, You Darned Counters! Get Off My ASIC! In HotSDN, 2012.
[26]
Openflow-switch. https://www.opennetworking.org/standards/openflow-switch.
[27]
Libpcap File Format. http://wiki.wireshark.org/Development/LibpcapFileFormat.
[28]
PF_RING: High-speed Packet Capture, Filtering and Analysis. http://www.ntop.org/products/pf_ring/.
[29]
P. Phaal and S. Panchen. Packet Sampling Basics. http://www.sflow.org/packetSamplingBasics/index.htm.
[30]
L. Rizzo. Netmap: A Novel Framework for Fast Packet I/O. In USENIX ATC, 2012.
[31]
S. Sen, D. Shue, S. Ihm, and M. J. Freedman. Scalable, Optimal Flow Routing in Datacenters via Local Link Balancing. In CoNEXT, 2013.
[32]
sFlow. http://sflow.org/about/index.php.
[33]
sFlow-RT. http://inmon.com/products/sFlow-RT.php.
[34]
Large Flow Detection Script. http://blog.sflow.com/2013/06/large-flow-detection-script.html, June 2013.
[35]
A. Shieh, S. Kandula, A. Greenberg, C. Kim, and B. Saha. Sharing the Data Center Network. In NSDI, 2011.
[36]
A. Shieh, S. Kandula, and E. G. Sirer. SideCar: Building Programmable Datacenter Networks without Programmable Switches. In HotNets, 2010.
[37]
A. Singla, C.-Y. Hong, L. Popa, and P. B. Godfrey. Jellyfish: Networking Data Centers Randomly. In NSDI, 2012.
[38]
Version 2 of the Protocol Operations for the Simple Network Management Protocol (SNMP). RFC 3416. http://www.ietf.org/rfc/rfc3416.txt.
[39]
B. Stephens, A. Cox, W. Felter, C. Dixon, and J. Carter. PAST: Scalable Ethernet for Data Centers. In CoNEXT, 2012.
[40]
G. P. R. Strong, N. Farrington, A. Forencich, P. Chen-Sun, T. Rosing, Y. Fainman, G. Papen, and A. Vahdat. Integrating Microsecond Circuit Switching into the Data Center. In SIGCOMM, 2013.
[41]
J. Suh, T. T. Kwon, C. Dixon, W. Felter, and J. Carter. OpenSample: A Low-latency, Sampling-based Measurement Platform for SDN. In ICDCS, 2014.
[42]
G. Wang, D. G. Andersen, M. Kaminsky, K. Papagiannaki, T. S. E. Ng, M. Kozuch, and M. Ryan. c-Through: Part-time Optics in Data Centers. In SIGCOMM, 2010.
[43]
Wireshark. http://www.wireshark.org/.
[44]
M. Yu, L. Jose, and R. Miao. Software Defined Traffic Measurement with OpenSketch. In NSDI, 2013.

Cited By

View all
  • (2024)NetGSR: Towards Efficient and Reliable Network Monitoring with Generative Super ResolutionProceedings of the ACM on Networking10.1145/36964002:CoNEXT4(1-27)Online publication date: 25-Nov-2024
  • (2024)Hostmesh: Monitor and Diagnose Networks in Rail-optimized RoCE ClustersProceedings of the 8th Asia-Pacific Workshop on Networking10.1145/3663408.3663426(122-128)Online publication date: 3-Aug-2024
  • (2024)R-Pingmesh: A Service-Aware RoCE Network Monitoring and Diagnostic SystemProceedings of the ACM SIGCOMM 2024 Conference10.1145/3651890.3672264(554-567)Online publication date: 4-Aug-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM SIGCOMM Computer Communication Review
ACM SIGCOMM Computer Communication Review  Volume 44, Issue 4
SIGCOMM'14
October 2014
672 pages
ISSN:0146-4833
DOI:10.1145/2740070
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 August 2014
Published in SIGCOMM-CCR Volume 44, Issue 4

Check for updates

Author Tags

  1. networking measurement
  2. software-defined networking
  3. traffic engineering

Qualifiers

  • Research-article

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)140
  • Downloads (Last 6 weeks)30
Reflects downloads up to 23 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)NetGSR: Towards Efficient and Reliable Network Monitoring with Generative Super ResolutionProceedings of the ACM on Networking10.1145/36964002:CoNEXT4(1-27)Online publication date: 25-Nov-2024
  • (2024)Hostmesh: Monitor and Diagnose Networks in Rail-optimized RoCE ClustersProceedings of the 8th Asia-Pacific Workshop on Networking10.1145/3663408.3663426(122-128)Online publication date: 3-Aug-2024
  • (2024)R-Pingmesh: A Service-Aware RoCE Network Monitoring and Diagnostic SystemProceedings of the ACM SIGCOMM 2024 Conference10.1145/3651890.3672264(554-567)Online publication date: 4-Aug-2024
  • (2024)Zoom2Net: Constrained Network Telemetry ImputationProceedings of the ACM SIGCOMM 2024 Conference10.1145/3651890.3672225(764-777)Online publication date: 4-Aug-2024
  • (2024)Link Traffic-Delay Mapping Model Learning Based on Multi-Class Samples in Software-Defined NetworksIEEE Transactions on Services Computing10.1109/TSC.2024.3463198(1-16)Online publication date: 2024
  • (2024)Proactive Telemetry in Large-Scale Multi-Tenant Cloud Overlay NetworksIEEE/ACM Transactions on Networking10.1109/TNET.2024.338178632:4(3002-3017)Online publication date: Aug-2024
  • (2024)MUSE: A Runtime Incrementally Reconfigurable Network Adapting to HPC Real-Time Traffic2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS57955.2024.00073(765-779)Online publication date: 27-May-2024
  • (2024)SpotMon: Enabling General Hotspot Monitoring in Key-Value Stores2024 IEEE 32nd International Conference on Network Protocols (ICNP)10.1109/ICNP61940.2024.10858526(1-12)Online publication date: 28-Oct-2024
  • (2024)FARM: Comprehensive Data Center Network Monitoring and Management2024 IEEE 44th International Conference on Distributed Computing Systems (ICDCS)10.1109/ICDCS60910.2024.00055(520-530)Online publication date: 23-Jul-2024
  • (2024)A First Look at the Impact of Measurement on Orchestrating Digital Twin Network2024 IEEE 13th International Conference on Cloud Networking (CloudNet)10.1109/CloudNet62863.2024.10815817(1-6)Online publication date: 27-Nov-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media