Abstract
Network troubleshooting usually requires packet level traffic capturing and analyzing. Indeed, the observation of emission patterns sheds some light on the kind of degradation experienced by a connection. In the case of reliable transport traffic where congestion control is performed, such as TCP and QUIC traffic, these patterns are the fruit of decisions made by the congestion control algorithm (CCA), according to its own perception of network conditions. The CCA estimates the bottleneck’s capacity via an exponential probing, during the so-called “Slow-Start” (SS) state. The bottleneck is considered reached upon reception of congestion signs, typically lost packets or abnormal packet delays depending on the version of CCA used. The SS state duration is thus a key indicator for the diagnosis of faults; this indicator is estimated empirically by human experts today, which is time-consuming and a cumbersome task with large error margins. This paper proposes a method to automatically identify the slow-start state from actively and passively obtained bidirectional packet traces. It relies on an innovative timeless representation of the observed packets series. We implemented our method in our active and passive probes and tested it with CUBIC and BBR under different network conditions. We then picked a few real-life examples to illustrate the value of our representation for easy discrimination between typical faults and for identifying BBR among CCAs variants.
Similar content being viewed by others
Data Availability
The packet captures that were collected using the operator (Orange) probes and servers can not be shared due to confidentiel reasonning
Notes
Tool available online at https://193.252.113.227/cgi-bin/ats.cgi.
References
Claise B (2004) Cisco Systems NetFlow Services Export Version 9. RFC Editor. https://doi.org/10.17487/RFC3954. https://www.rfc-editor.org/info/rfc3954
Yu M, Jose L, Miao R (2013) Software defined traffic measurement with OpenSketch. In: Proceedings of the 10th USENIX Conference on Networked Systems Design and Implementation. nsdi’13, pp. 29–42. USENIX Association, USA
Panchen S, McKee N, Phaal P (2001) InMon Corporation’s sFlow: a method for monitoring traffic in switched and routed networks. RFC Editor. https://doi.org/10.17487/RFC3176. https://www.rfc-editor.org/info/rfc3176
Veal B, Li K, Lowenthal D (2005) New methods for passive estimation of TCP roundtrip times. In: Dovrolis C (ed) Passive and Active Network Measurement. Springer, Berlin, Heidelberg, pp 121–134
Benko P, Veres A (2002) A passive method for estimating end-to-end TCP packet loss. In: Global Telecommunications Conference, 2002. GLOBECOM ’02. IEEE, vol. 3, pp. 2609–26133. https://doi.org/10.1109/GLOCOM.2002.1189102
Iyengar J, Thomson M (2021) QUIC: a UDP-based multiplexed and secure transport. RFC Editor. https://doi.org/10.17487/RFC9000. https://www.rfc-editor.org/info/rfc9000
Ha S, Rhee I, Xu L (2008) CUBIC: a new TCP-friendly high-speed TCP variant. SIGOPS Oper Syst Rev 42(5):64–74. https://doi.org/10.1145/1400097.1400105
Cardwell N, Cheng Y, Yeganeh SH, Swett I, Jacobson V (2022) BBR congestion control. Internet-Draft draft-cardwell-iccrg-bbr-congestion-control-02, Internet Engineering Task Force (March). Work in Progress. https://datatracker.ietf.org/doc/draft-cardwell-iccrg-bbr-congestion-control/02/
Tlaiss Z, Hamchaoui I, Amigo I, Ferrieux A, Vaton S (2023) Troubleshooting enhancement with automated slow-start detection. In: 2023 26th Conference on Innovation in Clouds, Internet and Networks and Workshops (ICIN), pp. 129–136. https://doi.org/10.1109/ICIN56760.2023.10073485
Floyd S (2000) Congestion control principles. RFC Editor. https://doi.org/10.17487/RFC2914. https://www.rfc-editor.org/info/rfc2914
Mishra A, Sun X, Jain A, Pande S, Joshi R, Leong B (2019) The great internet TCP congestion control census. Proceed ACM Measure Anal Comput Syst 3:1–24. https://doi.org/10.1145/3366693
Balasubramanian P, Huang Y, Olson M (2023) HyStart++: modified slow start for TCP. Internet-Draft draft-ietf-tcpm-hystartplusplus-13, Internet Engineering Task Force (January). Work in Progress. https://datatracker.ietf.org/doc/draft-ietf-tcpm-hystartplusplus/13/
Blanton E, Paxson DV, Allman M (2009) TCP congestion control. RFC Editor. https://doi.org/10.17487/RFC5681. https://www.rfc-editor.org/info/rfc5681
Rüth J, Kunze I, Hohlfeld O (2019) TCP’s initial window–deployment in the wild and its impact on performance. IEEE Trans Netw Serv Manage 16(2):389–402. https://doi.org/10.1109/TNSM.2019.2896335
Stevens WR (1997) TCP slow start, congestion avoidance, fast retransmit, and fast recovery algorithms. RFC Editor. https://doi.org/10.17487/RFC2001. https://www.rfc-editor.org/info/rfc2001
Cardwell N, Cheng Y, Gunn CS, Yeganeh SH, Jacobson V (2017) BBR: congestion-based congestion control. Commun ACM 60(2):58–66. https://doi.org/10.1145/3009824
Sanders C, Smith J (2014) Applied network security monitoring, p. Syngress, Boston. https://doi.org/10.1016/B978-0-12-417208-1.09984-0 . https://www.sciencedirect.com/science/article/pii/B9780124172081099840
Casey E, Altheide C, Daywalt C, de Donno A, Forte D, Holley JO, Johnston A, van der Knijff R, Kokocinski A, Luehr PH, Maguire T, Pittman RD, Rose CW, Schwerha JJ, Shaver D, Smith JR (2010) Handbook of digital forensics and investigation, pp. 1–17. Academic Press, San Diego. https://doi.org/10.1016/B978-0-12-374267-4.00001-X. https://www.sciencedirect.com/science/article/pii/B978012374267400001X
Transmission control protocol. RFC Editor (1981). https://doi.org/10.17487/RFC0793. https://www.rfc-editor.org/info/rfc793
Kary: Understanding the tcptrace time-sequence graph in wireshark. https://packetbomb.com/understanding-the-tcptrace-time-sequence-graph-in-wireshark/
Tlaiss Z (2021) Anomaly root cause diagnosis from active and passive measurement analysis. In: 2021 33th International Teletraffic Congress (ITC-33), pp. 1–3
Hagos DH, Engelstad PE, Yazidi A, Kure O (2018) General TCP state inference model from passive measurements using machine learning techniques. IEEE Access 6:28372–28387. https://doi.org/10.1109/ACCESS.2018.2833107
Padhye J, Floyd S (2001) On inferring TCP behavior. ACM SIGCOMM Comput Commun Rev. doi 10(1145/383059)
Yang P, Luo W, Xu L, Deogun J, Lu Y (2011) TCP congestion avoidance algorithm identification. In: 2011 31st International Conference on Distributed Computing Systems, pp. 310–321. https://doi.org/10.1109/ICDCS.2011.27
Jaiswal S, Iannaccone G, Diot C, Kurose J, Towsley D (2004) Inferring TCP connection characteristics through passive measurements, pp. 1582–15923. https://doi.org/10.1109/INFCOM.2004.1354571
Toshihiko Kato RY, Leelianou Yongxialee Ohzahata S (2016) A study on how to characterize TCP congestion control algorithms from unidirectional packet traces. ICIMP 2016 : The Eleventh International Conference on Internet Monitoring and Protection
Kato T, Yan X, Yamamoto R, Ohzahata S (2018) Identification of TCP congestion control algorithms from unidirectional packet traces, pp. 22–27. https://doi.org/10.1145/3291842.3291922
Zhang Y, Breslau L, Paxson V, Shenker S (2002) On the characteristics and origins of internet flow rates. SIGCOMM Comput Commun Rev 32(4):309–322. https://doi.org/10.1145/964725.633055
Guo C, Yuan L, Xiang D, Dang Y, Huang R, Maltz D, Liu Z, Wang V, Pang B, Chen H, Lin Z-W, Kurien V (2015) Pingmesh: a large-scale system for data center network latency measurement and analysis. SIGCOMM Comput Commun Rev 45(4):139–152. https://doi.org/10.1145/2829988.2787496
Zhu Y, Kang N, Cao J, Greenberg A, Lu G, Mahajan R, Maltz D, Yuan L, Zhang M, Zhao BY, Zheng H (2015) Packet-level telemetry in large datacenter networks. SIGCOMM Comput Commun Rev 45(4):479–491. https://doi.org/10.1145/2829988.2787483
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Tlaiss, Z., Ferrieux, A., Amigo, I. et al. Automated slow-start detection for anomaly root cause analysis and BBR identification. Ann. Telecommun. 79, 149–163 (2024). https://doi.org/10.1007/s12243-023-00982-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12243-023-00982-7