Breaking the Inertial Thinking: Non-Blocking Multipath Congestion Control Based on the Single-Subflow Reinforcement Learning Model

Published: 25 March 2024

Abstract

The Multipath TCP (MPTCP) protocol has received growing attention as the number of terminals with multiple network interfaces increases. To meet the higher network performance demands of terminal services, many studies leverage reinforcement learning (RL) in MPTCP congestion control (CC) algorithms to improve MPTCP performance. However, we observe two limitations that make existing RL-based mechanisms impractical: 1) They remain bound by the input and output dimensions of the RL model, so they cannot adapt to a varying number of subflows. 2) Frequent model decisions block packet transmission, leading to under-utilization of bandwidth. This paper breaks the inertial thinking to overcome these limitations and proposes Maggey, a non-blocking CC mechanism that applies the single-subflow model to multipath transmission. (By “inertial thinking” we mean the default reaction of others when dealing with CC in MPTCP: given the interdependence between MPTCP subflows, scholars have traditionally opted for coupled CC. We challenge this convention by handling the CC of the different subflows in a single MPTCP flow independently while still ensuring fairness.) To this end, Maggey employs loosely coupled design principles and a unique reward function to ensure the fairness of the algorithm. Additionally, Maggey introduces iterative training to ensure that the single-subflow model is trained accurately. Furthermore, a mode transition framework is designed to avoid blocking while preserving the flexibility of RL-based CCs. These two features enhance the practicability of Maggey, and the paper analyzes its stability. We implement Maggey in the Linux kernel and evaluate its performance through extensive emulation and live experiments. The evaluation results show that Maggey improves throughput by 26% over DRL-CC at high bandwidth and by 2%-60% over traditional algorithms under different network conditions. In addition, Maggey maintains fairness in different scenarios.
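
The first limitation can be made concrete with a minimal sketch. It is an illustration under stated assumptions, not Maggey's implementation: the feature set in `SubflowState`, the rule inside `toy_policy` (a stand-in for a trained single-subflow model), and the helper `step_connection` are all hypothetical names introduced here. The point is only that a policy whose input is one subflow's fixed-size state can be applied to any number of subflows, whereas a joint model over all subflows would need its input and output dimensions fixed in advance.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class SubflowState:
    """Fixed-size observation for ONE subflow (hypothetical feature set)."""
    rtt_ms: float
    loss_rate: float
    delivery_rate_mbps: float
    cwnd: int  # congestion window, in packets


def toy_policy(state: SubflowState) -> int:
    """Hypothetical stand-in for a trained single-subflow model.

    Returns the next cwnd for this subflow only: halve on noticeable loss,
    otherwise grow by one packet. A real RL policy would replace this rule.
    """
    if state.loss_rate > 0.01:
        return state.cwnd // 2
    return state.cwnd + 1


def step_connection(
    subflows: List[SubflowState],
    policy: Callable[[SubflowState], int] = toy_policy,
) -> List[int]:
    """Apply the same single-subflow policy to each subflow independently.

    The policy's input size never depends on len(subflows), so subflows can
    be added or removed without retraining or reshaping the model.
    """
    return [max(2, policy(s)) for s in subflows]


if __name__ == "__main__":
    two = [
        SubflowState(rtt_ms=20.0, loss_rate=0.0, delivery_rate_mbps=50.0, cwnd=10),
        SubflowState(rtt_ms=80.0, loss_rate=0.05, delivery_rate_mbps=5.0, cwnd=20),
    ]
    print(step_connection(two))
    # A third subflow joins; the same policy is reused unchanged.
    three = two + [SubflowState(rtt_ms=40.0, loss_rate=0.0, delivery_rate_mbps=10.0, cwnd=1)]
    print(step_connection(three))
```

This sketch leaves out what the abstract attributes to the reward function and loosely coupled design, namely keeping independently controlled subflows fair to competing flows, as well as the mode transition framework that avoids blocking.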


Published In

IEEE Transactions on Network and Service Management, Volume 21, Issue 3
June 2024
1087 pages

Publisher

IEEE Press
