Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3524273.3528184acmconferencesArticle/Chapter ViewAbstractPublication PagesmmsysConference Proceedingsconference-collections
research-article

RL-AFEC: adaptive forward error correction for real-time video communication based on reinforcement learning

Published: 05 August 2022 Publication History

Abstract

Real-time video communication is profoundly changing people's lives, especially in today's pandemic situation. However, packet loss during video transmission degrades reconstructed video quality, thus impairing users' Quality of Experience (QoE). Forward Error Correction (FEC) techniques are commonly employed in today's audio and video conferencing applications, such as Skype and Zoom, to mitigate the impact of packet loss. FEC helps recover the lost packets during transmissions at the receiver side, but the additional bandwidth consumption is also a concern. Since network conditions are highly dynamic, it is not trivial for FEC to maintain video quality with a fixed bandwidth overhead. In this paper, we propose RL-AFEC, an adaptive FEC scheme based on Reinforcement Learning (RL) to improve reconstructed video quality with an aim to mitigate bandwidth consumption for different network conditions. RL-AFEC learns to select a proper redundancy rate for each video frame, and then adds redundant packets based on the frame-level Reed-Solomon (RS) code. We also implement a novel packet-level Video Quality Assessment (VQA) method based on Video Multimethod Assessment Fusion (VMAF), which leverages Supervised Learning (SL) to generate video quality scores in real time by only extracting information from the packet stream without the need of visual contents. Extensive evaluations demonstrate the superiority of our scheme over other baseline FEC methods.

References

[1]
2019. Preparing Your IP Network for Video Conferencing. https://support.polycom.com/content/dam/polycom-support/products/uc-infrastructure-support/management-scheduling/dma/other-documents/en/preparing-ip-network-video-conferencing.pdf
[2]
Enrico Baccaglini, Tammam Tillo, and Gabriella Olmo. 2008. Slice sorting for unequal loss protection of video streams. IEEE Signal Processing Letters 15 (2008), 581--584.
[3]
BBC. 2021. Facebook remote working plan extended to all staff for long term. Retrieved January 13, 2022 from https://www.bbc.com/news/technology-57425636
[4]
Nicholas Bloom. 2020. Stanford research provides a snapshot of a new working-from-home economy. Retrieved January 13, 2022 from https://news.stanford.edu/2020/06/29/snapshot-new-working-home-economy
[5]
J-C Bolot, Sacha Fosse-Parisis, and Don Towsley. 1999. Adaptive FEC-based error control for Internet telephony. In IEEE INFOCOM'99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No. 99CH36320), Vol. 3. IEEE, 1453--1460.
[6]
Sheng Cheng, Han Hu, Xinggong Zhang, and Zongming Guo. 2020. Deeprs: Deep-learning based network-adaptive fec for real-time video communications. In 2020 IEEE International Symposium on Circuits and Systems (ISCAS). IEEE, 1--5.
[7]
Qin Dai and Ralf Lehnert. 2010. Impact of packet loss on the perceived video quality. In 2010 2nd International Conference on Evolving Internet. IEEE, 206--209.
[8]
Gabriel Dulac-Arnold, Richard Evans, Hado van Hasselt, Peter Sunehag, Timothy Lillicrap, Jonathan Hunt, Timothy Mann, Theophane Weber, Thomas Degris, and Ben Coppin. 2015. Deep reinforcement learning in large discrete action spaces. arXiv preprint arXiv:1512.07679 (2015).
[9]
Edwin O Elliott. 1963. Estimates of error rates for codes on burst-noise channels. The Bell System Technical Journal 42, 5 (1963), 1977--1997.
[10]
Salma Shukry Emara, Silas Fong, Baochun Li, Ashish Khisti, Wai-Tian Tan, Xiaoqing Zhu, and John Apostolopoulos. 2021. Low-latency network-adaptive error control for interactive streaming. IEEE Transactions on Multimedia (2021).
[11]
Edgar N Gilbert. 1960. Capacity of a burst-noise channel. Bell system technical journal 39, 5 (1960), 1253--1265.
[12]
Sheila S Hemami and Amy R Reibman. 2010. No-reference image and video quality estimation: Applications and human-motivated design. Signal processing: Image communication 25, 7 (2010), 469--481.
[13]
Te-Yuan Huang, Polly Huang, Kuan-Ta Chen, and Po-Jung Wang. 2010. Could Skype be more satisfying? A QoE-centric study of the FEC mechanism in an Internet-scale VoIP system. IEEE Network 24, 2 (2010), 42--48.
[14]
Mushahid Hussain and Abdul Hameed. 2018. Adaptive video-aware forward error correction code allocation for reliable video transmission. Signal, Image and Video Processing 12, 1 (2018), 161--169.
[15]
Cisco Global Cloud Index. 2018. Forecast and methodology, 2016--2021 white paper. Updated: February 1 (2018).
[16]
Wenyu Jiang and Henning Schulzrinne. 2000. Modeling of packet loss and delay and their effect on real-time multimedia service quality. In Proc. NOSSDAV.
[17]
Heather Kelly. 2020. Twitter employees don't ever have to go back to the office (unless they want to). Retrieved January 13, 2022 from https://www.washingtonpost.com/technology/2020/05/12/twitter-work-home
[18]
Eymen Kurdoglu, Yong Liu, and Yao Wang. 2017. Perceptual quality maximization for video calls with packet losses by optimizing FEC, frame rate, and quantization. IEEE Transactions on Multimedia 20, 7 (2017), 1876--1887.
[19]
Zhi Li, Christos Bampis, Julie Novak, Anne Aaron, Kyle Swanson, Anush Moorthy, and JD Cock. 2018. VMAF: The journey continues. Netflix Technology Blog 25 (2018).
[20]
Zhi Li, Christos Bampis, Julie Novak, Anne Aaron, Kyle Swanson, Anush Moorthy, and JD Cock. 2018. VMAF: The journey continues. Netflix Technology Blog 25 (2018).
[21]
Qiyong Liu, Zhaofeng Jia, Kai Jin, Jing Wu, and Huipin Zhang. 2019. Error resilience for interactive real-time multimedia application. US Patent 10,348,454.
[22]
Hongzi Mao, Ravi Netravali, and Mohammad Alizadeh. 2017. Neural adaptive video streaming with pensieve. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication. 197--210.
[23]
Anish Mittal, Anush K Moorthy, and Alan C Bovik. 2011. Blind/referenceless image spatial quality evaluator. In 2011 conference record of the forty fifth asilomar conference on signals, systems and computers (ASILOMAR). IEEE, 723--727.
[24]
Anish Mittal, Rajiv Soundararajan, and Alan C Bovik. 2012. Making a "completely blind" image quality analyzer. IEEE Signal processing letters 20, 3 (2012), 209--212.
[25]
Volodymyr Mnih, Adria Puigdomenech Badia, Mehdi Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. 2016. Asynchronous methods for deep reinforcement learning. In International conference on machine learning. PMLR, 1928--1937.
[26]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing Atari with Deep Reinforcement Learning. (2013). http://arxiv.org/abs/1312.5602 NIPS Deep Learning Workshop 2013.
[27]
Chinmay Padhye, Kenneth J Christensen, and Wilfrido Moreno. 2000. A new adaptive FEC loss control algorithm for voice over IP applications. In Conference Proceedings of the 2000 IEEE International Performance, Computing, and Communications Conference (Cat. No. 00CH37086). IEEE, 307--313.
[28]
Lawrence Rabiner and Biinghwang Juang. 1986. An introduction to hidden Markov models. ieee assp magazine 3, 1 (1986), 4--16.
[29]
S Rajagopalan. 2020. An Overview of SD-WAN Load Balancing for WAN Connections. In 2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA). IEEE, 1--4.
[30]
Ashwin Rao, Arnaud Legout, Yeon-sup Lim, Don Towsley, Chadi Barakat, and Walid Dabbous. 2011. Network characteristics of video streaming traffic. In Proceedings of the Seventh COnference on emerging Networking EXperiments and Technologies. 1--12.
[31]
Irving S Reed and Gustave Solomon. 1960. Polynomial codes over certain finite fields. Journal of the society for industrial and applied mathematics 8, 2 (1960), 300--304.
[32]
P. Seeling and M. Reisslein. 2012. Video Transport Evaluation With H.264 Video Traces. IEEE Communications Surveys and Tutorials, in print 14, 4 (2012), 1142--1165. Traces available at trace.eas.asu.edu.
[33]
David Silver, Aja Huang, Chris J Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016. Mastering the game of Go with deep neural networks and tree search. nature 529, 7587 (2016), 484--489.
[34]
Suramya Tomar. 2006. Converting video formats with FFmpeg. Linux Journal 2006, 146 (2006), 10.
[35]
Tomoaki Tsugawa, Norihito Fujita, Takayuki Hama, Hideyuki Shimonishi, and Tutomu Murase. 2007. TCP-AFEC: An adaptive FEC code control for end-to-end bandwidth guarantee. In Packet Video 2007. IEEE, 294--301.
[36]
Thierry Turletti and Christian Huitema. 1996. Videoconferencing on the Internet. IEEE/ACM Transactions on networking 4, 3 (1996), 340--351.
[37]
Michael Wood. 2017. How to make SD-WAN secure. Network Security 2017, 1 (2017), 12--14.
[38]
Huahui Wu, Mark Claypool, and Robert Kinicki. 2005. Adjusting forward error correction with temporal scaling for TCP-friendly streaming MPEG. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 1, 4 (2005), 315--337.
[39]
Jiyan Wu, Bo Cheng, Ming Wang, and Junliang Chen. 2016. Priority-aware FEC coding for high-definition mobile video delivery using TCP. IEEE Transactions on Mobile Computing 16, 4 (2016), 1090--1106.
[40]
Jiyan Wu, Chau Yuen, and Junliang Chen. 2015. Leveraging the delay-friendliness of TCP with FEC coding in real-time video communication. IEEE Transactions on Communications 63, 10 (2015), 3584--3599.
[41]
Jimin Xiao, Tammam Tillo, Chunyu Lin, and Yao Zhao. 2012. Dynamic sub-GOP forward error correction code for real-time video applications. IEEE Transactions on Multimedia 14, 4 (2012), 1298--1308.
[42]
Jimin Xiao, Tammam Tillo, and Yao Zhao. 2013. Real-time video streaming using randomized expanding Reed-Solomon code. IEEE transactions on circuits and systems for video technology 23, 11 (2013), 1825--1836.
[43]
XK Yang, Ce Zhu, ZG Li, Xiao Lin, GN Feng, Si Wu, and Nam Ling. 2003. Unequal loss protection for robust transmission of motion compensated video over the internet. Signal Processing: Image Communication 18, 3 (2003), 157--167.
[44]
Zhenjie Yang, Yong Cui, Baochun Li, Yadong Liu, and Yi Xu. 2019. Software-defined wide area network (SD-WAN): Architecture, advances and opportunities. In 2019 28th International Conference on Computer Communication and Networks (ICCCN). IEEE, 1--9.
[45]
Deheng Ye, Zhao Liu, Mingfei Sun, Bei Shi, Peilin Zhao, Hao Wu, Hongsheng Yu, Shaojie Yang, Xipeng Wu, Qingwei Guo, et al. 2020. Mastering complex control in moba games with deep reinforcement learning. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 6672--6679.
[46]
Minghao Ye, Junjie Zhang, Zehua Guo, and H. Jonathan Chao. 2021. DATE: Disturbance-Aware Traffic Engineering with Reinforcement Learning in Software-Defined Networks. In 2021 IEEE/ACM 29th International Symposium on Quality of Service (IWQOS). 1--10.
[47]
Junjie Zhang, Minghao Ye, Zehua Guo, Chen-Yu Yen, and H Jonathan Chao. 2020. CFR-RL: Traffic engineering with reinforcement learning in SDN. IEEE Journal on Selected Areas in Communications 38, 10 (2020), 2249--2259.

Cited By

View all
  • (2024)Real-Time Video Streaming in MPT-GRE Multipath Networks2024 International Conference on Software, Telecommunications and Computer Networks (SoftCOM)10.23919/SoftCOM62040.2024.10721766(1-7)Online publication date: 26-Sep-2024
  • (2024)Mustang: Improving QoE for Real-Time Video in Cellular Networks by Masking JitterACM Transactions on Multimedia Computing, Communications, and Applications10.1145/367239920:9(1-23)Online publication date: 10-Jun-2024
  • (2024)A First Look at FEC Code Rate Determination from a Computational Cost Perspective2024 IEEE Wireless Communications and Networking Conference (WCNC)10.1109/WCNC57260.2024.10570633(1-6)Online publication date: 21-Apr-2024
  • Show More Cited By

Index Terms

  1. RL-AFEC: adaptive forward error correction for real-time video communication based on reinforcement learning

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MMSys '22: Proceedings of the 13th ACM Multimedia Systems Conference
    June 2022
    432 pages
    ISBN:9781450392839
    DOI:10.1145/3524273
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 05 August 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Badges

    Author Tags

    1. forward error correction
    2. real-time video communication
    3. reinforcement learning
    4. video quality assessment

    Qualifiers

    • Research-article

    Conference

    MMSys '22
    Sponsor:
    MMSys '22: 13th ACM Multimedia Systems Conference
    June 14 - 17, 2022
    Athlone, Ireland

    Acceptance Rates

    Overall Acceptance Rate 176 of 530 submissions, 33%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)121
    • Downloads (Last 6 weeks)14
    Reflects downloads up to 01 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Real-Time Video Streaming in MPT-GRE Multipath Networks2024 International Conference on Software, Telecommunications and Computer Networks (SoftCOM)10.23919/SoftCOM62040.2024.10721766(1-7)Online publication date: 26-Sep-2024
    • (2024)Mustang: Improving QoE for Real-Time Video in Cellular Networks by Masking JitterACM Transactions on Multimedia Computing, Communications, and Applications10.1145/367239920:9(1-23)Online publication date: 10-Jun-2024
    • (2024)A First Look at FEC Code Rate Determination from a Computational Cost Perspective2024 IEEE Wireless Communications and Networking Conference (WCNC)10.1109/WCNC57260.2024.10570633(1-6)Online publication date: 21-Apr-2024
    • (2024)Inferring Video Streaming Quality of Real-Time Communication Inside NetworkIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2024.337560434:8(7756-7770)Online publication date: Aug-2024
    • (2024)An Efficient FEC Scheme with SLA Consideration for Low Latency TransmissionsNOMS 2024-2024 IEEE Network Operations and Management Symposium10.1109/NOMS59830.2024.10575194(1-9)Online publication date: 6-May-2024
    • (2024)Optimizing Quality and Energy Efficiency in Webrtc with ML-Powered Adaptive FEC2024 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)10.1109/ICMEW63481.2024.10645390(1-6)Online publication date: 15-Jul-2024
    • (2024)Inferring in-Network Queue Management from End Hosts in Real-Time CommunicationsICC 2024 - IEEE International Conference on Communications10.1109/ICC51166.2024.10622436(3389-3395)Online publication date: 9-Jun-2024
    • (2024)An Adaptive Forward Error Correction Method based on Deep Learning for Real-Time Video Transmission2024 3rd International Conference on Big Data, Information and Computer Network (BDICN)10.1109/BDICN62775.2024.00024(92-96)Online publication date: 12-Jan-2024
    • (2024)Transport Layer on Data Path: Differentiating RetransmissionsLatency Optimization in Interactive Multimedia Streaming10.1007/978-981-97-6729-8_6(87-108)Online publication date: 30-Oct-2024
    • (2023)RTCSR: Zero-latency Aware Super-resolution for WebRTC Mobile Video StreamingProceedings of the 2023 Workshop on Emerging Multimedia Systems10.1145/3609395.3610601(54-59)Online publication date: 10-Sep-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media