RL-AFEC: adaptive forward error correction for real-time video communication based on reinforcement learning

Authors:

H. Jonathan Chao

MMSys '22: Proceedings of the 13th ACM Multimedia Systems Conference

Pages 96 - 108

https://doi.org/10.1145/3524273.3528184

Published: 05 August 2022

Abstract

Real-time video communication is profoundly changing people's lives, especially during the current pandemic. However, packet loss during video transmission degrades the reconstructed video quality and thus impairs users' Quality of Experience (QoE). Forward Error Correction (FEC) techniques are commonly employed in today's audio and video conferencing applications, such as Skype and Zoom, to mitigate the impact of packet loss. FEC lets the receiver recover packets lost in transmission, but it does so at the cost of additional bandwidth. Because network conditions are highly dynamic, it is not trivial for FEC to maintain video quality with a fixed bandwidth overhead. In this paper, we propose RL-AFEC, an adaptive FEC scheme based on Reinforcement Learning (RL) that aims to improve reconstructed video quality while limiting bandwidth consumption under different network conditions. RL-AFEC learns to select a proper redundancy rate for each video frame and then adds redundant packets using a frame-level Reed-Solomon (RS) code. We also implement a novel packet-level Video Quality Assessment (VQA) method based on Video Multimethod Assessment Fusion (VMAF), which leverages Supervised Learning (SL) to generate video quality scores in real time using only information extracted from the packet stream, without access to the visual content. Extensive evaluations demonstrate the superiority of our scheme over baseline FEC methods.
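To make the frame-level redundancy idea concrete, the following is a minimal Python sketch of the packet arithmetic only. It is not the paper's implementation: the function names (`parity_packets`, `frame_recoverable`, `bandwidth_overhead`) are hypothetical, the redundancy rate is passed in as a plain number where RL-AFEC would use its learned policy, and rounding the parity count up is an assumption. The one property relied on is the standard maximum-distance-separable behavior of an RS(n, k) erasure code: any k of the n transmitted packets are enough to rebuild the frame.

```python
import math


def parity_packets(k: int, redundancy_rate: float) -> int:
    """Redundant packets for a frame of k source packets.

    Rounding up is an assumption of this sketch, not taken from the paper.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    if redundancy_rate < 0:
        raise ValueError("redundancy_rate must be non-negative")
    return math.ceil(k * redundancy_rate)


def frame_recoverable(k: int, parity: int, lost: int) -> bool:
    """True if a frame protected by RS(k + parity, k) survives `lost` erasures.

    An RS erasure code is MDS: any k of the n = k + parity packets suffice,
    so the frame is recoverable exactly when lost <= parity.
    """
    n = k + parity
    if not 0 <= lost <= n:
        raise ValueError("lost must be between 0 and n")
    return lost <= parity


def bandwidth_overhead(k: int, parity: int) -> float:
    """Extra packets sent, as a fraction of the source packets."""
    return parity / k


if __name__ == "__main__":
    k = 10        # source packets in this frame (illustrative value)
    rate = 0.25   # redundancy rate; a stand-in for the policy's choice
    m = parity_packets(k, rate)
    for lost in range(5):
        print(lost, frame_recoverable(k, m, lost), bandwidth_overhead(k, m))
```

The trade-off described in the abstract shows up directly here: a higher redundancy rate lets the frame tolerate more lost packets, but the overhead grows by the same parity count, which is why a fixed rate is a poor fit for changing network conditions.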

Index Terms

RL-AFEC: adaptive forward error correction for real-time video communication based on reinforcement learning
1. Information systems
  1. Information systems applications
    1. Multimedia information systems
      1. Multimedia streaming

Published In

MMSys '22: Proceedings of the 13th ACM Multimedia Systems Conference

June 2022

432 pages

ISBN:9781450392839

DOI:10.1145/3524273

General Chairs: Niall Murray (Technological University of the Shannon: Midlands Midwest), Gwendal Simon (Synamedia), Mylene Farias (University of Brasilia)

Program Chairs: Irene Viola (Centrum Wiskunde & Informatica), Mario Montagud (i2CAT Foundation & University of Valencia)

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

MMSys '22

Sponsor:

SIGMM

MMSys '22: 13th ACM Multimedia Systems Conference

June 14 - 17, 2022

Athlone, Ireland

Acceptance Rates

Overall Acceptance Rate 176 of 530 submissions, 33%

Cited By

Al-Imareen N, Lencse G. (2024). Real-Time Video Streaming in MPT-GRE Multipath Networks. 2024 International Conference on Software, Telecommunications and Computer Networks (SoftCOM), 1-7. Online publication date: 26-Sep-2024.
https://doi.org/10.23919/SoftCOM62040.2024.10721766
Yu E, Zhou J, Li Z, Tyson G, Li W, Zhang X, Xu Z, Xie G. (2024). Mustang: Improving QoE for Real-Time Video in Cellular Networks by Masking Jitter. ACM Transactions on Multimedia Computing, Communications, and Applications 20:9, 1-23. Online publication date: 10-Jun-2024.
https://dl.acm.org/doi/10.1145/3672399
Xu C, Wang J, Xie S, Wang J. (2024). A First Look at FEC Code Rate Determination from a Computational Cost Perspective. 2024 IEEE Wireless Communications and Networking Conference (WCNC), 1-6. Online publication date: 21-Apr-2024.
https://doi.org/10.1109/WCNC57260.2024.10570633
Zhang Y, Cheng S, Guo Z, Zhang X. (2024). Inferring Video Streaming Quality of Real-Time Communication Inside Network. IEEE Transactions on Circuits and Systems for Video Technology 34:8, 7756-7770. Online publication date: Aug-2024.
https://doi.org/10.1109/TCSVT.2024.3375604
Xu C, Wang J, Li R, Li Z, Wu H, Wang J. (2024). An Efficient FEC Scheme with SLA Consideration for Low Latency Transmissions. NOMS 2024-2024 IEEE Network Operations and Management Symposium, 1-9. Online publication date: 6-May-2024.
https://doi.org/10.1109/NOMS59830.2024.10575194
Gerard J, Bonilla D, Bentaleb A, Céspedes S. (2024). Optimizing Quality and Energy Efficiency in Webrtc with ML-Powered Adaptive FEC. 2024 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 1-6. Online publication date: 15-Jul-2024.
https://doi.org/10.1109/ICMEW63481.2024.10645390
Guo Y, Meng Z, Wang B, Xu M. (2024). Inferring in-Network Queue Management from End Hosts in Real-Time Communications. ICC 2024 - IEEE International Conference on Communications, 3389-3395. Online publication date: 9-Jun-2024.
https://doi.org/10.1109/ICC51166.2024.10622436
Li P, Yuan K, Li X, Zhang M. (2024). An Adaptive Forward Error Correction Method based on Deep Learning for Real-Time Video Transmission. 2024 3rd International Conference on Big Data, Information and Computer Network (BDICN), 92-96. Online publication date: 12-Jan-2024.
https://doi.org/10.1109/BDICN62775.2024.00024
Meng Z, Xu M, Meng Z, Xu M. (2024). Transport Layer on Data Path: Differentiating Retransmissions. Latency Optimization in Interactive Multimedia Streaming, 87-108. Online publication date: 30-Oct-2024.
https://doi.org/10.1007/978-981-97-6729-8_6
Yu Q, Li Q, He R, Shi W, Jiang Y, Dasari M, Jiang J, Gorlatova M. (2023). RTCSR: Zero-latency Aware Super-resolution for WebRTC Mobile Video Streaming. Proceedings of the 2023 Workshop on Emerging Multimedia Systems, 54-59. Online publication date: 10-Sep-2023.
https://dl.acm.org/doi/10.1145/3609395.3610601
