Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
survey
Open access

Deep Neural Network–based Enhancement for Image and Video Streaming Systems: A Survey and Future Directions

Published: 04 October 2021 Publication History

Abstract

Internet-enabled smartphones and ultra-wide displays are transforming a variety of visual apps spanning from on-demand movies and 360°  videos to video-conferencing and live streaming. However, robustly delivering visual content under fluctuating networking conditions on devices of diverse capabilities remains an open problem. In recent years, advances in the field of deep learning on tasks such as super-resolution and image enhancement have led to unprecedented performance in generating high-quality images from low-quality ones, a process we refer to as neural enhancement. In this article, we survey state-of-the-art content delivery systems that employ neural enhancement as a key component in achieving both fast response time and high visual quality. We first present the components and architecture of existing content delivery systems, highlighting their challenges and motivating the use of neural enhancement models as a countermeasure. We then cover the deployment challenges of these models and analyze existing systems and their design decisions in efficiently overcoming these technical challenges. Additionally, we underline the key trends and common approaches across systems that target diverse use-cases. Finally, we present promising future directions based on the latest insights from deep learning research to further boost the quality of experience of content delivery systems.

References

[1]
Mohamed S. Abdelfattah, Łukasz Dudziak, Thomas Chau, Royson Lee, Hyeji Kim, and Nicholas D. Lane. 2020. Best of both worlds: AutoML codesign of a CNN and its hardware accelerator. In Proceedings of the Design Automation Conference (DAC’20).
[2]
Adnan Ahmed, Zubair Shafiq, and Amir Khakpour. 2016. QoE analysis of a large-scale live video streaming event. In Proceedings of the ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Science (SIGMETRICS’16). 395–396.
[3]
Namhyuk Ahn, Byungkon Kang, and Kyung-Ah Sohn. 2018. Fast, accurate, and lightweight super-resolution with cascading residual network. In Proceedings of the European Conference on Computer Vision (ECCV’18).
[4]
Zahaib Akhtar, Yun Seong Nam, Ramesh Govindan, Sanjay Rao, Jessica Chen, Ethan Katz-Bassett, Bruno Ribeiro, Jibin Zhan, and Hui Zhang. 2018. Oboe: Auto-tuning video ABR algorithms to network conditions. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication (SIGCOMM’18). 44–58.
[5]
Mario Almeida, Muhammad Bilal, Alessandro Finamore, Ilias Leontiadis, Yan Grunenberger, Matteo Varvello, and Jeremy Blackburn. 2018. CHIMP: Crowdsourcing human inputs for mobile phones. In Proceedings of the World Wide Web Conference (WWW’18). International World Wide Web Conferences Steering Committee, 45–54.
[6]
Mario Almeida, Stefanos Laskaridis, Ilias Leontiadis, Stylianos I. Venieris, and Nicholas D. Lane. 2019. EmBench: Quantifying performance variations of deep neural networks across modern commodity devices. In Proceedings of the 3rd International Workshop on Deep Learning for Mobile Systems and Applications (EMDL’19). 6.
[7]
Ghufran Baig, Jian He, Mubashir Adnan Qureshi, Lili Qiu, Guohai Chen, Peng Chen, and Yinliang Hu. 2019. Jigsaw: Robust live 4K video streaming. In Proceedings of the 25th International Conference on Mobile Computing and Networking (MobiCom’19).
[8]
S. Baker, D. Scharstein, J. Lewis, S. Roth, Michael J. Black, and R. Szeliski. 2011. A database and evaluation methodology for optical flow. Int. Journal of Computer Vision 92, 1 (2011), 1–31.
[9]
Sefi Bell-Kligler, Assaf Shocher, and Michal Irani. 2019. Blind super-resolution kernel estimation using an internal-GAN. In Proceedings of the International Conference on Advances in Neural Information Processing Systems.
[10]
Ibrahim Ben Mustafa, Tamer Nadeem, and Emir Halepovic. 2018. FlexStream: Towards flexible adaptive video streaming on end devices using extreme SDN. In Proceedings of the 26th ACM International Conference on Multimedia (MM’18). 555–563.
[11]
A. Bentaleb, B. Taani, A. C. Begen, C. Timmerer, and R. Zimmermann. 2019. A survey on bitrate adaptation schemes for streaming media over HTTP. IEEE Commun. Surv. Tutor. 21, 1 (2019), 562–585.
[12]
Marco Bevilacqua, Aline Roumy, Christine Guillemot, and Marie line Alberi Morel. 2012. Low-complexity single-image super-resolution based on nonnegative neighbor embedding. In Proceedings of the British Machine Vision Conference.
[13]
Yochai Blau and Tomer Michaeli. 2018. The perception-distortion tradeoff. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’18).
[14]
Dingding Cai, Ke Chen, Y. Qian, and J. Kämäräinen. 2019. Convolutional low-resolution fine-grained classification. Pattern Recog. Lett. 119 (2019), 166–171.
[15]
Christopher Canel, Thomas Kim, Giulio Zhou, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, and Subramanya R. Dulloor. 2019. Scaling video analytics on constrained edge nodes. In Proceedings of the Conference on Machine Learning and Systems (MLSys’19).
[16]
Kelvin C. K. Chan, Xintao Wang, Xiangyu Xu, Jinwei Gu, and Chen Change Loy. 2021. GLEAN: Generative latent bank for large-factor image super-resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’21).
[17]
Kelvin C. K. Chan, Xintao Wang, Ke Yu, Chao Dong, and Chen Change Loy. 2021. BasicVSR: The search for essential components in video super-resolution and beyond. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’21).
[18]
Chun-Ming Chang, Cheng-Hsin Hsu, Chih-Fan Hsu, and Kuan-Ta Chen. 2016. Performance measurements of virtual reality systems: Quantifying the timing and positioning accuracy. In Proceedings of the 24th ACM International Conference on Multimedia (MM’16). 655–659.
[19]
T. Chau, L. Dudziak, M. Abdelfattah, Royson Lee, H. Kim, and N. Lane. 2020. BRP-NAS: Prediction-based NAS using GCNs. In Proceedings of the International Conference on Advances in Neural Information Processing Systems (NeurIPS’20).
[20]
Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Haichen Shen, Meghan Cowan, Leyuan Wang, Yuwei Hu, Luis Ceze, Carlos Guestrin, and Arvind Krishnamurthy. 2018. TVM: An automated end-to-end optimizing compiler for deep learning. In Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI’18). 578–594.
[21]
T. Chen, L. Ravindranath, Shuo Deng, P. Bahl, and H. Balakrishnan. 2015. Glimpse: Continuous, real-time object recognition on mobile devices. In Proceedings of the SenSys Conference.
[22]
Cisco. 2020. Cisco Annual Internet Report (2018–2023) White Paper. Technical Report. Cisco Systems, Inc. Retrieved from https://www.cisco.com/c/en/us/solutions/collateral/executive-perspectives/annual-internet-report/white-paper-c11-741490.html.
[23]
Cisco. 2020. Cisco Visual Networking Index (VNI) Complete Forecast Update, 2017–2022. Technical Report. Cisco Systems, Inc. Retrieved from https://www.cisco.com/c/dam/m/en_us/network-intelligence/service-provider/digital-transformation/knowledge-network-webinars/pdfs/1213-business-services-ckn.pdf.
[24]
Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, and Bernt Schiele. 2016. The cityscapes dataset for semantic urban scene understanding. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16).
[25]
Daniel Crankshaw, Gur-Eyal Sela, Xiangxi Mo, Corey Zumar, Ion Stoica, Joseph Gonzalez, and Alexey Tumanov. 2020. InferLine: Latency-aware provisioning and scaling for prediction serving pipelines. In Proceedings of the 11th ACM Symposium on Cloud Computing (SoCC’20). 477–491.
[26]
Simon Da Silva, Sonia Ben Mokhtar, Stefan Contiu, Daniel Négru, Laurent Réveillère, and Etienne Rivière. 2019. PrivaTube: Privacy-preserving edge-assisted video streaming. In Proceedings of the 20th International Middleware Conference (Middleware’19). 189–201.
[27]
Jifeng Dai, Haozhi Qi, Y. Xiong, Y. Li, Guodong Zhang, H. Hu, and Y. Wei. 2017. Deformable convolutional networks.In Proceedings of the IEEE International Conference on Computer Vision (ICCV’17).
[28]
Mallesham Dasari, A. Bhattacharya, Santiago Vargas, Pranjal Sahu, A. Balasubramanian, and S. Das. 2020. Streaming 360-degree videos using super-resolution. In Proceedings of the IEEE Conference on Computer Communications (INFOCOM’20).
[29]
L. De Cicco, V. Caldaralo, V. Palmisano, and S. Mascolo. 2013. ELASTIC: A client-side controller for dynamic adaptive streaming over HTTP (DASH). In Proceedings of the 20th International Packet Video Workshop. 1–8.
[30]
Jonathan Deber, Ricardo Jota, Clifton Forlines, and Daniel Wigdor. 2015. How much faster is fast enough? User perception of latency & latency improvements in direct and indirect touch. In Proceedings of the 33rd ACM Conference on Human Factors in Computing Systems (CHI’15). 1827–1836.
[31]
Xin Deng. 2018. Enhancing image quality via style transfer for single image super-resolution. IEEE Sig. Process. Lett. 25, 4 (2018), 571–575.
[32]
Giorgos Dimopoulos, Ilias Leontiadis, Pere Barlet-Ros, and Konstantina Papagiannaki. 2016. Measuring video QoE from encrypted traffic. In Proceedings of the Internet Measurement Conference (IMC’16). 513–526.
[33]
Pradeep Dogga, Sandip Chakraborty, Subrata Mitra, and Ravi Netravali. 2019. Edge-based transcoding for adaptive live video streaming. In Proceedings of the 2nd USENIX Workshop on Hot Topics in Edge Computing (HotEdge’19). USENIX Association.
[34]
Chao Dong, Chen Change Loy, Kaiming He, and Xiaoou Tang. 2016. Image super-resolution using deep convolutional networks. IEEE Trans. Pattern Anal. Mach. Intell. 38, 2 (2016), 295–307.
[35]
Chao Dong, Chen Change Loy, and Xiaoou Tang. 2016. Accelerating the super-resolution convolutional neural network. In Proceedings of the European Computer on Computer Vision (ECCV’16).
[36]
Kuntai Du, Ahsan Pervaiz, Xin Yuan, Aakanksha Chowdhery, Qizheng Zhang, Henry Hoffmann, and J. Jiang. 2020. Server-driven video streaming for deep learning inference. In Proceedings of the SIGCOMM Conference.
[37]
Ching-Ling Fan, J. Lee, Wen-Chih Lo, C. Huang, Kuan-Ta Chen, and C. Hsu. 2017. Fixation prediction for 360 video streaming in head-mounted virtual reality. In Network and Operating System Support for Digital Audio and Video. Association for Computing Machinery, 67–72.
[38]
Biyi Fang, Xiao Zeng, and Mi Zhang. 2018. NestDNN: Resource-aware multi-tenant on-device deep learning for continuous mobile vision. In Proceedings of the 24th International Conference on Mobile Computing and Networking (MobiCom’18). 115–127.
[39]
M. Fiedler, T. Hossfeld, and P. Tran-Gia. 2010. A generic quantitative relationship between quality of experience and quality of service. IEEE Netw. 24, 2 (2010), 36–41.
[40]
Sadjad Fouladi, John Emmons, Emre Orbay, C. Wu, Riad S. Wahby, and Keith Winstein. 2018. Salsify: Low-latency network video through tighter integration between a video codec and a transport protocol. In Proceedings of the 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI’18).
[41]
M. Gadaleta, F. Chiariotti, M. Rossi, and A. Zanella. 2017. D-DASH: A deep q-learning framework for DASH video streaming. IEEE Trans. Cognit. Commun. Netw. 3, 4 (2017), 703–718.
[42]
Xitong Gao, Yiren Zhao, Łukasz Dudziak, Robert Mullins, and Cheng Zhong Xu. 2019. Dynamic channel pruning: Feature boosting and suppression. In Proceedings of the International Conference on Learning Representations.
[43]
C. Ge, N. Wang, G. Foster, and M. Wilson. 2017. Toward QoE-assured 4K video-on-demand delivery through mobile edge virtualization with adaptive prefetching. IEEE Trans. Multimedia 19, 10 (2017), 2222–2237.
[44]
Rafael C. Gonzalez and Richard E. Woods. 2008. Digital Image Processing. Prentice Hall, Upper Saddle River, N.J.
[45]
Ian Goodfellow et al. 2014. Generative adversarial nets. In Proceedings of the International Conference on Advances in Neural Information Processing Systems (NeurIPS’14).
[46]
Jinjin Gu, Hannan Lu, W. Zuo, and C. Dong. 2019. Blind super-resolution with iterative kernel correction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’19).
[47]
T. Guarnieri, I. Drago, A. B. Vieira, I. Cunha, and J. Almeida. 2017. Characterizing QoE in large-scale live streaming. In Proceedings of the IEEE Global Communications Conference (GLOBECOM’17). 1–7.
[48]
Seungyeop Han, Haichen Shen, Matthai Philipose, Sharad Agarwal, Alec Wolman, and Arvind Krishnamurthy. 2016. MCDNN: An approximation-based execution framework for deep stream processing under resource constraints. In Proceedings of the 14th International Conference on Mobile Systems, Applications, and Services (MobiSys’16).
[49]
Kaiming He, X. Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition.In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16).
[50]
Z. He, H. Huang, M. Jiang, Y. Bai, and G. Luo. 2018. FPGA-based real-time super-resolution system for ultra high definition videos. In Proceedings of the IEEE 26th International Symposium on Field-programmable Custom Computing Machines (FCCM’18). 181–188.
[51]
Timothy Hospedales, Antreas Antoniou, Paul Micaelli, and Amos Storkey. 2020. Meta-Learning in Neural Networks: A Survey. arXiv:cs.LG/2004.05439.
[52]
A. Howard, Mark Sandler, G. Chu, Liang-Chieh Chen, B. Chen, M. Tan, W. Wang, Y. Zhu, R. Pang, V. Vasudevan, Quoc V. Le, and H. Adam. 2019. Searching for mobileNetV3.In Proceedings of the International Conference on Computer Vision (ICCV’19).
[53]
Kevin Hsieh, Ganesh Ananthanarayanan, Peter Bodik, Shivaram Venkataraman, Paramvir Bahl, Matthai Philipose, Phillip B. Gibbons, and Onur Mutlu. 2018. Focus: Querying large video datasets with low latency and low cost. In Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation (OSDI’18). 269–286.
[54]
Pan Hu, Rakesh Misra, and Sachin Katti. 2019. Dejavu: Enhancing videoconferencing with prior knowledge. In Proceedings of the HotMobile Conference.
[55]
Weizhe Hua, Yuan Zhou, Christopher M. De Sa, Zhiru Zhang, and G. Edward Suh. 2019. Channel gating neural networks. In Proceedings of the International Conference on Advances in Neural Information Processing Systems (NeurIPS’19). 1886–1896.
[56]
Cheng Huang, Jin Li, and Keith W. Ross. 2007. Can internet video-on-demand be profitable? In Proceedings of the Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications (SIGCOMM’07). 133–144.
[57]
Gao Huang, Danlu Chen, Tianhong Li, Felix Wu, Laurens van der Maaten, and Kilian Weinberger. 2018. Multi-scale dense networks for resource efficient image classification. In Proceedings of theInternational Conference on Learning Representations (ICLR’18).
[58]
Gao Huang, Zhuang Liu, and K. Weinberger. 2017. Densely connected convolutional networks.In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17).
[59]
J. Huang, A. Singh, and N. Ahuja. 2015. Single image super-resolution from transformed self-exemplars. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’15).
[60]
Tianchi Huang, Rui-Xiao Zhang, Chao Zhou, and Lifeng Sun. 2018. QARC: Video quality aware rate control for real-time video streaming based on deep reinforcement learning. In Proceedings of the 26th ACM International Conference on Multimedia (MM’18). 1208–1216.
[61]
T. Huang, C. Zhou, X. Yao, R. X. Zhang, C. Wu, B. Yu, and L. Sun. 2020. Quality-aware neural adaptive video streaming with lifelong imitation learning. IEEE J. Select. Areas Commun. 38, 10 (2020), 2324–2342.
[62]
Te-Yuan Huang, Chaitanya Ekanadham, Andrew J. Berglund, and Zhi Li. 2019. Hindsight: Evaluate video bitrate adaptation at scale. In Proceedings of the 10th ACM Multimedia Systems Conference (MMSys’19). Association for Computing Machinery, New York, NY, 86–97.
[63]
Te-Yuan Huang, Ramesh Johari, Nick McKeown, Matthew Trunnell, and Mark Watson. 2014. A buffer-based approach to rate adaptation: Evidence from a large video streaming service. In Proceedings of the SIGCOMM Conference.
[64]
Zheng Hui, Xiumei Wang, and Xinbo Gao. 2018. Fast and accurate single image super-resolution via information distillation network. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’18).
[65]
Andrey Ignatov, Radu Timofte, Andrei Kulik, Seungsoo Yang, Ke Wang, Felix Baum, Max Wu, Lirong Xu, and Luc Van Gool. 2019. AI benchmark: All about deep learning on smartphones in 2019. In Proceedings of the International Conference on Computer Vision (ICCV’19) Workshops.
[66]
Eddy Ilg, N. Mayer, Tonmoy Saikia, Margret Keuper, A. Dosovitskiy, and T. Brox. 2017. FlowNet 2.0: Evolution of optical flow estimation with deep networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17).
[67]
S. Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In Proceedings of the International Conference on Machine Learning (ICML’15).
[68]
ITU. 2008. ITU-T Recommendation P.910. Subjective video quality assessment methods for multimedia applications. Retrieved from https://www.itu.int/rec/T-REC-P.910-200804-I.
[69]
Max Jaderberg, K. Simonyan, Andrew Zisserman, and K. Kavukcuoglu. 2015. Spatial transformer networks. In Proceedings of the International Conference on Advances in Neural Information Processing Systems.
[70]
Puneet Jain, Justin Manweiler, and Romit Roy Choudhury. 2015. OverLay: Practical mobile augmented reality. In Proceedings of the 13th International Conference on Mobile Systems, Applications, and Services (MobiSys’15).
[71]
J. Jiang, V. Sekar, and H. Zhang. 2014. Improving fairness, efficiency, and stability in HTTP-based adaptive video streaming with festive. IEEE/ACM Trans. Netw. 22, 1 (2014), 326–340.
[72]
Younghyun Jo, S. Oh, Jaeyeon Kang, and S. Kim. 2018. Deep video super-resolution network using dynamic upsampling filters without explicit motion compensation.In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[73]
Justin Johnson, Alexandre Alahi, and Li Fei-Fei. 2016. Perceptual losses for real-time style transfer and super-resolution. In Proceedings of the European Conference on Computer Vision (ECCV’16).
[74]
Daniel Kang, John Emmons, Firas Abuzaid, Peter Bailis, and Matei Zaharia. 2017. NoScope: Optimizing neural network queries over video at scale. Proc. VLDB Endow. 10, 11 (2017), 1586–1597.
[75]
Armin Kappeler, Seunghwan Yoo, Qiqin Dai, and A. Katsaggelos. 2016. Video super-resolution with convolutional neural networks. IEEE Trans. Computat. Imag. 2, 2 (2016), 109–122.
[76]
Yigitcan Kaya, Sanghyun Hong, and Tudor Dumitras. 2019. Shallow-deep networks: Understanding and mitigating network overthinking. In Proceedings of the International Conference on Machine Learning (ICML’19), Vol. 97. PMLR, 3301–3310.
[77]
Heewon Kim, Myungsub Choi, Bee Lim, and Kyoung Mu Lee. 2018. Task-aware image downscaling. In Proceedings of the European Conference on Computer Vision (ECCV’18).
[78]
Jiwon Kim, Jung Kwon Lee, and Kyoung Mu Lee. 2016. Accurate image super-resolution using very deep convolutional networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’16).
[79]
Jae-Hong Kim, Youngmok Jung, H. Yeo, Juncheol Ye, and D. Han. 2020. Neural-enhanced live streaming: Improving live video ingest via online learning. In Proceedings of the SIGCOMM Conference.
[80]
Y. Kim, J. Choi, and M. Kim. 2018. A real-time convolutional neural network for super-resolution on FPGA with applications to 4K UHD 60 fps video services. IEEE Trans. Circ. Syst. Vid. Technol. 29, 8 (2018), 2521–2534.
[81]
G. T. Kolyvas, S. E. Polykalas, and I. S. Venieris. 1997. Performance evaluation of intelligent signaling servers for broadband multimedia networks. In Proceedings 2nd IEEE Symposium on Computer and Communications (ICC’97). 96–103.
[82]
A. Kouris, S. I. Venieris, and C. Bouganis. 2020. A throughput-latency co-optimised cascade of convolutional neural network classifiers. In Proceedings of the Design, Automation Test in Europe Conference Exhibition (DATE’20). 1656–1661.
[83]
A. Kouris, S. I. Venieris, and C. S. Bouganis. 2018. CascadeCNN: Pushing the performance limits of quantisation in convolutional neural networks. In Proceedings of the 28th International Conference on Field Programmable Logic and Applications (FPL’18).
[84]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. ImageNet classification with deep convolutional neural networks. In Proceedings of the International Conference on Advances in Neural Processing Systems (NeurIPS’12).
[85]
N. D. Lane, S. Bhattacharya, A. Mathur, P. Georgiev, C. Forlivesi, and F. Kawsar. 2017. Squeezing deep learning into mobile and embedded devices. IEEE Pervas. Comput. 16, 3 (2017), 82–88.
[86]
Stefanos Laskaridis, Stylianos I. Venieris, Mario Almeida, Ilias Leontiadis, and Nicholas D. Lane. 2020. SPINN: Synergistic progressive inference of neural networks over device and cloud. In Proceedings of the 26th International Conference on Mobile Computing and Networking (MobiCom’20).
[87]
Stefanos Laskaridis, Stylianos I. Venieris, Hyeji Kim, and Nicholas D. Lane. 2020. HAPI: Hardware-aware progressive inference. In Proceedings of the IEEE/ACM International Conference on Computer-Aided Design (ICCAD’20).
[88]
C. Ledig, L. Theis, Ferenc Huszár, J. Caballero, Andrew Aitken, Alykhan Tejani, J. Totz, Zehan Wang, and W. Shi. 2017. Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’17).
[89]
J. Lee, J. Lee, and H. J. Yoo. 2020. SRNPU: An energy-efficient CNN-based super-resolution processor with tile-based selective super-resolution in mobile devices. IEEE J. Emerg. Select. Topics Circ. Syst. 10, 3 (2020), 320–334.
[90]
Kyungjin Lee, Juheon Yi, Youngki Lee, Sunghyun Choi, and Young Min Kim. 2020. GROOT: A real-time streaming system of high-fidelity volumetric videos. In Proceedings of the 26th International Conference on Mobile Computing and Networking (MobiCom’20).
[91]
Royson Lee, L. Dudziak, M. Abdelfattah, Stylianos I. Venieris, H. Kim, Hongkai Wen, and N. Lane. 2020. Journey towards tiny perceptual super-resolution. In Proceedings of the European Conference on Computer Vision (ECCV’20).
[92]
Royson Lee, Stylianos I. Venieris, L. Dudziak, S. Bhattacharya, and N. Lane. 2019. MobiSR: Efficient on-device super-resolution through heterogeneous mobile processors. In Proceedings of the 25th International Conference on Mobile Computing and Networking (MobiCom’19).
[93]
Huixia Li, Chenqian Yan, Shaohui Lin, Xiawu Zheng, B. Zhang, Fan Yang, and Rongrong Ji. 2020. PAMS: Quantized super-resolution via parameterized max scale. In Proceedings of the European Conference on Computer Vision.
[94]
X. Li, M. A. Salehi, M. Bayoumi, N. Tzeng, and R. Buyya. 2018. Cost-efficient and robust on-demand video transcoding using heterogeneous cloud services. IEEE Trans. Parallel Distrib. Syst. 29, 3 (2018), 556–571.
[95]
Y. Li, D. Liu, H. Li, Lianghuan Li, Z. Li, and F. Wu. 2019. Learning a convolutional neural network for image compact-resolution. IEEE Trans. Image Process. 28, 3 (2019), 1092–1107.
[96]
Yule Li, J. Shi, and D. Lin. 2018. Low-latency video semantic segmentation.In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’18).
[97]
Yuheng Li, Yiping Zhang, and Ruixi Yuan. 2011. Measurement and analysis of a large scale commercial mobile internet TV system. In Proceedings of the ACM SIGCOMM Conference on Internet Measurement Conference (IMC’11). 209–224.
[98]
Z. Li, M. A. Kaafar, K. Salamatian, and G. Xie. 2017. Characterizing and modeling user behavior in a large-scale mobile live streaming system. IEEE Trans. Circ. Syst. Vid. Technol. 27, 12 (2017), 2675–2686.
[99]
Zhenyu Li, Jiali Lin, Marc-Ismael Akodjenou, Gaogang Xie, Mohamed Ali Kaafar, Yun Jin, and Gang Peng. 2012. Watching videos from everywhere: A study of the PPTV mobile VoD system. In Proceedings of the Internet Measurement Conference (IMC’12). 185–198.
[100]
Z. Li, X. Zhu, J. Gahm, R. Pan, H. Hu, A. C. Begen, and D. Oran. 2014. Probe and adapt: Rate adaptation for HTTP video streaming at scale. IEEE J. Select. Areas Commun. 32, 4 (2014), 719–733.
[101]
Li-Shen Juhn and Li-Ming Tseng. 1997. Harmonic broadcasting for video-on-demand service. IEEE Trans. Broadcast. 43, 3 (1997), 268–271.
[102]
Renjie Liao, X. Tao, R. Li, Z. Ma, and J. Jia. 2015. Video super-resolution via deep draft-ensemble learning.In Proceedings of the IEEE International Conference on Computer Vision (ICCV’15).
[103]
Melissa Licciardello, Maximilian Grüner, and Ankit Singla. 2020. Understanding video streaming algorithms in the wild. In Passive and Active Measurement (PAM), Anna Sperotto, Alberto Dainotti, and Burkhard Stiller (Eds.). Springer International Publishing, Cham, 298–313.
[104]
Bee Lim, Sanghyun Son, Heewon Kim, Seungjun Nah, and Kyoung Mu Lee. 2017. Enhanced deep residual networks for single image super-resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW’17).
[105]
H. Liu, Zhubo Ruan, Peng Zhao, F. Shang, Linlin Yang, and Yuanyuan Liu. 2020. Video super resolution based on deep learning: A comprehensive survey. ArXiv abs/2007.12928 (2020).
[106]
Luyang Liu, Hongyu Li, and Marco Gruteser. 2019. Edge assisted real-time object detection for mobile augmented reality. In Proceedings of the 25th International Conference on Mobile Computing and Networking (MobiCom’19).
[107]
Luyang Liu, Ruiguang Zhong, Wuyang Zhang, Yunxin Liu, Jiansong Zhang, Lintao Zhang, and Marco Gruteser. 2018. Cutting the cord: Designing a high-quality untethered VR system with low latency remote rendering. In Proceedings of the 16th International Conference on Mobile Systems, Applications, and Services (MobiSys’18). 68–80.
[108]
Xin Liu, Yuang Li, Josh Fromm, Yuntao Wang, Ziheng Jiang, Alex Mariakakis, and Shwetak Patel. 2021. SplitSR: An end-to-end approach to super-resolution on mobile devices. Proc. ACM Interact. Mob. Wear. Ubiq. Technol. 5, 1 (2021), 20 pages.
[109]
Wen-Chih Lo, Ching-Ling Fan, Jean Lee, Chun-Ying Huang, Kuan-Ta Chen, and Cheng-Hsin Hsu. 2017. 360° video viewing dataset in head-mounted virtual reality. In Proceedings of the 8th ACM on Multimedia Systems Conference (MMSys’17). 211–216.
[110]
Jonathan Long, Evan Shelhamer, and Trevor Darrell. 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’15). 3431–3440.
[111]
Cheng Ma, Yongming Rao, Yean Cheng, Ce Chen, Jiwen Lu, and J. Zhou. 2020. Structure-preserving super resolution with gradient guidance. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20).
[112]
Ningning Ma, X. Zhang, Hai-Tao Zheng, and Jian Sun. 2018. ShuffleNet V2: Practical guidelines for efficient CNN architecture design. In Proceedings of the European Conference on Computer Vision (ECCV’18).
[113]
Y. Ma, Hongyu Xiong, Zhe Hu, and L. Ma. 2019. Efficient super resolution using binarized neural network.In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW’19).
[114]
Hongzi Mao, R. Netravali, and M. Alizadeh. 2017. Neural adaptive video streaming with pensieve.In Proceedings of the SIGCOMM Conference.
[115]
D. Martin, C. Fowlkes, D. Tal, and J. Malik. 2001. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’01).
[116]
Roey Mechrez, I. Talmi, Firas Shama, and L. Zelnik-Manor. 2018. Maintaining natural image statistics with the contextual loss. In Proceedings of the Asian Conference on Computer Vision.
[117]
Anish Mittal, Rajiv Soundararajan, and Alan C. Bovik. 2013. Making a “completely blind” image quality analyzer. IEEE Sig. Process. Lett. 20, 3 (2013), 209–212.
[118]
R. K. P. Mok, E. W. W. Chan, and R. K. C. Chang. 2011. Measuring the quality of experience of HTTP video streaming. In Proceedings of the 12th IFIP/IEEE International Symposium on Integrated Network Management (IM’11) and Workshops. 485–492.
[119]
Vinod Nair and Geoffrey E. Hinton. 2010. Rectified linear units improve restricted Boltzmann machines. In Proceedings of the 27th International Conference on International Conference on Machine Learning (ICML’10). Omnipress, 807–814.
[120]
NVIDIA. 2020. NVIDIA Maxine - Cloud-AI Video-Streaming Platform. Retrieved from https://developer.nvidia.com/maxine.
[121]
Seobin Park, Jinsu Yoo, Donghyeon Cho, Jiwon Kim, and Tae Hyun Kim. 2020. Fast adaptation to super-resolution networks via meta-learning. In Proceedings of the European Conference on Computer Vision (ECCV’20).
[122]
K. Piamrat, C. Viho, J. Bonnin, and A. Ksentini. 2009. Quality of experience measurements for video streaming over wireless networks. In Proceedings of the 6th International Conference on Information Technology: New Generations (ITNG’09). 1184–1189.
[123]
F. Qian, B. Han, Qingyang Xiao, and V. Gopalakrishnan. 2018. Flare: Practical viewport-adaptive 360-degree video streaming for mobile devices.In Proceedings of the 24th International Conference on Mobile Computing and Networking (MobiCom’18).
[124]
Parthasarathy Ranganathan, Daniel Stodolsky, Jeff Calow, Jeremy Dorfman, Marisabel Guevara, Clinton Wills Smullen IV, Aki Kuusela, Raghu Balasubramanian, Sandeep Bhatia, Prakash Chauhan, Anna Cheung, In Suk Chong, Niranjani Dasharathi, Jia Feng, Brian Fosco, Samuel Foss, Ben Gelb, Sara J. Gwin, Yoshiaki Hase, Da-ke He, C. Richard Ho, Roy W. Huffman Jr., Elisha Indupalli, Indira Jayaram, Poonacha Kongetira, Cho Mon Kyaw, Aaron Laursen, Yuan Li, and Fong Lou. 2021. Warehouse-scale video acceleration: Co-design and deployment in the wild. In Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS’21). 600–615.
[125]
A. Ranjan and Michael J. Black. 2017. Optical flow estimation using a spatial pyramid network.In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17).
[126]
Leonid I. Rudin, Stanley Osher, and Emad Fatemi. 1992. Nonlinear total variation based noise removal algorithms. Phys. D 60, 1–4 (Nov. 1992), 259–268.
[127]
Mehdi S. M. Sajjadi, Raviteja Vemulapalli, and M. Brown. 2018. Frame-recurrent video super-resolution.In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[128]
Haichen Shen, Seungyeop Han, Matthai Philipose, and Arvind Krishnamurthy. 2017. Fast video classification via adaptive cascading of deep models. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17).
[129]
W. Shi, J. Caballero, Ferenc Huszár, J. Totz, A. Aitken, R. Bishop, D. Rueckert, and Zehan Wang. 2016. Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network.In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16).
[130]
Assaf Shocher, N. Cohen, and M. Irani. 2018. “Zero-shot” super-resolution using deep internal learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’18).
[131]
Laurent Sifre and Stéphane Mallat. 2014. Rigid-motion scattering for image classification author. English. Supervisor: Prof. Stéphane Mallat. Ph. D. Thesis. Ecole Polytechnique.
[132]
Jae Woong Soh, Sunwoo Cho, and N. I. Cho. 2020. Meta-transfer learning for zero-shot super-resolution.In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20).
[133]
Jae Woong Soh, G. Y. Park, Junho Jo, and N. I. Cho. 2019. Natural and realistic single image super-resolution with explicit natural manifold discrimination. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’19).
[134]
Dehua Song, Chang Xu, Xu Jia, Yiyi Chen, Chunjing Xu, and Yunhe Wang. 2020. Efficient residual dense block search for image super-resolution. In Proceedings of the Association for the Advancement of Artificial Intelligence Conference (AAAI’20).
[135]
J. Song, Y. Cho, J. Park, J. Jang, S. Lee, J. Song, J. Lee, and I. Kang. 2019. 7.1 An 11.5TOPS/W 1024-MAC butterfly structure dual-core sparsity-aware neural processing unit in 8nm flagship mobile SoC. In Proceedings of the IEEE International Solid-State Circuits Conference (ISSCC’19). 130–132.
[136]
Kevin Spiteri, Ramesh Sitaraman, and Daniel Sparacio. 2018. From theory to practice: Improving bitrate adaptation in the DASH reference player. In Proceedings of the 9th ACM Multimedia Systems Conference (MMSys’18). 123–137.
[137]
K. Spiteri, R. Urgaonkar, and R. K. Sitaraman. 2016. BOLA: Near-optimal bitrate adaptation for online videos. In Proceedings of the 35th IEEE International Conference on Computer Communications. 1–9.
[138]
Thomas Stockhammer. 2011. Dynamic adaptive streaming over HTTP –: Standards and design principles. In Proceedings of the 2nd ACM Conference on Multimedia Systems (MMSys’11). 133–144.
[139]
Deqing Sun, X. Yang, Ming-Yu Liu, and J. Kautz. 2018. PWC-Net: CNNs for optical flow using pyramid, warping, and cost volume. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’18).
[140]
Yi Sun, Xiaoqi Yin, Junchen Jiang, Vyas Sekar, Fuyuan Lin, Nanshu Wang, Tao Liu, and Bruno Sinopoli. 2016. CS2P: Improving video bitrate selection and adaptation with data-driven throughput prediction. In Proceedings of the ACM SIGCOMM Conference (SIGCOMM’16). 272–285.
[141]
X. Tao, H. Gao, Renjie Liao, J. Wang, and J. Jia. 2017. Detail-revealing deep video super-resolution.In Proceedings of the IEEE International Conference on Computer Vision (ICCV’17).
[142]
Surat Teerapittayanon, Bradley McDanel, and H. T. Kung. 2016. BranchyNet: Fast inference via early exiting from deep neural networks. In Proceedings of the 23rd International Conference on Pattern Recognition (ICPR’16). 2464–2469.
[143]
Yapeng Tian, Yulun Zhang, Yun Fu, and Chenliang Xu. 2020. TDAN: Temporally-deformable alignment network for video super-resolution.In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20).
[144]
R. Timofte, Eirikur Agustsson, L. Gool, M. Yang, Lei Zhang, Bee Lim, Sanghyun Son, Heewon Kim, Seungjun Nah, K. Lee, Xintao Wang, Yapeng Tian, K. Yu, Yulun Zhang, Shixiang Wu, C. Dong, L. Lin, Y. Qiao, C. C. Loy, W. Bae, Jae Jun Yoo, Yoseob Han, J. C. Ye, Jae-Seok Choi, M. Kim, Yuchen Fan, J. Yu, Wei Han, Ding Liu, Haichao Yu, Zhangyang Wang, Humphrey Shi, X. Wang, T. Huang, Yunjin Chen, Kai Zhang, W. Zuo, Z. Tang, Linkai Luo, S. Li, M. Fu, L. Cao, Wen Heng, G. Bui,Truc Le, Ye Duan, D. Tao, Ruxin Wang, Xu Lin, Jianxin Pang, Jinchang Xu, Y. Zhao, Xiangyu Xu, Jin shan Pan, Deqing Sun, Y. Zhang, X. Song, Yuchao Dai, X. Qin, X. Huynh, Tiantong Guo, H. Mousavi, T. Vu, V. Monga, C. Cruz, K. Egiazarian, V. Katkovnik, Rakesh Mehta, A. Jain, Abhinav Agarwalla, Ch V., Sai Praveen, Ruofan Zhou, Hongdiao Wen, C. Zhu, Zhiqiang Xia, Z. Wang, and Q. Guo. 2017. NTIRE 2017 challenge on single image super-resolution: Methods and results.In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW’17).
[145]
S. I. Venieris and C. S. Bouganis. 2019. fpgaConvNet: Mapping regular and irregular convolutional neural networks on FPGAs. IEEE Trans. Neural Netw. Learn. Syst. 30, 2 (2019), 326–342.
[146]
Stylianos I. Venieris, Alexandros Kouris, and Christos-Savvas Bouganis. 2018. Deploying deep neural networks in the embedded space. In Proceedings of the 2nd International Workshop on Embedded and Mobile Deep Learning (EMDL’18).
[147]
Stylianos I. Venieris, Alexandros Kouris, and Christos-Savvas Bouganis. 2018. Toolflows for mapping convolutional neural networks on FPGAs: A survey and future directions. ACM Comput. Surv. 51, 3 (June 2018).
[148]
Xintao Wang, Kelvin C. K. Chan, K. Yu, C. Dong, and Chen Change Loy. 2019. EDVR: Video restoration with enhanced deformable convolutional networks.In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW’19).
[149]
Xin Wang, Fisher Yu, Zi-Yi Dou, Trevor Darrell, and Joseph E. Gonzalez. 2018. SkipNet: Learning dynamic routing in convolutional networks. In Proceedings of the European Conference on Computer Vision (ECCV’18). 409–424.
[150]
Xintao Wang, K. Yu, C. Dong, X. Tang, and Chen Change Loy. 2019. Deep network interpolation for continuous imagery effect transition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’19).
[151]
Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, Chen Change Loy, Yu Qiao, and Xiaoou Tang. 2018. ESRGAN: Enhanced super-resolution generative adversarial networks. In Proceedings of the European Conference on Computer Vision Workshops (ECCVW’18).
[152]
Yiding Wang, Weiyan Wang, Junxue Zhang, J. Jiang, and K. Chen. 2019. Bridging the edge-cloud barrier for real-time advanced vision analytics. In Proceedings of the HotCloud Conference.
[153]
Yulong Wang, Xiaolu Zhang, Xiaolin Hu, Bo Zhang, and Hang Su. 2020. Dynamic network pruning with interpretable layerwise channel selection. In Proceedings of the AAAI Conference. 6299–6306.
[154]
Zizhao Wang, Wei Bao, Dong Yuan, Liming Ge, Nguyen H. Tran, and Albert Y. Zomaya. 2019. SEE: Scheduling early exit for mobile DNN inference during service outage. In Proceedings of the 22nd International ACM Conference on Modeling, Analysis and Simulation of Wireless and Mobile Systems (MSWIM’19). 279–288.
[155]
Zhou Wang, Alan C. Bovik, Hamid R. Sheikh, and Eero P. Simoncelli. 2004. Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13, 4 (2004), 600–612.
[156]
Zhihao Wang, Jian Chen, and S. Hoi. 2020. Deep learning for image super-resolution: A survey. IEEE Trans. Pattern Anal. Mach. Intell. (2020).
[157]
C. Wu, D. Brooks, K. Chen, D. Chen, S. Choudhury, M. Dukhan, K. Hazelwood, E. Isaac, Y. Jia, B. Jia, T. Leyvand, H. Lu, Y. Lu, L. Qiao, B. Reagen, J. Spisak, F. Sun, A. Tulloch, P. Vajda, X. Wang, Y. Wang, B. Wasti, Y. Wu, R. Xian, S. Yoo, and P. Zhang. 2019. Machine learning at Facebook: Understanding inference at the edge. In Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA’19). 331–344.
[158]
X. Xiang, Yapeng Tian, Yulun Zhang, Y. Fu, J. Allebach, and Chenliang Xu. 2020. Zooming slow-mo: Fast and accurate one-stage space-time video super-resolution.In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20).
[159]
Mingqing Xiao, Shuxin Zheng, Chang Liu, Yaolong Wang, Di He, Guolin Ke, Jiang Bian, Zhouchen Lin, and Tie-Yan Liu. 2020. Invertible image rescaling. In Proceedings of the European Conference on Computer Vision (ECCV’20).
[160]
Jingwei Xin, Nannan Wang, Xinrui Jiang, Jie Li, Heng Huang, and Xinbo Gao. 2020. Binarized neural network for single image super resolution. In Proceedings of the European Conference on Computer Vision (ECCV’20).
[161]
Qunliang Xing, Mai Xu, Tianyi Li, and Zhenyu Guan. 2020. Early exit or not: Resource-efficient blind quality enhancement for compressed images. In Proceedings of the European Conference on Computer Vision (ECCV’20). Springer.
[162]
Jianchao Yang, John Wright, Thomas S. Huang, and Yi Ma. 2010. Image super-resolution via sparse representation. IEEE Trans. Image Process. 19, 11 (2010), 2861–2873.
[163]
Wenming Yang, X. Zhang, Yapeng Tian, W. Wang, Jing-Hao Xue, and Qingmin Liao. 2019. Deep learning for single image super-resolution: A brief review. IEEE Trans. Multimedia 21, 12 (2019), 3106–3121.
[164]
Hyunho Yeo, Chan Ju Chong, Youngmok Jung, Juncheol Ye, and Dongsu Han. 2020. NEMO: Enabling neural-enhanced video streaming on commodity mobile devices. In Proceedings of the 26th International Conference on Mobile Computing and Networking (MobiCom’20).
[165]
H. Yeo, Sunghyun Do, and D. Han. 2017. How will deep learning change internet video delivery? In Proceedings of the HotNets Conference.
[166]
H. Yeo, Youngmok Jung, Jaehong Kim, Jinwoo Shin, and D. Han. 2018. Neural adaptive content-aware internet video delivery. In Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI’18).
[167]
Junheon Yi, Seongwon Kim, Joongheon Kim, and Sunghyun Choi. 2020. Supremo: Cloud-assisted low-latency super-resolution in mobile devices. IEEE Trans. Mob. Comput. (2020).
[168]
Xiaoqi Yin, A. Jindal, V. Sekar, and B. Sinopoli. 2015. A control-theoretic approach for dynamic adaptive video streaming over HTTP. In Proceedings of the SIGCOMM Conference.
[169]
Hongliang Yu, Dongdong Zheng, Ben Y. Zhao, and Weimin Zheng. 2006. Understanding user behavior in large-scale video-on-demand systems. In Proceedings of the 1st ACM SIGOPS/EuroSys European Conference on Computer Systems (EuroSys’06). 333–344.
[170]
E. Zakharov, Aleksei Ivakhnenko, Aliaksandra Shysheya, and V. Lempitsky. 2020. Fast bi-layer neural synthesis of one-shot realistic head avatars. In Proceedings of the European Conference on Computer Vision (ECCV’20).
[171]
Haoyu Zhang, Ganesh Ananthanarayanan, Peter Bodik, Matthai Philipose, Paramvir Bahl, and Michael J. Freedman. 2017. Live video analytics at scale with approximation and delay-tolerance. In Proceedings of the 14th USENIX Conference on Networked Systems Design and Implementation (NSDI’17). 377–392.
[172]
Huanhuan Zhang, Anfu Zhou, Jiamin Lu, Ruoxuan Ma, Yuhan Hu, Cong Li, Xinyu Zhang, Huadong Ma, and Xiaojiang Chen. 2020. OnRL: Improving mobile video telephony via online reinforcement learning. In Proceedings of the 26th International Conference on Mobile Computing and Networking (MobiCom’20).
[173]
Linfeng Zhang, Zhanhong Tan, Jiebo Song, Jingwei Chen, Chenglong Bao, and Kaisheng Ma. 2019. SCAN: A scalable neural networks framework towards compact and efficient models. In Proceedings of the International Conference on Advances in Neural Information Processing Systems (NeurIPS’19). 4027–4036.
[174]
Richard Zhang, Phillip Isola, Alexei A. Efros, Eli Shechtman, and Oliver Wang. 2018. The unreasonable effectiveness of deep features as a perceptual metric. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’18).
[175]
Yulun Zhang, Kunpeng Li, Kai Li, Lichen Wang, Bineng Zhong, and Yun Fu. 2018. Image super-resolution using very deep residual channel attention networks. In Proceedings of the European Conference on Computer Vision (ECCV’18).
[176]
Hengshuang Zhao, Xiaojuan Qi, Xiaoyong Shen, J. Shi, and J. Jia. 2018. ICNet for real-time semantic segmentation on high-resolution images. In Proceedings of the European Conference on Computer Vision (ECCV’18).
[177]
C. Zhu, X. Lin, and Lap-Pui Chau. 2002. Hexagon-based search pattern for fast block motion estimation. IEEE Trans. Circ. Syst. Vid. Technol. 12, 5 (2002), 349–355.
[178]
X. Zhu, H. Hu, Stephen Lin, and Jifeng Dai. 2019. Deformable ConvNets V2: More deformable, better results.In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’19).
[179]
Barret Zoph and Quoc V. Le. 2017. Neural architecture search with reinforcement learning. In Proceedings of the International Conference on Learning Representations (ICLR’17).
[180]
Liad Pollak Zuckerman, S. Bagon, Eyal Naor, George Pisha, and M. Irani. 2020. Across scales & across dimensions: Temporal super-resolution using deep internal learning. In Proceedings of the European Conference on Computer Vision (ECCV’20).

Cited By

View all
  • (2024)Exploring the Benefits of Virtual Reality (VR) in Manufacturing TrainingEmerging Technologies in Digital Manufacturing and Smart Factories10.4018/979-8-3693-0920-9.ch012(205-221)Online publication date: 23-Feb-2024
  • (2024)Meta-Learned Kernel For Blind Super-Resolution Kernel Estimation2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00152(1485-1494)Online publication date: 3-Jan-2024
  • (2024)Generative AI-Enabled Mobile Tactical Multimedia Networks: Distribution, Generation, and PerceptionIEEE Communications Magazine10.1109/MCOM.003.230064562:10(96-102)Online publication date: Oct-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Computing Surveys
ACM Computing Surveys  Volume 54, Issue 8
November 2022
754 pages
ISSN:0360-0300
EISSN:1557-7341
DOI:10.1145/3481697
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 October 2021
Accepted: 01 May 2021
Revised: 01 May 2021
Received: 01 October 2020
Published in CSUR Volume 54, Issue 8

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Deep learning
  2. content delivery networks
  3. distributed systems

Qualifiers

  • Survey
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1,359
  • Downloads (Last 6 weeks)153
Reflects downloads up to 02 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Exploring the Benefits of Virtual Reality (VR) in Manufacturing TrainingEmerging Technologies in Digital Manufacturing and Smart Factories10.4018/979-8-3693-0920-9.ch012(205-221)Online publication date: 23-Feb-2024
  • (2024)Meta-Learned Kernel For Blind Super-Resolution Kernel Estimation2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00152(1485-1494)Online publication date: 3-Jan-2024
  • (2024)Generative AI-Enabled Mobile Tactical Multimedia Networks: Distribution, Generation, and PerceptionIEEE Communications Magazine10.1109/MCOM.003.230064562:10(96-102)Online publication date: Oct-2024
  • (2024)Time-series Initialization and Conditioning for Video-agnostic Stabilization of Video Super-Resolution using Recurrent Networks2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10651401(1-8)Online publication date: 30-Jun-2024
  • (2024)Real-Time Context-Aware Early Filtering for High-Definition Video Analytics on Commodity Edge Devices Using GenAI for Data AugmentationIEEE Access10.1109/ACCESS.2024.352080712(194728-194749)Online publication date: 2024
  • (2024)The Application of Knowledge Graph Convolutional Network-Based Film and Television Interaction Under Artificial IntelligenceIEEE Access10.1109/ACCESS.2024.345944812(132127-132138)Online publication date: 2024
  • (2024)HVASR: Enhancing 360-Degree Video Delivery with Viewport-Aware Super ResolutionInformation Sciences10.1016/j.ins.2024.121609(121609)Online publication date: Nov-2024
  • (2024)Efficient Recurrent Real Video RestorationDigital Signal Processing10.1016/j.dsp.2024.104851(104851)Online publication date: Nov-2024
  • (2024)Super-resolution with perceptual quality for improved live streaming delivery on edge computingComputer Networks: The International Journal of Computer and Telecommunications Networking10.1016/j.comnet.2024.110463248:COnline publication date: 1-Jun-2024
  • (2023)Deep Learning-Based Image Enhancement Techniques for Maritime Video in Storage and Transmission Systems: A Research StudyJOURNAL OF BROADCAST ENGINEERING10.5909/JBE.2023.28.4.41028:4(410-428)Online publication date: 31-Jul-2023
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Full Access

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media