Abstract
360\(^{\circ }\) videos have become increasingly popular recently, but consume much more bandwidth than non-360\(^{\circ }\) videos. Usually, 360\(^{\circ }\) video streaming partitions the video surface into multiple tiles and encodes the tiles independently to effectively and flexibly use limited link bandwidth. However, current bitrate adaptive algorithms generally aim to maximize the bitrate, rather than perceptual quality, resulting in degradation of user experience. More importantly, we reveal that the distribution of tile size is very skewed, that is, a small number of large tiles consumes more bandwidth than a large number of small tiles, further hurting the overall viewing quality. Therefore, in this paper, we propose a tile-size aware bitrate allocation scheme TSA for adaptive 360\(^{\circ }\) video streaming to improve the viewing experience of users. Specifically, TSA cautiously decreases the quality of a few large tiles to allocate more bandwidth to large number of small tiles, thus improving the perceptual quality of overall video, without sacrificing large tiles excessively. Experiments over real-world datasets show that TSA effectively improves V-VMAF by up to 39% compared with several state-of-the-art adaptive bitrate algorithms.
Similar content being viewed by others
References
Qian F, Ji L, Han B, Gopalakrishnan V (2016) Optimizing 360 video delivery over cellular networks. In: Proceedings of the Workshop on All Things Cellular: Operations, Applications and Challenges, pp 1–6
Petrangeli S, Swaminathan V, Hosseini M, De Turck F (2017) An http/2-based adaptive streaming framework for 360 virtual reality videos. In: Proceedings of the ACM international conference on Multimedia, pp 306–314
Xie L, Xu Z, Ban Y, Zhang X, Guo Z (2017) 360probdash: Improving qoe of 360 video streaming using tile-based http adaptive streaming. In: Proceedings of the ACM international conference on Multimedia, pp 315–323
Xie L, Zhang X, Guo Z (2018) Cls: A cross-user learning based system for improving qoe in 360-degree video adaptive streaming. In: Proceedings of the ACM international conference on Multimedia, pp 564–572
Graf M, Timmerer C, Mueller C (2017) Towards bandwidth efficient adaptive streaming of omnidirectional video over http: Design, implementation, and evaluation. In: Proceedings of the ACM on Multimedia Systems Conference, pp 261–271
Dasari M, Bhattacharya A, Vargas S, Sahu P, Balasubramanian A, Das SR (2020) Streaming 360-degree videos using super-resolution. In: Proceedings of the IEEE International Conference on Computer Communications, pp 1977–1986
Qian F, Han B, Xiao Q, Gopalakrishnan V (2018) Flare: Practical viewport-adaptive 360-degree video streaming for mobile devices. In: Proceedings of the Annual International Conference on Mobile Computing and Networking, pp 99–114
Chopra L, Chakraborty S, Mondal A, Chakraborty S (2021) Parima: Viewport adaptive 360-degree video streaming. In: Proceedings of the Web Conference, pp 2379–2391
Yadav PK, Ooi WT (2020) Tile rate allocation for 360-degree tiled adaptive video streaming. In: Proceedings of the ACM International Conference on Multimedia, pp 3724–3733
Sodagar I (2011) The mpeg-dash standard for multimedia streaming over the internet. IEEE Multimedia 18(4):62–67
Duanmu Z, Rehman A, Wang Z (2018) A quality-of-experience database for adaptive video streaming. IEEE Trans Broadcast 64(2):474–487
Fan CL, Lee J, Lo WC, Huang CY, Chen KT, Hsu CH (2017) Fixation prediction for 360 video streaming in head-mounted virtual reality. In: Proceedings of the Workshop on Network and Operating Systems Support for Digital Audio and Video, pp 67–72
Yin X, Jindal A, Sekar V, Sinopoli B (2015) A control-theoretic approach for dynamic adaptive video streaming over HTTP. In: Proceedings of the ACM conference on SIGCOMM, pp 325–338
Wu C, Tan Z, Wang Z, Yang S (2017) A dataset for exploring user behaviors in VR spherical video streaming. In: Proceedings of the ACM on Multimedia Systems Conference, pp 193–198
FFmpeg. http://ffmpeg.org
4G/LTE Bandwidth Logs. http://users.ugent.be/~jvdrhoof/dataset-4g/
FCC. https://www.fcc.gov/
Mahimahi. http://mahimahi.mit.edu
Snyder JP (1997) Flattening the earth: two thousand years of map projections. University of Chicago Press
Akhtar Z, Nam YS, Govindan R, Rao S, Chen J, Katz-Bassett E, Zhang H (2018) Oboe: Auto-tuning video ABR algorithms to network conditions. In: Proceedings of the Conference of the ACM Special Interest Group on Data Communication, pp 44–58
Niamut OA, Thomas E, D’Acunto L, Concolato C, Denoual F, Lim SY (2016) MPEG DASH SRD: spatial relationship description. In: Proceedings of the International Conference on Multimedia Systems, pp 1–8
Xiao M, Zhou C, Liu Y, Chen S (2017) Optile: Toward optimal tiling in 360-degree video streaming. In: Proceedings of the ACM international conference on Multimedia, pp 708–716
Guan Y, Zheng C, Zhang X, Guo Z, Jiang J (2019) Pano: Optimizing 360 video streaming with a better understanding of quality perception. In: Proceedings of the ACM conference on SIGCOMM, pp 394–407
He J, Qureshi MA, Qiu L, Li J, Li F, Han L (2018) Rubiks: Practical 360-degree streaming for smartphones. In: Proceedings of the Annual International Conference on Mobile Systems, Applications, and Services, pp 482–494
Huang T, Zhou C, Zhang RX, Wu C, Yao X, Sun L (2019) Comyco: Quality-aware adaptive video streaming via imitation learning. In: Proceedings of the ACM International Conference on Multimedia, pp 429–437
Jiang J, Sekar V, Zhang H (2012) Improving fairness, efficiency, and stability in http-based adaptive video streaming with festive. In: Proceedings of the international conference on Emerging networking experiments and technologies, pp 97–108
Huang TY, Johari R, McKeown N, Trunnell M, Watson M (2014) A buffer-based approach to rate adaptation: Evidence from a large video streaming service. In: Proceedings of the ACM conference on SIGCOMM, pp 187–198
Zhang Y, Zhao P, Bian K, Liu Y, Song L, Li X (2019) DRL360: 360-degree video streaming with deep reinforcement learning. In: Proceedings of IEEE Conference on Computer Communications, pp 1252–1260
Ban Y, Zhang Y, Zhang H, Zhang X, Guo Z (2020) MA360: Multi-agent deep reinforcement learning based live 360-degree video streaming on edge. In: Proceedings of IEEE International Conference on Multimedia and Expo, pp 1–6
Li Z, Zhong P, Huang J, Gao F, Wang J (2023) Achieving QoE fairness in Bitrate Allocation of 360\(^{\circ }\) Video Streaming. IEEE Trans Multimedia 26:1169–1178
Shi J, Pu L, Yuan X, Gong Q, Xu J (2022) Sophon: Super-resolution enhanced 360 video streaming with visual saliency-aware prefetch. In: Proceedings of ACM International Conference on Multimedia, pp 3124–3133
Yaqoob A, Muntean GM (2023) Advanced Predictive Tile Selection Using Dynamic Tiling for Prioritized 360 Video VR Streaming. ACM Trans Multimed Comput Commun Appl 20(1):1–28
Guimard Q, Sassatelli L (2023) SMART360: Simulating Motion prediction and Adaptive bitRate sTrategies for 360\(^{\circ }\) video streaming. In: Proceedings of the Conference on ACM Multimedia Systems, pp 384–390
Li J, Han L, Zhang C, Li Q, Liu Z (2023) Spherical convolution empowered viewport prediction in 360 video multicast with limited FoV feedback. ACM Trans Multimed Comput Commun Appl 19(1):1–23
Li J, Zhang C, Liu Z, Hong R, Hu H (2023) Optimal volumetric video streaming with hybrid saliency based tiling. IEEE Trans Multimedia 25:2939–2953
Chen J, Song Y, Li D, Lin X, Zhou S, Xu W (2024) Specular Removal of Industrial Metal Objects Without Changing Lighting Configuration. IEEE Trans Industr Inf 20(3):3144–3153
Xu H, Li Q, Chen J (2022) Highlight removal from a single grayscale image using attentive GAN. Appl Artif Intell 36(1)
Di Y, Li R, Tian H, Guo J, Shi B, Wang Z, Yan K, Liu Y (2023) A maneuvering target tracking based on fastIMM-extended Viterbi algorithm. Neural Comput & Applic
Cheng D, Chen L, Lv C, Guo L, Kou Q (2022) Light-guided and cross-fusion U-Net for anti-illumination image super-resolution. IEEE Trans Circuits Syst Video Technol 32(12):8436–8449
Zhang H, Liu H, Kim C (2024) Semantic and Instance Segmentation in Coastal Urban Spatial Perception: A Multi-Task Learning Framework with an Attention Mechanism. Sustainability 16(2):833
Pan S, Xu GJ, Guo K, Park SH, Ding H (2023) Video-Based Engagement Estimation of Game Streamers: An Interpretable Multimodal Neural Network Approach. Early Access, IEEE Transactions on Games. https://doi.org/10.1109/TG.2023.3348230
Wu J, Zhu J, Zhang J, Dang P, Li W, Guo Y, Fu L, Lai J, You J, Xie Y, Liang C (2023) A dynamic holographic modelling method of digital twin scenes for bridge construction. Int J Digit Earth 16(1):2404–2425
Liu H, Yuan H, Liu Q, Hou J, Zeng H, Kwong S (2021) A hybrid compression framework for color attributes of static 3D point clouds. IEEE Trans Circuits Syst Video Technol 32(3):1564–1577
Liu Q, Yuan H, Hamzaoui R, Su H, Hou J, Yang H (2021) Reduced reference perceptual quality model with application to rate control for video-based point cloud compression. IEEE Trans Image Process 30:6623–6636
Fu C, Yuan H, Xu H, Zhang H, Shen L (2023) TMSO-Net: Texture adaptive multi-scale observation for light field image depth estimation. J Vis Commun Image Represent 90:103731
Sheng H, Wang S, Yang D, Cong R, Cui Z, Chen R (2023) Cross-view recurrence-based self-supervised super-resolution of light field. IEEE Trans Circuits Syst Video Technol 33(12):7252–7266
Shi Y, Xi J, Hu D, Cai Z, Xu K (2023) RayMVSNet++: learning ray-based 1D implicit fields for accurate multi-view stereo. IEEE Trans Pattern Anal Mach Intell 45(11):13666–13682
Zheng W, Lu S, Yang Y, Yin Z, Yin L (2024) Lightweight transformer image feature extraction network. PeerJ Computer Science 10:e1755
Yang D, Cui Z, Sheng H, Chen R, Cong R, Wang S, Xiong Z (2024) An Occlusion and Noise-aware Stereo Framework Based on Light Field Imaging for Robust Disparity Estimation. IEEE Trans Comput 73(3):764–777
Liu S, Huang S, Xu X, Lloret J, Muhammad K (2023) Efficient Visual Tracking Based on Fuzzy Inference for Intelligent Transportation Systems. IEEE Trans Intell Trans Syst 24(12):15795–15806
Liu S, Wang S, Liu X, Gandomi A, Daneshmand M, Muhammad K, De Albuquerque V (2021) Human Memory Update Strategy: A Multi-Layer Template Update Mechanism for Remote Visual Monitoring. IEEE Trans Multimed 23:2188–2198
Liu S, Wang S, Liu X, Lin C, Lv Z (2021) Fuzzy Detection Aided Real-Time and Robust Visual Tracking Under Complex Environments. IEEE Trans Fuzzy Syst 29(1):90–102
Acknowledgements
We thank the reviewers for their insightful feedback. This work was supported by the National Natural Science Foundation of China(62302524, 62132022), and the Science and Technology Innovation Program of Hunan Province (2024JJ6531). This work was also carried out in part using computing resources at the High Performance Computing Center of Central South University.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors have no financial or proprietary interests in any material discussed in this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Huang, J., Liu, M., Liu, J. et al. Tile-size aware bitrate allocation for adaptive 360\(^{\circ }\) video streaming. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-024-19486-0
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11042-024-19486-0