research-article

Robust Video Stabilization based on Motion Decomposition

Authors:

Peiyan LiAuthors Info & Claims

ACM Transactions on Multimedia Computing, Communications and Applications, Volume 19, Issue 5

Article No.: 164, Pages 1 - 24

https://doi.org/10.1145/3580498

Published: 16 March 2023 Publication History

Abstract

Video stabilization aims to eliminate camera jitter and improve the visual experience of shaky videos. Video stabilization methods often ignore the active movement of the foreground objects and the camera, and may result in distortion and over-smoothing problems. To resolve these issues, this paper proposes a novel video stabilization method based on motion decomposition. Since the inter-frame movement of foreground objects is different from that of the background, we separate foreground feature points from background feature points by modifying the classic density based spatial clustering method of applications with noise (DBSCAN). The movement of background feature points is consistent with the movement of the camera, which can be decomposed into the camera jitter and the active movement of the camera. And the movement of foreground feature points can be decomposed into the movement of the camera and the active movement of foreground objects. Based on motion decomposition, we design first-order and second-order trajectory smoothing constraints to eliminate the high-frequency and low-frequency components of the camera jitter. To reduce content distortion, shape-preserving constraints, and regularization constraints are taken to generate stabilized views of all feature points. Experimental results demonstrate the effectiveness and robustness of the proposed video stabilization method on a variety of challenging videos.

References

[1]

Chris Buehler, Michael Bosse, and Leonard McMillan. 2001. Non-metric image-based rendering for video stabilization. In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]

Hung-Chang Chang, Shang-Hong Lai, and Kuang-Rong Lu. 2004. A robust and efficient video stabilization algorithm. In Proceedings of the 2004 IEEE International Conference on Multimedia and Expo.29–32.

[3]

Hung-Chang Chang, Shang-Hong Lai, and Kuang-Rong Lu. 2006. A robust real-time video stabilization algorithm. In Proceedings of the Journal of Visual Communication and Image Representation. 659–673.

[4]

Jinsoo Choi and In So Kweon. 2020. Deep iterative frame interpolation for full-frame video stabilization. In Proceedings of the ACM Transactions on Graphics.1–9.

Digital Library

[5]

Chung-Hua Chu. 2015. Visual comfort for stereoscopic 3D by using motion sensors on 3D mobile devices. ACM Transactions on Multimedia Computing, Communications, and Applications 12, 1s (2015), 1–20.

Digital Library

[6]

Jing Dong and Haibo Liu. 2016. Video stabilization for strict real-time applications. In Proceedings of the IEEE Transactions on Circuits and Systems for Video Technology. 716–724.

[7]

Martin Ester, Hans-Peter Kriegel, Jörg Sander, and Xiaowei Xu. 1996. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the kdd. 226–231.

[8]

Martin A. Fischler and Robert C. Bolles. 1981. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. In Proceedings of the Association for Computing Machinery. 381–395.

Digital Library

[9]

Michael L. Gleicher and Feng Liu. 2008. Re-cinematography: Improving the camerawork of casual video. ACM Transactions on Multimedia Computing, Communications, and Applications 5, 1 (2008), 1–28.

Digital Library

[10]

Amit Goldstein and Raanan Fattal. 2012. Video stabilization using epipolar geometry. In Proceedings of the ACM Transactions on Graphics.1–10.

Digital Library

[11]

Matthias Grundmann, Vivek Kwatra, Daniel Castro, and Irfan Essa. 2012. Calibration-free rolling shutter removal. In Proceedings of the 2012 IEEE International Conference on Computational Photography. 1–8.

[12]

M. Grundmann, V. Kwatra, D. Castro, and I. Essa. 2012. Calibration-free rolling shutter removal. In Proceedings of the 2012 IEEE International Conference on Computational Photography. 1–8.

[13]

M. Grundmann, V. Kwatra, and I. Essa. 2011. Auto-directed video stabilization with robust L1 optimal camera paths. In Proceedings of the CVPR 2011. 225–232.

Digital Library

[14]

Takeo Igarashi, Tomer Moscovich, and John F. Hughes. 2005. As-rigid-as-possible shape manipulation. In Proceedings of the ACM Transactions on Graphics. 1134–1141.

Digital Library

[15]

Zdenek Kalal, Krystian Mikolajczyk, and Jiri Matas. 2010. Forward-backward error: Automatic detection of tracking failures. In Proceedings of the 2010 20th International Conference on Pattern Recognition. 2756–2759.

Digital Library

[16]

Y. J. Koh, C. Lee, and C. Kim. 2015. Video stabilization based on feature trajectory augmentation and selection and robust mesh grid warping. In Proceedings of the IEEE Transactions on Image Processing. 5260–5273.

Digital Library

[17]

Der-Tsai Lee and Bruce J. Schachter. 1980. Two algorithms for constructing a delaunay triangulation. In Proceedings of the International Journal of Computer Information Sciences. 219–242.

[18]

Ken-Yi Lee, Yung-Yu Chuang, Bing-Yu Chen, and Ming Ouhyoung. 2009. Video stabilization using robust feature trajectories. In Proceedings of the 2009 IEEE 12th International Conference on Computer Vision. 1397–1404.

[19]

Se-Ho Lee, Jin-Hwan Kim, Kwang Pyo Choi, Jae-Young Sim, and Chang-Su Kim. 2014. Video saliency detection based on spatiotemporal feature learning. In Proceedings of the 2014 IEEE International Conference on Image Processing. 1120–1124.

[20]

Qiang Ling, Sibin Deng, Feng Li, Qinghua Huang, and Xuelong Li. 2016. A feedback-based robust video stabilization method for traffic videos. In Proceedings of the IEEE Transactions on Circuits and Systems for Video Technology. 561–572.

[21]

Feng Liu, Michael Gleicher, Hailin Jin, and Aseem Agarwala. 2009. Content-preserving warps for 3D video stabilization. ACM Transactions on Graphics 28, 3 (2009), 1–9.

Digital Library

[22]

Feng Liu, Michael Gleicher, Jue Wang, Hailin Jin, and Aseem Agarwala. 2011. Subspace video stabilization. In Proceedings of the ACM Transactions on Graphics. 1–10.

Digital Library

[23]

Shuaicheng Liu, Ping Tan, Lu Yuan, Jian Sun, and Bing Zeng. 2016. Meshflow: Minimum latency online video stabilization. In Proceedings of the European Conference on Computer Vision. 800–815.

[24]

Shuaicheng Liu, Yinting Wang, Lu Yuan, Jiajun Bu, Ping Tan, and Jian Sun. 2012. Video stabilization with a depth camera. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition. 89–95.

[25]

Shuaicheng Liu, Binhan Xu, Chuang Deng, Shuyuan Zhu, Bing Zeng, and Moncef Gabbouj. 2016. A hybrid approach for near-range video stabilization. In Proceedings of the IEEE Transactions on Circuits and Systems for Video Technology. 1922–1933.

[26]

Shuaicheng Liu, Lu Yuan, Ping Tan, and Jian Sun. 2013. Bundled camera paths for video stabilization. In Proceedings of the ACM Trans. Graph. 10 pages.

Digital Library

[27]

Shuaicheng Liu, Lu Yuan, Ping Tan, and Jian Sun. 2014. Steadyflow: Spatially smooth optical flow for video stabilization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4209–4216.

Digital Library

[28]

Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang, Yung-Yu Chuang, and Jia-Bin Huang. 2021. Hybrid neural fusion for full-frame video stabilization. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 2299–2308.

[29]

Tiezheng Ma, Yongwei Nie, Qing Zhang, Zhensong Zhang, Hanqiu Sun, and Guiqing Li. 2019. Effective video stabilization via joint trajectory smoothing and frame warping. IEEE Transactions on Visualization and Computer Graphics 26, 11 (2019), 3163–3176.

[30]

Y. Matsushita, E. Ofek, Weina Ge, Xiaoou Tang, and Heung-Yeung Shum. 2006. Full-frame video stabilization with motion inpainting. In Proceedings of the IEEE Transactions on Pattern Analysis and Machine Intelligence. 1150–1163.

Digital Library

[31]

Quanling Meng, Heyan Zhu, Weigang Zhang, Xuefeng Piao, and Aijie Zhang. 2020. Action recognition using form and motion modalities. ACM Transactions on Multimedia Computing, Communications, and Applications 16, 1s (2020), 1–16.

Digital Library

[32]

Carlos Morimoto and Ramalingam Chellappa. 1996. Fast electronic digital image stabilization. In Proceedings of the 13th International Conference on Pattern Recognition. 284–288.

[33]

Yongwei Nie, Tan Su, Zhensong Zhang, Hanqiu Sun, and Guiqing Li. 2017. Dynamic video stitching via shakiness removing. In Proceedings of the IEEE Transactions on Image Processing. 164–178.

[34]

Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Proceedings of the 28th International Conference on Neural Information Processing Systems, 91–99.

[35]

R. Hartley and A. Zisserman. 2003. Multiple view geometry in computer vision. In Proceedings of the Cambridge University Press.

[36]

Brandon M Smith, Li Zhang, Hailin Jin, and Aseem Agarwala. 2009. Light field video stabilization. In Proceedings of the 2009 IEEE 12th International Conference on Computer Vision. 341–348.

[37]

Carlo Tomasi and Takeo Kanade. 1991. Detection and tracking of point features. In Proceedings of the International Journal of Computer Vision.

[38]

Miao Wang, Guo-Ye Yang, Jin-Kun Lin, Song-Hai Zhang, Ariel Shamir, Shao-Ping Lu, and Shi-Min Hu. 2018. Deep online video stabilization with multi-grid warping transformation learning. IEEE Transactions on Image Processing 28, 5 (2018), 2283–2292.

[39]

Yu-Shuen Wang, Feng Liu, Pu-Sheng Hsu, and Tong-Yee Lee. 2013. Spatially and temporally optimized video stabilization. In Proceedings of the IEEE Transactions on Visualization and Computer Graphics. 1354–1361.

[40]

Zhong-Qiang Wang, Lei Zhang, and Hua Huang. 2013. Multiplane video stabilization. In Proceedings of the Computer Graphics Forum. 265–273.

[41]

Huicong Wu, Liang Xiao, Zhichao Lian, and Hiuk Jae Shim. 2019. Locally low-rank regularized video stabilization with motion diversity constraints. In Proceedings of the IEEE Transactions on Circuits and Systems for Video Technology. 2873–2887.

[42]

Huicong Wu, Liang Xiao, and Zhihui Wei. 2021. Simultaneous video stabilization and rolling shutter removal. IEEE Transactions on Image Processing 30 (2021), 4637–4652.

Digital Library

[43]

Jingjing Wu, Jianguo Jiang, Meibin Qi, Cuiqun Chen, and Yimin Liu. 2022. Improving feature discrimination for object tracking by structural-similarity-based metric learning. ACM Transactions on Multimedia Computing, Communications, and Applications 18, 4 (2022), 1–23.

Digital Library

[44]

Yufei Xu, Jing Zhang, Stephen J. Maybank, and Dacheng Tao. 2022. Dut: Learning video stabilization by simply watching unstable videos. IEEE Transactions on Image Processing 31 (2022), 4306–4320.

Digital Library

[45]

Junlan Yang, Dan Schonfeld, and Magdi Mohamed. 2009. Robust video stabilization based on particle filter tracking of projected camera motion. 19, 7 (2009), 945–954.

[46]

Jiyang Yu and Ravi Ramamoorthi. 2020. Learning video stabilization using optical flow. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8159–8167.

[47]

Fang-Lue Zhang, Jue Wang, Han Zhao, Ralph R. Martin, and Shi-Min Hu. 2015. Simultaneous camera path optimization and distraction removal for improving amateur video. In Proceedings of the IEEE Transactions on Image Processing. 5982–5994.

Digital Library

[48]

G. Zhang, W. Hua, X. Qin Y. Shao, and H. Bao. 2009. Video stabilization based on a 3D perspective camera model. In Proceedings of the Visual Computer. 997–1008.

Digital Library

[49]

Lei Zhang, Qian-Kun Xu, and Hua Huang. 2015. A global approach to fast video stabilization. In Proceedings of the IEEE Transactions on Circuits and Systems for Video Technology. 225–235.

[50]

Minda Zhao and Qiang Ling. 2020. PWStableNet: Learning pixel-wise warping maps for video stabilization. In Proceedings of the IEEE Transactions on Image Processing. 3582–3595.

Digital Library

[51]

M. Zhao and Q. Ling. 2021. Adaptively meshed video stabilization. In Proceedings of the IEEE Transactions on Circuits and Systems for Video Technology. 3504–3517.

[52]

Zihan Zhou, Hailin Jin, and Yi Ma. 2013. Plane-based content preserving warps for video stabilization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2299–2306.

Digital Library

Cited By

Zhang YLin XYang HHe JQing LHe XLi YChen H(2024)A Multi-Attention Feature Distillation Neural Network for Lightweight Single Image Super-ResolutionInternational Journal of Intelligent Systems10.1155/2024/32552332024Online publication date: 15-Feb-2024
https://dl.acm.org/doi/10.1155/2024/3255233
Zhang DZhu WLiao XQi FYang GDing X(2024)Spatiotemporal Inconsistency Learning and Interactive Fusion for Deepfake Video DetectionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/3664654Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3664654
Zhang PLiu MSong XCao DGao ZNie L(2024)Universal Relocalizer for Weakly Supervised Referring Expression GroundingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365604520:7(1-23)Online publication date: 16-May-2024
https://dl.acm.org/doi/10.1145/3656045
Show More Cited By

Index Terms

Robust Video Stabilization based on Motion Decomposition
1. Computing methodologies
  1. Computer graphics
    1. Image manipulation
      1. Image processing

Recommendations

Dual-mode video stabilization based on adaptive motion clustering
ICIMCS '15: Proceedings of the 7th International Conference on Internet Multimedia Computing and Service

Many target tracking videos suffer from significant shake due to the undesirable camera motion or target vibration. These degraded video records have a negative impact on viewing experience as well as the follow-up analysis and application. Most ...
Video stabilization using epipolar geometry

We present a new video stabilization technique that uses projective scene reconstruction to treat jittered video sequences. Unlike methods that recover the full three-dimensional geometry of the scene, this model accounts for simple geometric relations ...
Rethinking two-dimensional camera motion estimation assessment for digital video stabilization: A camera motion field-based metric
Abstract
Digital video stabilization aims to remove camera motion jitters through software implementation. The first step of the classical video stabilization methodology is called camera motion estimation, which is usually performed using only RGB frames ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Multimedia Computing, Communications, and Applications

ACM Transactions on Multimedia Computing, Communications, and Applications Volume 19, Issue 5

September 2023

262 pages

ISSN:1551-6857

EISSN:1551-6865

DOI:10.1145/3585398

Editor:
Abdulmotaleb El Saddik
Mohamed Bin Zayed University of Artificial Intelligence, UAE and University of Ottawa, Canada

Issue’s Table of Contents

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 March 2023

Online AM: 30 January 2023

Accepted: 12 January 2023

Revised: 20 December 2022

Received: 06 April 2022

Published in TOMM Volume 19, Issue 5

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Key Science and Technology Program of Anhui
Provincial Quality Program of High Education Schools of Anhui Province
Applied Science and Technology Achievement Cultivation Project of Institute of Advanced Technology, University of Science and Technology of China

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

12
Total Citations
View Citations
331
Total Downloads

Downloads (Last 12 months)176
Downloads (Last 6 weeks)10

Reflects downloads up to

Other Metrics

View Author Metrics

Citations

Cited By

Zhang YLin XYang HHe JQing LHe XLi YChen H(2024)A Multi-Attention Feature Distillation Neural Network for Lightweight Single Image Super-ResolutionInternational Journal of Intelligent Systems10.1155/2024/32552332024Online publication date: 15-Feb-2024
https://dl.acm.org/doi/10.1155/2024/3255233
Zhang DZhu WLiao XQi FYang GDing X(2024)Spatiotemporal Inconsistency Learning and Interactive Fusion for Deepfake Video DetectionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/3664654Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3664654
Zhang PLiu MSong XCao DGao ZNie L(2024)Universal Relocalizer for Weakly Supervised Referring Expression GroundingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365604520:7(1-23)Online publication date: 16-May-2024
https://dl.acm.org/doi/10.1145/3656045
Ben HWang SWang MHong RGurrin CKongkachandra RSchoeffmann KDang-Nguyen DRossetto LSatoh SZhou L(2024)Pseudo Content Hallucination for Unpaired Image CaptioningProceedings of the 2024 International Conference on Multimedia Retrieval10.1145/3652583.3658080(320-329)Online publication date: 30-May-2024
https://dl.acm.org/doi/10.1145/3652583.3658080
Antil ADhiman C(2024)MF2ShrT: Multimodal Feature Fusion Using Shared Layered Transformer for Face Anti-spoofingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/364081720:6(1-21)Online publication date: 8-Mar-2024
https://dl.acm.org/doi/10.1145/3640817
Li MZhou THuang ZYang JYang JGong C(2024)Dynamic Weighted Adversarial Learning for Semi-Supervised Classification under Intersectional Class MismatchACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363531020:4(1-24)Online publication date: 11-Jan-2024
https://dl.acm.org/doi/10.1145/3635310
Shi PHu MShi XRen F(2024)Deep Modular Co-Attention Shifting Network for Multimodal Sentiment AnalysisACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363470620:4(1-23)Online publication date: 11-Jan-2024
https://dl.acm.org/doi/10.1145/3634706
Feng ZXu JMa LZhang S(2024)Efficient Video Transformers via Spatial-temporal Token Merging for Action RecognitionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363378120:4(1-21)Online publication date: 11-Jan-2024
https://dl.acm.org/doi/10.1145/3633781
Nai KChen S(2024)Learning a Novel Ensemble Tracker for Robust Visual TrackingIEEE Transactions on Multimedia10.1109/TMM.2023.330793926(3194-3206)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TMM.2023.3307939
Anand SDevulapally NBhattacharjee SYuan JEl Saddik AMei TCucchiara RBertini MTobon Vallejo DAtrey PHossain M(2023)Multi-label Emotion Analysis in Conversation via Multimodal Knowledge DistillationProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612517(6090-6100)Online publication date: 26-Oct-2023
https://dl.acm.org/doi/10.1145/3581783.3612517
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Full Text

View this article in Full Text.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View full text|Download PDF

View Issue’s Table of Contents