Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Robust Video Stabilization based on Motion Decomposition

Published: 16 March 2023 Publication History
  • Get Citation Alerts
  • Abstract

    Video stabilization aims to eliminate camera jitter and improve the visual experience of shaky videos. Video stabilization methods often ignore the active movement of the foreground objects and the camera, and may result in distortion and over-smoothing problems. To resolve these issues, this paper proposes a novel video stabilization method based on motion decomposition. Since the inter-frame movement of foreground objects is different from that of the background, we separate foreground feature points from background feature points by modifying the classic density based spatial clustering method of applications with noise (DBSCAN). The movement of background feature points is consistent with the movement of the camera, which can be decomposed into the camera jitter and the active movement of the camera. And the movement of foreground feature points can be decomposed into the movement of the camera and the active movement of foreground objects. Based on motion decomposition, we design first-order and second-order trajectory smoothing constraints to eliminate the high-frequency and low-frequency components of the camera jitter. To reduce content distortion, shape-preserving constraints, and regularization constraints are taken to generate stabilized views of all feature points. Experimental results demonstrate the effectiveness and robustness of the proposed video stabilization method on a variety of challenging videos.

    References

    [1]
    Chris Buehler, Michael Bosse, and Leonard McMillan. 2001. Non-metric image-based rendering for video stabilization. In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
    [2]
    Hung-Chang Chang, Shang-Hong Lai, and Kuang-Rong Lu. 2004. A robust and efficient video stabilization algorithm. In Proceedings of the 2004 IEEE International Conference on Multimedia and Expo.29–32.
    [3]
    Hung-Chang Chang, Shang-Hong Lai, and Kuang-Rong Lu. 2006. A robust real-time video stabilization algorithm. In Proceedings of the Journal of Visual Communication and Image Representation. 659–673.
    [4]
    Jinsoo Choi and In So Kweon. 2020. Deep iterative frame interpolation for full-frame video stabilization. In Proceedings of the ACM Transactions on Graphics.1–9.
    [5]
    Chung-Hua Chu. 2015. Visual comfort for stereoscopic 3D by using motion sensors on 3D mobile devices. ACM Transactions on Multimedia Computing, Communications, and Applications 12, 1s (2015), 1–20.
    [6]
    Jing Dong and Haibo Liu. 2016. Video stabilization for strict real-time applications. In Proceedings of the IEEE Transactions on Circuits and Systems for Video Technology. 716–724.
    [7]
    Martin Ester, Hans-Peter Kriegel, Jörg Sander, and Xiaowei Xu. 1996. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the kdd. 226–231.
    [8]
    Martin A. Fischler and Robert C. Bolles. 1981. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. In Proceedings of the Association for Computing Machinery. 381–395.
    [9]
    Michael L. Gleicher and Feng Liu. 2008. Re-cinematography: Improving the camerawork of casual video. ACM Transactions on Multimedia Computing, Communications, and Applications 5, 1 (2008), 1–28.
    [10]
    Amit Goldstein and Raanan Fattal. 2012. Video stabilization using epipolar geometry. In Proceedings of the ACM Transactions on Graphics.1–10.
    [11]
    Matthias Grundmann, Vivek Kwatra, Daniel Castro, and Irfan Essa. 2012. Calibration-free rolling shutter removal. In Proceedings of the 2012 IEEE International Conference on Computational Photography. 1–8.
    [12]
    M. Grundmann, V. Kwatra, D. Castro, and I. Essa. 2012. Calibration-free rolling shutter removal. In Proceedings of the 2012 IEEE International Conference on Computational Photography. 1–8.
    [13]
    M. Grundmann, V. Kwatra, and I. Essa. 2011. Auto-directed video stabilization with robust L1 optimal camera paths. In Proceedings of the CVPR 2011. 225–232.
    [14]
    Takeo Igarashi, Tomer Moscovich, and John F. Hughes. 2005. As-rigid-as-possible shape manipulation. In Proceedings of the ACM Transactions on Graphics. 1134–1141.
    [15]
    Zdenek Kalal, Krystian Mikolajczyk, and Jiri Matas. 2010. Forward-backward error: Automatic detection of tracking failures. In Proceedings of the 2010 20th International Conference on Pattern Recognition. 2756–2759.
    [16]
    Y. J. Koh, C. Lee, and C. Kim. 2015. Video stabilization based on feature trajectory augmentation and selection and robust mesh grid warping. In Proceedings of the IEEE Transactions on Image Processing. 5260–5273.
    [17]
    Der-Tsai Lee and Bruce J. Schachter. 1980. Two algorithms for constructing a delaunay triangulation. In Proceedings of the International Journal of Computer Information Sciences. 219–242.
    [18]
    Ken-Yi Lee, Yung-Yu Chuang, Bing-Yu Chen, and Ming Ouhyoung. 2009. Video stabilization using robust feature trajectories. In Proceedings of the 2009 IEEE 12th International Conference on Computer Vision. 1397–1404.
    [19]
    Se-Ho Lee, Jin-Hwan Kim, Kwang Pyo Choi, Jae-Young Sim, and Chang-Su Kim. 2014. Video saliency detection based on spatiotemporal feature learning. In Proceedings of the 2014 IEEE International Conference on Image Processing. 1120–1124.
    [20]
    Qiang Ling, Sibin Deng, Feng Li, Qinghua Huang, and Xuelong Li. 2016. A feedback-based robust video stabilization method for traffic videos. In Proceedings of the IEEE Transactions on Circuits and Systems for Video Technology. 561–572.
    [21]
    Feng Liu, Michael Gleicher, Hailin Jin, and Aseem Agarwala. 2009. Content-preserving warps for 3D video stabilization. ACM Transactions on Graphics 28, 3 (2009), 1–9.
    [22]
    Feng Liu, Michael Gleicher, Jue Wang, Hailin Jin, and Aseem Agarwala. 2011. Subspace video stabilization. In Proceedings of the ACM Transactions on Graphics. 1–10.
    [23]
    Shuaicheng Liu, Ping Tan, Lu Yuan, Jian Sun, and Bing Zeng. 2016. Meshflow: Minimum latency online video stabilization. In Proceedings of the European Conference on Computer Vision. 800–815.
    [24]
    Shuaicheng Liu, Yinting Wang, Lu Yuan, Jiajun Bu, Ping Tan, and Jian Sun. 2012. Video stabilization with a depth camera. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition. 89–95.
    [25]
    Shuaicheng Liu, Binhan Xu, Chuang Deng, Shuyuan Zhu, Bing Zeng, and Moncef Gabbouj. 2016. A hybrid approach for near-range video stabilization. In Proceedings of the IEEE Transactions on Circuits and Systems for Video Technology. 1922–1933.
    [26]
    Shuaicheng Liu, Lu Yuan, Ping Tan, and Jian Sun. 2013. Bundled camera paths for video stabilization. In Proceedings of the ACM Trans. Graph. 10 pages.
    [27]
    Shuaicheng Liu, Lu Yuan, Ping Tan, and Jian Sun. 2014. Steadyflow: Spatially smooth optical flow for video stabilization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4209–4216.
    [28]
    Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang, Yung-Yu Chuang, and Jia-Bin Huang. 2021. Hybrid neural fusion for full-frame video stabilization. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 2299–2308.
    [29]
    Tiezheng Ma, Yongwei Nie, Qing Zhang, Zhensong Zhang, Hanqiu Sun, and Guiqing Li. 2019. Effective video stabilization via joint trajectory smoothing and frame warping. IEEE Transactions on Visualization and Computer Graphics 26, 11 (2019), 3163–3176.
    [30]
    Y. Matsushita, E. Ofek, Weina Ge, Xiaoou Tang, and Heung-Yeung Shum. 2006. Full-frame video stabilization with motion inpainting. In Proceedings of the IEEE Transactions on Pattern Analysis and Machine Intelligence. 1150–1163.
    [31]
    Quanling Meng, Heyan Zhu, Weigang Zhang, Xuefeng Piao, and Aijie Zhang. 2020. Action recognition using form and motion modalities. ACM Transactions on Multimedia Computing, Communications, and Applications 16, 1s (2020), 1–16.
    [32]
    Carlos Morimoto and Ramalingam Chellappa. 1996. Fast electronic digital image stabilization. In Proceedings of the 13th International Conference on Pattern Recognition. 284–288.
    [33]
    Yongwei Nie, Tan Su, Zhensong Zhang, Hanqiu Sun, and Guiqing Li. 2017. Dynamic video stitching via shakiness removing. In Proceedings of the IEEE Transactions on Image Processing. 164–178.
    [34]
    Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Proceedings of the 28th International Conference on Neural Information Processing Systems, 91–99.
    [35]
    R. Hartley and A. Zisserman. 2003. Multiple view geometry in computer vision. In Proceedings of the Cambridge University Press.
    [36]
    Brandon M Smith, Li Zhang, Hailin Jin, and Aseem Agarwala. 2009. Light field video stabilization. In Proceedings of the 2009 IEEE 12th International Conference on Computer Vision. 341–348.
    [37]
    Carlo Tomasi and Takeo Kanade. 1991. Detection and tracking of point features. In Proceedings of the International Journal of Computer Vision.
    [38]
    Miao Wang, Guo-Ye Yang, Jin-Kun Lin, Song-Hai Zhang, Ariel Shamir, Shao-Ping Lu, and Shi-Min Hu. 2018. Deep online video stabilization with multi-grid warping transformation learning. IEEE Transactions on Image Processing 28, 5 (2018), 2283–2292.
    [39]
    Yu-Shuen Wang, Feng Liu, Pu-Sheng Hsu, and Tong-Yee Lee. 2013. Spatially and temporally optimized video stabilization. In Proceedings of the IEEE Transactions on Visualization and Computer Graphics. 1354–1361.
    [40]
    Zhong-Qiang Wang, Lei Zhang, and Hua Huang. 2013. Multiplane video stabilization. In Proceedings of the Computer Graphics Forum. 265–273.
    [41]
    Huicong Wu, Liang Xiao, Zhichao Lian, and Hiuk Jae Shim. 2019. Locally low-rank regularized video stabilization with motion diversity constraints. In Proceedings of the IEEE Transactions on Circuits and Systems for Video Technology. 2873–2887.
    [42]
    Huicong Wu, Liang Xiao, and Zhihui Wei. 2021. Simultaneous video stabilization and rolling shutter removal. IEEE Transactions on Image Processing 30 (2021), 4637–4652.
    [43]
    Jingjing Wu, Jianguo Jiang, Meibin Qi, Cuiqun Chen, and Yimin Liu. 2022. Improving feature discrimination for object tracking by structural-similarity-based metric learning. ACM Transactions on Multimedia Computing, Communications, and Applications 18, 4 (2022), 1–23.
    [44]
    Yufei Xu, Jing Zhang, Stephen J. Maybank, and Dacheng Tao. 2022. Dut: Learning video stabilization by simply watching unstable videos. IEEE Transactions on Image Processing 31 (2022), 4306–4320.
    [45]
    Junlan Yang, Dan Schonfeld, and Magdi Mohamed. 2009. Robust video stabilization based on particle filter tracking of projected camera motion. 19, 7 (2009), 945–954.
    [46]
    Jiyang Yu and Ravi Ramamoorthi. 2020. Learning video stabilization using optical flow. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8159–8167.
    [47]
    Fang-Lue Zhang, Jue Wang, Han Zhao, Ralph R. Martin, and Shi-Min Hu. 2015. Simultaneous camera path optimization and distraction removal for improving amateur video. In Proceedings of the IEEE Transactions on Image Processing. 5982–5994.
    [48]
    G. Zhang, W. Hua, X. Qin Y. Shao, and H. Bao. 2009. Video stabilization based on a 3D perspective camera model. In Proceedings of the Visual Computer. 997–1008.
    [49]
    Lei Zhang, Qian-Kun Xu, and Hua Huang. 2015. A global approach to fast video stabilization. In Proceedings of the IEEE Transactions on Circuits and Systems for Video Technology. 225–235.
    [50]
    Minda Zhao and Qiang Ling. 2020. PWStableNet: Learning pixel-wise warping maps for video stabilization. In Proceedings of the IEEE Transactions on Image Processing. 3582–3595.
    [51]
    M. Zhao and Q. Ling. 2021. Adaptively meshed video stabilization. In Proceedings of the IEEE Transactions on Circuits and Systems for Video Technology. 3504–3517.
    [52]
    Zihan Zhou, Hailin Jin, and Yi Ma. 2013. Plane-based content preserving warps for video stabilization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2299–2306.

    Cited By

    View all
    • (2024)A Multi-Attention Feature Distillation Neural Network for Lightweight Single Image Super-ResolutionInternational Journal of Intelligent Systems10.1155/2024/32552332024Online publication date: 15-Feb-2024
    • (2024)Spatiotemporal Inconsistency Learning and Interactive Fusion for Deepfake Video DetectionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/3664654Online publication date: 13-May-2024
    • (2024)Universal Relocalizer for Weakly Supervised Referring Expression GroundingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365604520:7(1-23)Online publication date: 16-May-2024
    • Show More Cited By

    Index Terms

    1. Robust Video Stabilization based on Motion Decomposition

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 19, Issue 5
      September 2023
      262 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/3585398
      • Editor:
      • Abdulmotaleb El Saddik
      Issue’s Table of Contents

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 16 March 2023
      Online AM: 30 January 2023
      Accepted: 12 January 2023
      Revised: 20 December 2022
      Received: 06 April 2022
      Published in TOMM Volume 19, Issue 5

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Video stabilization
      2. motion decomposition
      3. DBSCAN
      4. feature point clustering

      Qualifiers

      • Research-article

      Funding Sources

      • Key Science and Technology Program of Anhui
      • Provincial Quality Program of High Education Schools of Anhui Province
      • Applied Science and Technology Achievement Cultivation Project of Institute of Advanced Technology, University of Science and Technology of China

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)176
      • Downloads (Last 6 weeks)10
      Reflects downloads up to

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)A Multi-Attention Feature Distillation Neural Network for Lightweight Single Image Super-ResolutionInternational Journal of Intelligent Systems10.1155/2024/32552332024Online publication date: 15-Feb-2024
      • (2024)Spatiotemporal Inconsistency Learning and Interactive Fusion for Deepfake Video DetectionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/3664654Online publication date: 13-May-2024
      • (2024)Universal Relocalizer for Weakly Supervised Referring Expression GroundingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365604520:7(1-23)Online publication date: 16-May-2024
      • (2024)Pseudo Content Hallucination for Unpaired Image CaptioningProceedings of the 2024 International Conference on Multimedia Retrieval10.1145/3652583.3658080(320-329)Online publication date: 30-May-2024
      • (2024)MF2ShrT: Multimodal Feature Fusion Using Shared Layered Transformer for Face Anti-spoofingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/364081720:6(1-21)Online publication date: 8-Mar-2024
      • (2024)Dynamic Weighted Adversarial Learning for Semi-Supervised Classification under Intersectional Class MismatchACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363531020:4(1-24)Online publication date: 11-Jan-2024
      • (2024)Deep Modular Co-Attention Shifting Network for Multimodal Sentiment AnalysisACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363470620:4(1-23)Online publication date: 11-Jan-2024
      • (2024)Efficient Video Transformers via Spatial-temporal Token Merging for Action RecognitionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363378120:4(1-21)Online publication date: 11-Jan-2024
      • (2024)Learning a Novel Ensemble Tracker for Robust Visual TrackingIEEE Transactions on Multimedia10.1109/TMM.2023.330793926(3194-3206)Online publication date: 1-Jan-2024
      • (2023)Multi-label Emotion Analysis in Conversation via Multimodal Knowledge DistillationProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612517(6090-6100)Online publication date: 26-Oct-2023
      • Show More Cited By

      View Options

      Get Access

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Full Text

      View this article in Full Text.

      Full Text

      HTML Format

      View this article in HTML Format.

      HTML Format

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media