Video Dynamics Prior: An Internal Learning Approach for Robust Video Enhancements

Shrivastava, Gaurav; Lim, Ser-Nam; Shrivastava, Abhinav

arXiv:2312.07835 (cs)

[Submitted on 13 Dec 2023]

Abstract: We present a robust framework for low-level vision tasks, including denoising, object removal, frame interpolation, and super-resolution, that requires no external training data. The approach learns the weights of its neural modules directly by optimizing over the corrupted test sequence, exploiting the spatio-temporal coherence and internal statistics of the video. We also introduce a novel spatial pyramid loss that exploits the recurrence of spatio-temporal patches across the different scales of a video. This loss improves robustness to unstructured noise in both the spatial and temporal domains. As a result, the framework is highly robust to degradation in the input frames and yields state-of-the-art results on downstream tasks such as denoising, object removal, and frame interpolation. We validate the approach with qualitative and quantitative evaluations on standard video datasets such as DAVIS, UCF-101, and VIMEO90K-T.
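The abstract does not give the form of the spatial pyramid loss, so the following is only an illustrative sketch of the general idea of comparing a prediction and a target at several spatial scales. Everything in it is an assumption rather than the authors' method: the function names `downsample2x` and `pyramid_loss`, the use of 2x2 average pooling to build the pyramid, the mean squared error at each scale, and the default of three scales.

```python
import numpy as np


def downsample2x(frames):
    """Halve height and width of a (T, H, W, C) video by 2x2 average pooling."""
    t, h, w, c = frames.shape
    h2, w2 = h // 2, w // 2
    # Crop odd trailing rows/columns so each frame tiles evenly into 2x2 blocks.
    cropped = frames[:, : h2 * 2, : w2 * 2, :]
    return cropped.reshape(t, h2, 2, w2, 2, c).mean(axis=(2, 4))


def pyramid_loss(pred, target, num_scales=3):
    """Sum of per-scale mean squared errors over a spatial pyramid.

    Hypothetical stand-in for a multi-scale loss; not the paper's definition.
    """
    total = 0.0
    for _ in range(num_scales):
        total += float(np.mean((pred - target) ** 2))
        # Move both videos to the next, coarser pyramid level.
        pred, target = downsample2x(pred), downsample2x(target)
    return total


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clean = rng.random((4, 32, 32, 3))          # 4 frames, 32x32, RGB
    noisy = clean + 0.1 * rng.standard_normal(clean.shape)
    print("pyramid loss:", pyramid_loss(noisy, clean))
```

In an internal-learning setup, a loss of this kind would be minimized over the weights of a network fitted to the single corrupted test video. One intuition for using coarser scales is that averaging 2x2 blocks reduces the variance of independent zero-mean noise by roughly a factor of four; that is a general property of averaging, not a claim taken from the paper.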

Comments:	NeurIPS 2023; Webpage - this http URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2312.07835 [cs.CV]
	(or arXiv:2312.07835v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.07835

Submission history

From: Gaurav Shrivastava
[v1] Wed, 13 Dec 2023 01:57:11 UTC (7,550 KB)
