research-article

SRHEN: Stepwise-Refining Homography Estimation Network via Parsing Geometric Correspondences in Deep Latent Space

Authors:

Yi Li,

Wenjie Pei,

Zhenyu HeAuthors Info & Claims

MM '20: Proceedings of the 28th ACM International Conference on Multimedia

Pages 3063 - 3071

https://doi.org/10.1145/3394171.3413870

Published: 12 October 2020 Publication History

Get Access

Abstract

The crux of homography estimation is that the homography is characterized by the geometric correspondences between two related images rather than appearance features, which differs from typical image recognition tasks. Existing methods either decompose the task of homography estimation into several individual sub-problems and optimize them sequentially, or attempt to tackle it in an end-to-end manner by delegating the whole task to deep convolutional networks (CNNs). However, it is quite arduous for CNNs to learn the mapping function from appearance features of related images to the homography directly. In this paper, we propose to parse the geometric correspondences between related images explicitly to bridge the gap between deep appearance features and the homography. Furthermore, we propose a coarse-to-fine estimation framework to capture different scale of homography transformations and thus predict the homography in a stepwise-refining manner. Additionally, we propose a pyramidal supervision scheme to leverage an important prior concerning the homography estimation. Extensive experiments on two large-scale datasets demonstrate that our model advances the state-of-the-art performance significantly.

Supplementary Material

MP4 File (3394171.3413870.mp4)

The crux of homography estimation is that the homography is characterized by the geometric correspondences between two related images rather than appearance features, which differs from typical image recognition tasks. Existing methods either decompose the task of homography estimation into several individual sub-problems and optimize them sequentially, or attempt to tackle it in an end-to-end manner by delegating the whole task to CNNs. However, it is quite arduous for CNNs to learn the mapping function from appearance features to the homography directly. In this paper, we parse the geometric correspondences between related images explicitly to bridge the gap between deep appearance features and the homography. Furthermore, we propose a coarse-to-fine estimation framework to capture different scale of homography transformations and thus predict the homography in a stepwise-refining manner.

Download
48.96 MB

References

[1]

Simon Baker, Ankur Datta, and Takeo Kanade. 2006. Parameterizing Homographies . Technical Report Carnegie Mellon University-RI-TR-06--11. Carnegie Mellon University.

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

Learning Pixel-wise Alignment for Unsupervised Image Stitching

Proposition and Comparison of Catadioptric Homography Estimation Methods

Proposition and comparison of catadioptric homography estimation methods

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations