Cross-Domain 3D Equivariant Image Embeddings

Esteves, Carlos; Sud, Avneesh; Luo, Zhengyi; Daniilidis, Kostas; Makadia, Ameesh

Computer Science > Computer Vision and Pattern Recognition

arXiv:1812.02716v2 (cs)

[Submitted on 6 Dec 2018 (v1), last revised 14 May 2019 (this version, v2)]

Title:Cross-Domain 3D Equivariant Image Embeddings

Authors:Carlos Esteves, Avneesh Sud, Zhengyi Luo, Kostas Daniilidis, Ameesh Makadia

View PDF

Abstract:Spherical convolutional networks have been introduced recently as tools to learn powerful feature representations of 3D shapes. Spherical CNNs are equivariant to 3D rotations making them ideally suited to applications where 3D data may be observed in arbitrary orientations. In this paper we learn 2D image embeddings with a similar equivariant structure: embedding the image of a 3D object should commute with rotations of the object. We introduce a cross-domain embedding from 2D images into a spherical CNN latent space. This embedding encodes images with 3D shape properties and is equivariant to 3D rotations of the observed object. The model is supervised only by target embeddings obtained from a spherical CNN pretrained for 3D shape classification. We show that learning a rich embedding for images with appropriate geometric structure is sufficient for tackling varied applications, such as relative pose estimation and novel view synthesis, without requiring additional task-specific supervision.

Comments:	Accepted to the International Conference on Machine Learning, ICML 2019
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1812.02716 [cs.CV]
	(or arXiv:1812.02716v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1812.02716

Submission history

From: Carlos Esteves [view email]
[v1] Thu, 6 Dec 2018 18:51:12 UTC (3,737 KB)
[v2] Tue, 14 May 2019 19:21:59 UTC (5,819 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Cross-Domain 3D Equivariant Image Embeddings

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Cross-Domain 3D Equivariant Image Embeddings

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators