Latent Transformations for Object View Points Synthesis

Kim, Sangpil; Winovich, Nick; Lin, Guang; Ramani, Karthik

Computer Science > Computer Vision and Pattern Recognition

arXiv:1807.04812 (cs)

[Submitted on 12 Jul 2018 (v1), last revised 28 Nov 2018 (this version, v4)]

Title:Latent Transformations for Object View Points Synthesis

Authors:Sangpil Kim, Nick Winovich, Guang Lin, Karthik Ramani

View PDF

Abstract:We propose a fully-convolutional conditional generative model, the latent transformation neural network (LTNN), capable of view synthesis using a light-weight neural network suited for real-time applications. In contrast to existing conditional generative models which incorporate conditioning information via concatenation, we introduce a dedicated network component, the conditional transformation unit (CTU), designed to learn the latent space transformations corresponding to specified target views. In addition, a consistency loss term is defined to guide the network toward learning the desired latent space mappings, a task-divided decoder is constructed to refine the quality of generated views, and an adaptive discriminator is introduced to improve the adversarial training process. The generality of the proposed methodology is demonstrated on a collection of three diverse tasks: multi-view reconstruction on real hand depth images, view synthesis of real and synthetic faces, and the rotation of rigid objects. The proposed model is shown to exceed state-of-the-art results in each category while simultaneously achieving a reduction in the computational demand required for inference by 30% on average.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1807.04812 [cs.CV]
	(or arXiv:1807.04812v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1807.04812

Submission history

From: Sangpil Kim [view email]
[v1] Thu, 12 Jul 2018 20:46:43 UTC (6,047 KB)
[v2] Wed, 18 Jul 2018 00:47:23 UTC (6,047 KB)
[v3] Fri, 31 Aug 2018 14:27:40 UTC (6,047 KB)
[v4] Wed, 28 Nov 2018 17:52:39 UTC (9,327 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Latent Transformations for Object View Points Synthesis

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Latent Transformations for Object View Points Synthesis

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators