
Showing 1–9 of 9 results for author: Jakab, T

  1. arXiv:2401.02400  [pdf, other]

    cs.CV

    Learning the 3D Fauna of the Web

    Authors: Zizhang Li, Dor Litvak, Ruining Li, Yunzhi Zhang, Tomas Jakab, Christian Rupprecht, Shangzhe Wu, Andrea Vedaldi, Jiajun Wu

    Abstract: Learning 3D models of all animals on the Earth requires massively scaling up existing solutions. With this ultimate goal in mind, we develop 3D-Fauna, an approach that learns a pan-category deformable 3D animal model for more than 100 animal species jointly. One crucial bottleneck of modeling animals is the limited availability of training data, which we overcome by simply learning from 2D Interne…

    Submitted 1 April, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

    Comments: The first two authors contributed equally to this work. The last three authors contributed equally. Project page: https://kyleleey.github.io/3DFauna/

  2. arXiv:2312.12419  [pdf, other]

    cs.CV

    Scene-Conditional 3D Object Stylization and Composition

    Authors: Jinghao Zhou, Tomas Jakab, Philip Torr, Christian Rupprecht

    Abstract: Recently, 3D generative models have made impressive progress, enabling the generation of almost arbitrary 3D assets from text or image inputs. However, these approaches generate objects in isolation without any consideration for the scene where they will eventually be placed. In this paper, we propose a framework that allows for the stylization of an existing 3D asset to fit into a given 2D scene,…

    Submitted 19 December, 2023; originally announced December 2023.

  3. arXiv:2312.02350  [pdf, other]

    cs.CV

    Instant Uncertainty Calibration of NeRFs Using a Meta-calibrator

    Authors: Niki Amini-Naieni, Tomas Jakab, Andrea Vedaldi, Ronald Clark

    Abstract: Although Neural Radiance Fields (NeRFs) have markedly improved novel view synthesis, accurate uncertainty quantification in their image predictions remains an open problem. The prevailing methods for estimating uncertainty, including the state-of-the-art Density-aware NeRF Ensembles (DANE) [29], quantify uncertainty without calibration. This frequently leads to over- or under-confidence in image p…

    Submitted 19 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  4. arXiv:2304.10535  [pdf, other]

    cs.CV

    Farm3D: Learning Articulated 3D Animals by Distilling 2D Diffusion

    Authors: Tomas Jakab, Ruining Li, Shangzhe Wu, Christian Rupprecht, Andrea Vedaldi

    Abstract: We present Farm3D, a method for learning category-specific 3D reconstructors for articulated objects, relying solely on "free" virtual supervision from a pre-trained 2D diffusion-based image generator. Recent approaches can learn a monocular network that predicts the 3D shape, albedo, illumination, and viewpoint of any object occurrence, given a collection of single-view images of an object catego…

    Submitted 14 May, 2024; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: In 3DV 2024, Project page: http://farm3d.github.io

  5. arXiv:2211.12497  [pdf, other]

    cs.CV

    MagicPony: Learning Articulated 3D Animals in the Wild

    Authors: Shangzhe Wu, Ruining Li, Tomas Jakab, Christian Rupprecht, Andrea Vedaldi

    Abstract: We consider the problem of predicting the 3D shape, articulation, viewpoint, texture, and lighting of an articulated animal like a horse given a single test image as input. We present a new method, dubbed MagicPony, that learns this predictor purely from in-the-wild single-view images of the object category, with minimal assumptions about the topology of deformation. At its core is an implicit-exp…

    Submitted 3 April, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: CVPR 2023. Project Page: https://3dmagicpony.github.io/

  6. arXiv:2107.10844  [pdf, other]

    cs.CV

    DOVE: Learning Deformable 3D Objects by Watching Videos

    Authors: Shangzhe Wu, Tomas Jakab, Christian Rupprecht, Andrea Vedaldi

    Abstract: Learning deformable 3D objects from 2D images is often an ill-posed problem. Existing methods rely on explicit supervision to establish multi-view correspondences, such as template shape models and keypoint annotations, which restricts their applicability on objects "in the wild". A more natural way of establishing correspondences is by watching videos of objects moving around. In this paper, we p…

    Submitted 29 June, 2022; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: Project Page: https://dove3d.github.io/

  7. arXiv:2104.11224  [pdf, other]

    cs.CV cs.GR

    KeypointDeformer: Unsupervised 3D Keypoint Discovery for Shape Control

    Authors: Tomas Jakab, Richard Tucker, Ameesh Makadia, Jiajun Wu, Noah Snavely, Angjoo Kanazawa

    Abstract: We introduce KeypointDeformer, a novel unsupervised method for shape control through automatically discovered 3D keypoints. We cast this as the problem of aligning a source 3D object to a target 3D object from the same object category. Our method analyzes the difference between the shapes of the two objects by comparing their latent representations. This latent representation is in the form of 3D…

    Submitted 22 April, 2021; originally announced April 2021.

    Comments: CVPR 2021 (oral). Project page: http://tomasjakab.github.io/KeypointDeformer

  8. Self-supervised Learning of Interpretable Keypoints from Unlabelled Videos

    Authors: Tomas Jakab, Ankush Gupta, Hakan Bilen, Andrea Vedaldi

    Abstract: We propose KeypointGAN, a new method for recognizing the pose of objects from a single image that for learning uses only unlabelled videos and a weak empirical prior on the object poses. Video frames differ primarily in the pose of the objects they contain, so our method distils the pose information by analyzing the differences between frames. The distillation uses a new dual representation of the…

    Submitted 23 December, 2020; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: CVPR 2020 (oral). Project page: http://www.robots.ox.ac.uk/~vgg/research/unsupervised_pose/

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 8787-8797

  9. arXiv:1806.07823  [pdf, other]

    cs.CV

    Unsupervised Learning of Object Landmarks through Conditional Image Generation

    Authors: Tomas Jakab, Ankush Gupta, Hakan Bilen, Andrea Vedaldi

    Abstract: We propose a method for learning landmark detectors for visual objects (such as the eyes and the nose in a face) without any manual supervision. We cast this as the problem of generating images that combine the appearance of the object as seen in a first example image with the geometry of the object as seen in a second example image, where the two examples differ by a viewpoint change and/or an ob…

    Submitted 13 December, 2018; v1 submitted 20 June, 2018; originally announced June 2018.

    Comments: In NeurIPS 2018. Project page: http://www.robots.ox.ac.uk/~vgg/research/unsupervised_landmarks/