Showing 1–25 of 25 results for author: Makadia, A

  1. arXiv:2312.09250  [pdf, other]

    cs.CV cs.GR cs.LG

    Single Mesh Diffusion Models with Field Latents for Texture Generation

    Authors: Thomas W. Mitchel, Carlos Esteves, Ameesh Makadia

    Abstract: We introduce a framework for intrinsic latent diffusion models operating directly on the surfaces of 3D shapes, with the goal of synthesizing high-quality textures. Our approach is underpinned by two contributions: field latents, a latent representation encoding textures as discrete vector fields on the mesh vertices, and field latent diffusion models, which learn to denoise a diffusion process in…

    Submitted 28 May, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: CVPR 2024. Code and additional visualizations available: https://single-mesh-diffusion.github.io/

  2. arXiv:2309.16672  [pdf, other]

    cs.CV cs.LG

    Learning to Transform for Generalizable Instance-wise Invariance

    Authors: Utkarsh Singhal, Carlos Esteves, Ameesh Makadia, Stella X. Yu

    Abstract: Computer vision research has long aimed to build systems that are robust to spatial transformations found in natural data. Traditionally, this is done using data augmentation or hard-coding invariances into the architecture. However, too much or too little invariance can hurt, and the correct amount is unknown a priori and dependent on the instance. Ideally, the appropriate invariance would be lea…

    Submitted 15 February, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: Accepted to ICCV 2023

  3. arXiv:2306.09109  [pdf, other]

    cs.CV

    NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

    Authors: Varun Jampani, Kevis-Kokitsi Maninis, Andreas Engelhardt, Arjun Karpur, Karen Truong, Kyle Sargent, Stefan Popov, André Araujo, Ricardo Martin-Brualla, Kaushal Patel, Daniel Vlasic, Vittorio Ferrari, Ameesh Makadia, Ce Liu, Yuanzhen Li, Howard Zhou

    Abstract: Recent advances in neural reconstruction enable high-quality 3D object reconstruction from casually captured image collections. Current techniques mostly analyze their progress on relatively simple image collections where Structure-from-Motion (SfM) techniques can provide ground-truth (GT) camera poses. We note that SfM techniques tend to fail on in-the-wild image collections such as image search…

    Submitted 13 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 camera ready. Project page: https://navidataset.github.io

  4. arXiv:2306.05420  [pdf, other]

    cs.LG cs.CV

    Scaling Spherical CNNs

    Authors: Carlos Esteves, Jean-Jacques Slotine, Ameesh Makadia

    Abstract: Spherical CNNs generalize CNNs to functions on the sphere, by using spherical convolutions as the main linear operation. The most accurate and efficient way to compute spherical convolutions is in the spectral domain (via the convolution theorem), which is still costlier than the usual planar convolutions. For this reason, applications of spherical CNNs have so far been limited to small problems t…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted to ICML'23

  5. arXiv:2306.05410  [pdf, other]

    cs.CV

    LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs

    Authors: Zezhou Cheng, Carlos Esteves, Varun Jampani, Abhishek Kar, Subhransu Maji, Ameesh Makadia

    Abstract: A critical obstacle preventing NeRF models from being deployed broadly in the wild is their reliance on accurate camera poses. Consequently, there is growing interest in extending NeRF models to jointly optimize camera poses and scene representation, which offers an alternative to off-the-shelf SfM pipelines which have well-understood failure modes. Existing approaches for unposed NeRF operate und…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Project website: https://people.cs.umass.edu/~zezhoucheng/lu-nerf/

  6. arXiv:2303.16201  [pdf, other]

    cs.CV cs.AI cs.LG

    ASIC: Aligning Sparse in-the-wild Image Collections

    Authors: Kamal Gupta, Varun Jampani, Carlos Esteves, Abhinav Shrivastava, Ameesh Makadia, Noah Snavely, Abhishek Kar

    Abstract: We present a method for joint alignment of sparse in-the-wild image collections of an object category. Most prior works assume either ground-truth keypoint annotations or a large dataset of images of a single object category. However, neither of the above assumptions hold true for the long-tail of the objects present in the world. We present a self-supervised technique that directly optimizes on a…

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: Web: https://kampta.github.io/asic

  7. Stable Object Reorientation using Contact Plane Registration

    Authors: Richard Li, Carlos Esteves, Ameesh Makadia, Pulkit Agrawal

    Abstract: We present a system for accurately predicting stable orientations for diverse rigid objects. We propose to overcome the critical issue of modelling multimodality in the space of rotations by using a conditional generative model to accurately classify contact surfaces. Our system is capable of operating from noisy and partially-observed pointcloud observations captured by real world depth cameras.…

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: 7 pages, 1 additional page for references

    Journal ref: 2022 International Conference on Robotics and Automation (ICRA), 2022, pp. 6379-6385

  8. arXiv:2207.10662  [pdf, other]

    cs.CV

    Generalizable Patch-Based Neural Rendering

    Authors: Mohammed Suhail, Carlos Esteves, Leonid Sigal, Ameesh Makadia

    Abstract: Neural rendering has received tremendous attention since the advent of Neural Radiance Fields (NeRF), and has pushed the state-of-the-art on novel-view synthesis considerably. The recent focus has been on models that overfit to a single scene, and the few attempts to learn models that can synthesize novel views of unseen scenes mostly consist of combining deep convolutional features with a NeRF-li…

    Submitted 28 July, 2022; v1 submitted 21 July, 2022; originally announced July 2022.

    Comments: Project Page with code and results at https://mohammedsuhail.net/gen_patch_neural_rendering/

  9. arXiv:2112.09687  [pdf, other]

    cs.CV

    Light Field Neural Rendering

    Authors: Mohammed Suhail, Carlos Esteves, Leonid Sigal, Ameesh Makadia

    Abstract: Classical light field rendering for novel view synthesis can accurately reproduce view-dependent effects such as reflection, refraction, and translucency, but requires a dense view sampling of the scene. Methods based on geometric reconstruction need only sparse views, but cannot accurately model non-Lambertian effects. We introduce a model that combines the strengths and mitigates the limitations…

    Submitted 28 March, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: Project page with code and videos at https://light-field-neural-rendering.github.io

  10. arXiv:2106.05965  [pdf, other]

    cs.CV

    Implicit-PDF: Non-Parametric Representation of Probability Distributions on the Rotation Manifold

    Authors: Kieran Murphy, Carlos Esteves, Varun Jampani, Srikumar Ramalingam, Ameesh Makadia

    Abstract: Single image pose estimation is a fundamental problem in many vision and robotics tasks, and existing deep learning approaches suffer by not completely modeling and handling: i) uncertainty about the predictions, and ii) symmetric objects with multiple (sometimes infinite) correct poses. To this end, we introduce a method to estimate arbitrary, non-parametric distributions on SO(3). Our key idea i…

    Submitted 1 July, 2022; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: Additional implementation details

  11. arXiv:2106.03336  [pdf, other]

    cs.CV

    Wide-Baseline Relative Camera Pose Estimation with Directional Learning

    Authors: Kefan Chen, Noah Snavely, Ameesh Makadia

    Abstract: Modern deep learning techniques that regress the relative camera pose between two images have difficulty dealing with challenging scenarios, such as large camera motions resulting in occlusions and significant changes in perspective that leave little overlap between images. These models continue to struggle even with the benefit of large supervised training datasets. To address the limitations of…

    Submitted 7 June, 2021; originally announced June 2021.

  12. arXiv:2104.11224  [pdf, other]

    cs.CV cs.GR

    KeypointDeformer: Unsupervised 3D Keypoint Discovery for Shape Control

    Authors: Tomas Jakab, Richard Tucker, Ameesh Makadia, Jiajun Wu, Noah Snavely, Angjoo Kanazawa

    Abstract: We introduce KeypointDeformer, a novel unsupervised method for shape control through automatically discovered 3D keypoints. We cast this as the problem of aligning a source 3D object to a target 3D object from the same object category. Our method analyzes the difference between the shapes of the two objects by comparing their latent representations. This latent representation is in the form of 3D…

    Submitted 22 April, 2021; originally announced April 2021.

    Comments: CVPR 2021 (oral). Project page: http://tomasjakab.github.io/KeypointDeformer

  13. arXiv:2104.03954  [pdf, other]

    cs.CV cs.GR

    De-rendering the World's Revolutionary Artefacts

    Authors: Shangzhe Wu, Ameesh Makadia, Jiajun Wu, Noah Snavely, Richard Tucker, Angjoo Kanazawa

    Abstract: Recent works have shown exciting results in unsupervised image de-rendering -- learning to decompose 3D shape, appearance, and lighting from single-image collections without explicit supervision. However, many of these assume simplistic material and lighting models. We propose a method, termed RADAR, that can recover environment illumination and surface materials from real single-image collections…

    Submitted 31 August, 2021; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: CVPR 2021. Project page: https://sorderender.github.io/

  14. arXiv:2103.03240  [pdf, other]

    cs.LG cs.CV

    Learning ABCs: Approximate Bijective Correspondence for isolating factors of variation with weak supervision

    Authors: Kieran A. Murphy, Varun Jampani, Srikumar Ramalingam, Ameesh Makadia

    Abstract: Representational learning forms the backbone of most deep learning applications, and the value of a learned representation is intimately tied to its information content regarding different factors of variation. Finding good representations depends on the nature of supervision and the learning algorithm. We propose a novel algorithm that utilizes a weak form of supervision where the data is partiti…

    Submitted 30 March, 2022; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: CVPR 2022. Code: https://github.com/google-research/google-research/tree/master/isolating_factors

  15. arXiv:2012.09855  [pdf, other]

    cs.CV cs.GR

    Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image

    Authors: Andrew Liu, Richard Tucker, Varun Jampani, Ameesh Makadia, Noah Snavely, Angjoo Kanazawa

    Abstract: We introduce the problem of perpetual view generation - long-range generation of novel views corresponding to an arbitrarily long camera trajectory given a single image. This is a challenging problem that goes far beyond the capabilities of current view synthesis methods, which quickly degenerate when presented with large camera motions. Methods for video generation also have limited ability to pr…

    Submitted 30 November, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: ICCV 2021 (oral); Project page: https://infinite-nature.github.io/; Video: https://www.youtube.com/watch?v=oXUf6anNAtc

  16. arXiv:2006.14616  [pdf, ps, other]

    cs.CV

    An Analysis of SVD for Deep Rotation Estimation

    Authors: Jake Levinson, Carlos Esteves, Kefan Chen, Noah Snavely, Angjoo Kanazawa, Afshin Rostamizadeh, Ameesh Makadia

    Abstract: Symmetric orthogonalization via SVD, and closely related procedures, are well-known techniques for projecting matrices onto $O(n)$ or $SO(n)$. These tools have long been used for applications in computer vision, for example optimal 3D alignment problems solved by orthogonal Procrustes, rotation averaging, or Essential matrix decomposition. Despite its utility in different settings, SVD orthogonali…

    Submitted 25 June, 2020; originally announced June 2020.

  17. arXiv:2006.10731  [pdf, other]

    cs.CV cs.LG

    Spin-Weighted Spherical CNNs

    Authors: Carlos Esteves, Ameesh Makadia, Kostas Daniilidis

    Abstract: Learning equivariant representations is a promising way to reduce sample and model complexity and improve the generalization performance of deep neural networks. The spherical CNNs are successful examples, producing SO(3)-equivariant representations of spherical inputs. There are two main types of spherical CNNs. The first type lifts the inputs to functions on the rotation group SO(3) and applies…

    Submitted 26 October, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Accepted to NeurIPS'20

  18. arXiv:2005.01906  [pdf, other]

    cs.LG stat.ML

    Time Dependence in Non-Autonomous Neural ODEs

    Authors: Jared Quincy Davis, Krzysztof Choromanski, Jake Varley, Honglak Lee, Jean-Jacques Slotine, Valerii Likhosterov, Adrian Weller, Ameesh Makadia, Vikas Sindhwani

    Abstract: Neural Ordinary Differential Equations (ODEs) are elegant reinterpretations of deep networks where continuous time can replace the discrete notion of depth, ODE solvers perform forward propagation, and the adjoint method enables efficient, constant memory backpropagation. Neural ODEs are universal approximators only when they are non-autonomous, that is, the dynamics depends explicitly on time. We…

    Submitted 6 May, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

  19. arXiv:2003.08981  [pdf, other]

    cs.CV cs.CG cs.LG

    Local Implicit Grid Representations for 3D Scenes

    Authors: Chiyu Max Jiang, Avneesh Sud, Ameesh Makadia, Jingwei Huang, Matthias Nießner, Thomas Funkhouser

    Abstract: Shape priors learned from data are commonly used to reconstruct 3D objects from partial or noisy data. Yet no such shape priors are available for indoor scenes, since typical 3D autoencoders cannot handle their scale, complexity, or diversity. In this paper, we introduce Local Implicit Grid Representations, a new 3D shape representation designed for scalability and generality. The motivating idea…

    Submitted 19 March, 2020; originally announced March 2020.

    Comments: CVPR 2020. Supplementary Video: https://youtu.be/XCyl1-vxfII

  20. arXiv:1906.03281  [pdf, other]

    cs.LG stat.ML

    Latent feature disentanglement for 3D meshes

    Authors: Jake Levinson, Avneesh Sud, Ameesh Makadia

    Abstract: Generative modeling of 3D shapes has become an important problem due to its relevance to many applications across Computer Vision, Graphics, and VR. In this paper we build upon recently introduced 3D mesh-convolutional Variational AutoEncoders which have shown great promise for learning rich representations of deformable 3D shapes. We introduce a supervised generative 3D mesh model that disentangl…

    Submitted 7 June, 2019; originally announced June 2019.

  21. arXiv:1812.02716  [pdf, other]

    cs.CV

    Cross-Domain 3D Equivariant Image Embeddings

    Authors: Carlos Esteves, Avneesh Sud, Zhengyi Luo, Kostas Daniilidis, Ameesh Makadia

    Abstract: Spherical convolutional networks have been introduced recently as tools to learn powerful feature representations of 3D shapes. Spherical CNNs are equivariant to 3D rotations making them ideally suited to applications where 3D data may be observed in arbitrary orientations. In this paper we learn 2D image embeddings with a similar equivariant structure: embedding the image of a 3D object should co…

    Submitted 14 May, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: Accepted to the International Conference on Machine Learning, ICML 2019

  22. arXiv:1809.02123  [pdf, other]

    cs.CV

    Labeling Panoramas with Spherical Hourglass Networks

    Authors: Carlos Esteves, Kostas Daniilidis, Ameesh Makadia

    Abstract: With the recent proliferation of consumer-grade 360° cameras, it is worth revisiting visual perception challenges with spherical cameras given the potential benefit of their global field of view. To this end we introduce a spherical convolutional hourglass network (SCHN) for the dense labeling on the sphere. The SCHN is invariant to camera orientation (lifting the usual requirement for `upright' p…

    Submitted 6 September, 2018; originally announced September 2018.

    Comments: Accepted to the 360° Perception and Interaction Workshop at ECCV 2018

  23. arXiv:1712.00268  [pdf, other]

    cs.CV

    Deformable Shape Completion with Graph Convolutional Autoencoders

    Authors: Or Litany, Alex Bronstein, Michael Bronstein, Ameesh Makadia

    Abstract: The availability of affordable and portable depth sensors has made scanning objects and people simpler than ever. However, dealing with occlusions and missing parts is still a significant challenge. The problem of reconstructing a (possibly non-rigidly moving) 3D object from a single or multiple partial scans has received increasing attention in recent years. In this work, we propose a novel learn…

    Submitted 3 April, 2018; v1 submitted 1 December, 2017; originally announced December 2017.

    Comments: CVPR 2018

  24. arXiv:1711.06721  [pdf, other]

    cs.CV

    Learning SO(3) Equivariant Representations with Spherical CNNs

    Authors: Carlos Esteves, Christine Allen-Blanchette, Ameesh Makadia, Kostas Daniilidis

    Abstract: We address the problem of 3D rotation equivariance in convolutional neural networks. 3D rotations have been a challenging nuisance in 3D classification tasks requiring higher capacity and extended data augmentation in order to tackle it. We model 3D data with multi-valued spherical functions and we propose a novel spherical convolutional network that implements exact convolutions on the sphere by…

    Submitted 27 September, 2018; v1 submitted 17 November, 2017; originally announced November 2017.

    Comments: Camera-ready. Accepted to ECCV'18 as oral presentation

  25. arXiv:1611.07369  [pdf, other]

    math.OC cs.CG cs.CV cs.GR

    Geometry of 3D Environments and Sum of Squares Polynomials

    Authors: Amir Ali Ahmadi, Georgina Hall, Ameesh Makadia, Vikas Sindhwani

    Abstract: Motivated by applications in robotics and computer vision, we study problems related to spatial reasoning of a 3D environment using sublevel sets of polynomials. These include: tightly containing a cloud of points (e.g., representing an obstacle) with convex or nearly-convex basic semialgebraic sets, computation of Euclidean distances between two such sets, separation of two convex basic semalgebr…

    Submitted 7 March, 2017; v1 submitted 22 November, 2016; originally announced November 2016.