Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–11 of 11 results for author: Abrevaya, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.15228  [pdf, other

    cs.CV cs.CL

    Re-Thinking Inverse Graphics With Large Language Models

    Authors: Peter Kulits, Haiwen Feng, Weiyang Liu, Victoria Abrevaya, Michael J. Black

    Abstract: Inverse graphics -- the task of inverting an image into physical variables that, when rendered, enable reproduction of the observed scene -- is a fundamental challenge in computer vision and graphics. Disentangling an image into its constituent elements, such as the shape, color, and material properties of the objects of the 3D scene that produced it, requires a comprehensive understanding of the… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 31 pages; project page: https://ig-llm.is.tue.mpg.de/

  2. arXiv:2404.13040  [pdf, other

    cs.CV cs.LG

    Analysis of Classifier-Free Guidance Weight Schedulers

    Authors: Xi Wang, Nicolas Dufour, Nefeli Andreou, Marie-Paule Cani, Victoria Fernandez Abrevaya, David Picard, Vicky Kalogeiton

    Abstract: Classifier-Free Guidance (CFG) enhances the quality and condition adherence of text-to-image diffusion models. It operates by combining the conditional and unconditional predictions using a fixed weight. However, recent works vary the weights throughout the diffusion process, reporting superior results but without providing any rationale or analysis. By conducting comprehensive experiments, this p… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  3. arXiv:2404.04104  [pdf, other

    cs.CV

    3D Facial Expressions through Analysis-by-Neural-Synthesis

    Authors: George Retsinas, Panagiotis P. Filntisis, Radek Danecek, Victoria F. Abrevaya, Anastasios Roussos, Timo Bolkart, Petros Maragos

    Abstract: While existing methods for 3D face reconstruction from in-the-wild images excel at recovering the overall face shape, they commonly miss subtle, extreme, asymmetric, or rarely observed expressions. We improve upon these methods with SMIRK (Spatial Modeling for Image-based Reconstruction of Kinesics), which faithfully reconstructs expressive 3D faces from images. We identify two key limitations in… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  4. arXiv:2403.14611  [pdf, other

    cs.CV

    Explorative Inbetweening of Time and Space

    Authors: Haiwen Feng, Zheng Ding, Zhihao Xia, Simon Niklaus, Victoria Abrevaya, Michael J. Black, Xuaner Zhang

    Abstract: We introduce bounded generation as a generalized task to control video generation to synthesize arbitrary camera and subject motion based only on a given start and end frame. Our objective is to fully leverage the inherent generalization capability of an image-to-video model without additional training or fine-tuning of the original model. This is achieved through the proposed new sampling strateg… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: project page at https://time-reversal.github.io

  5. arXiv:2307.09882  [pdf, other

    cs.LG cs.AI

    Adversarial Likelihood Estimation With One-Way Flows

    Authors: Omri Ben-Dov, Pravir Singh Gupta, Victoria Abrevaya, Michael J. Black, Partha Ghosh

    Abstract: Generative Adversarial Networks (GANs) can produce high-quality samples, but do not provide an estimate of the probability density around the samples. However, it has been noted that maximizing the log-likelihood within an energy-based setting can lead to an adversarial framework where the discriminator provides unnormalized density (often called energy). We further develop this perspective, incor… ▽ More

    Submitted 2 October, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

  6. arXiv:2304.10528  [pdf, other

    cs.CV

    Generalizing Neural Human Fitting to Unseen Poses With Articulated SE(3) Equivariance

    Authors: Haiwen Feng, Peter Kulits, Shichen Liu, Michael J. Black, Victoria Abrevaya

    Abstract: We address the problem of fitting a parametric human body model (SMPL) to point cloud data. Optimization-based methods require careful initialization and are prone to becoming trapped in local optima. Learning-based methods address this but do not generalize well when the input pose is far from those seen during training. For rigid point clouds, remarkable generalization has been achieved by lever… ▽ More

    Submitted 19 September, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: Accepted at ICCV 2023 as an oral presentation. Project page: https://arteq.is.tue.mpg.de ; Update V2: Camera-Ready version, fix metric issues and numeric bug of ID performance

  7. arXiv:2206.11563  [pdf, other

    cs.LG cs.AI

    LED: Latent Variable-based Estimation of Density

    Authors: Omri Ben-Dov, Pravir Singh Gupta, Victoria Fernandez Abrevaya, Michael J. Black, Partha Ghosh

    Abstract: Modern generative models are roughly divided into two main categories: (1) models that can produce high-quality random samples, but cannot estimate the exact density of new data points and (2) those that provide exact density estimation, at the expense of sample quality and compactness of the latent space. In this work we propose LED, a new generative model closely related to GANs, that allows not… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

  8. arXiv:2205.03962  [pdf, other

    cs.CV

    Towards Racially Unbiased Skin Tone Estimation via Scene Disambiguation

    Authors: Haiwen Feng, Timo Bolkart, Joachim Tesch, Michael J. Black, Victoria Abrevaya

    Abstract: Virtual facial avatars will play an increasingly important role in immersive communication, games and the metaverse, and it is therefore critical that they be inclusive. This requires accurate recovery of the appearance, represented by albedo, regardless of age, sex, or ethnicity. While significant progress has been made on estimating 3D facial geometry, albedo estimation has received less attenti… ▽ More

    Submitted 23 July, 2022; v1 submitted 8 May, 2022; originally announced May 2022.

    Comments: Camera-Ready version, accepted at ECCV2022

  9. arXiv:2112.07471  [pdf, other

    cs.CV

    I M Avatar: Implicit Morphable Head Avatars from Videos

    Authors: Yufeng Zheng, Victoria Fernández Abrevaya, Marcel C. Bühler, Xu Chen, Michael J. Black, Otmar Hilliges

    Abstract: Traditional 3D morphable face models (3DMMs) provide fine-grained control over expression but cannot easily capture geometric and appearance details. Neural volumetric representations approach photorealism but are hard to animate and do not generalize well to unseen expressions. To tackle this problem, we propose IMavatar (Implicit Morphable avatar), a novel method for learning implicit head avata… ▽ More

    Submitted 4 November, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: Accepted at CVPR 2022 as an oral presentation. Project page https://ait.ethz.ch/projects/2022/IMavatar/ ; Github page: https://github.com/zhengyuf/IMavatar

  10. arXiv:2003.09691  [pdf, other

    cs.CV cs.LG eess.IV

    Cross-modal Deep Face Normals with Deactivable Skip Connections

    Authors: Victoria Fernandez Abrevaya, Adnane Boukhayma, Philip H. S. Torr, Edmond Boyer

    Abstract: We present an approach for estimating surface normals from in-the-wild color images of faces. While data-driven strategies have been proposed for single face images, limited available ground truth data makes this problem difficult. To alleviate this issue, we propose a method that can leverage all available image and normal data, whether paired or not, thanks to a novel cross-modal learning archit… ▽ More

    Submitted 30 March, 2020; v1 submitted 21 March, 2020; originally announced March 2020.

    Comments: CVPR 2020

  11. arXiv:1902.03619  [pdf, other

    cs.CV cs.LG

    A Decoupled 3D Facial Shape Model by Adversarial Training

    Authors: Victoria Fernandez Abrevaya, Adnane Boukhayma, Stefanie Wuhrer, Edmond Boyer

    Abstract: Data-driven generative 3D face models are used to compactly encode facial shape data into meaningful parametric representations. A desirable property of these models is their ability to effectively decouple natural sources of variation, in particular identity and expression. While factorized representations have been proposed for that purpose, they are still limited in the variability they can cap… ▽ More

    Submitted 7 September, 2019; v1 submitted 10 February, 2019; originally announced February 2019.

    Comments: camera-ready version for ICCV'19