Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–14 of 14 results for author: Gehler, P V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2207.09239  [pdf, other

    cs.LG stat.ML

    Assaying Out-Of-Distribution Generalization in Transfer Learning

    Authors: Florian Wenzel, Andrea Dittadi, Peter Vincent Gehler, Carl-Johann Simon-Gabriel, Max Horn, Dominik Zietlow, David Kernert, Chris Russell, Thomas Brox, Bernt Schiele, Bernhard Schölkopf, Francesco Locatello

    Abstract: Since out-of-distribution generalization is a generally ill-posed problem, various proxy targets (e.g., calibration, adversarial robustness, algorithmic corruptions, invariance across shifts) were studied across different research programs resulting in different recommendations. While sharing the same aspirational goal, these approaches have never been tested under the same experimental conditions… ▽ More

    Submitted 21 October, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

  2. arXiv:1909.03677  [pdf, other

    cs.CV

    Learning Task-Specific Generalized Convolutions in the Permutohedral Lattice

    Authors: Anne S. Wannenwetsch, Martin Kiefel, Peter V. Gehler, Stefan Roth

    Abstract: Dense prediction tasks typically employ encoder-decoder architectures, but the prevalent convolutions in the decoder are not image-adaptive and can lead to boundary artifacts. Different generalized convolution operations have been introduced to counteract this. We go beyond these by leveraging guidance data to redefine their inherent notion of proximity. Our proposed network layer builds on the pe… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: To appear at GCPR 2019

  3. arXiv:1808.05942  [pdf, other

    cs.CV

    Neural Body Fitting: Unifying Deep Learning and Model-Based Human Pose and Shape Estimation

    Authors: Mohamed Omran, Christoph Lassner, Gerard Pons-Moll, Peter V. Gehler, Bernt Schiele

    Abstract: Direct prediction of 3D body pose and shape remains a challenge even for highly parameterized deep learning models. Mapping from the 2D image space to the prediction space is difficult: perspective ambiguities make the loss function noisy and training data is scarce. In this paper, we propose a novel approach (Neural Body Fitting (NBF)). It integrates a statistical body model within a CNN, leverag… ▽ More

    Submitted 17 August, 2018; originally announced August 2018.

    Comments: 3DV 2018

  4. arXiv:1708.03088  [pdf, other

    cs.CV

    Semantic Video CNNs through Representation Warping

    Authors: Raghudeep Gadde, Varun Jampani, Peter V. Gehler

    Abstract: In this work, we propose a technique to convert CNN models for semantic segmentation of static images into CNNs for video data. We describe a warping method that can be used to augment existing architectures with very little extra computational cost. This module is called NetWarp and we demonstrate its use for a range of network architectures. The main design principle is to use optical flow of ad… ▽ More

    Submitted 10 August, 2017; originally announced August 2017.

    Comments: ICCV 2017

  5. arXiv:1707.07548  [pdf, other

    cs.CV

    Towards Accurate Markerless Human Shape and Pose Estimation over Time

    Authors: Yinghao Huang, Federica Bogo, Christoph Lassner, Angjoo Kanazawa, Peter V. Gehler, Ijaz Akhter, Michael J. Black

    Abstract: Existing marker-less motion capture methods often assume known backgrounds, static cameras, and sequence specific motion priors, which narrows its application scenarios. Here we propose a fully automatic method that given multi-view video, estimates 3D human motion and body shape. We take recent SMPLify \cite{bogo2016keep} as the base method, and extend it in several ways. First we fit the body to… ▽ More

    Submitted 30 April, 2018; v1 submitted 24 July, 2017; originally announced July 2017.

    Comments: 10 pages, 6 figures, 5 tables, published in 3DV-2017

  6. arXiv:1705.04098  [pdf, other

    cs.CV

    A Generative Model of People in Clothing

    Authors: Christoph Lassner, Gerard Pons-Moll, Peter V. Gehler

    Abstract: We present the first image-based generative model of people in clothing for the full body. We sidestep the commonly used complex graphics rendering pipeline and the need for high-quality 3D scans of dressed people. Instead, we learn generative models from a large image database. The main challenge is to cope with the high variance in human pose, shape and appearance. For this reason, pure image-ba… ▽ More

    Submitted 31 July, 2017; v1 submitted 11 May, 2017; originally announced May 2017.

  7. arXiv:1701.02468  [pdf, other

    cs.CV

    Unite the People: Closing the Loop Between 3D and 2D Human Representations

    Authors: Christoph Lassner, Javier Romero, Martin Kiefel, Federica Bogo, Michael J. Black, Peter V. Gehler

    Abstract: 3D models provide a common ground for different representations of human bodies. In turn, robust 2D estimation has proven to be a powerful tool to obtain 3D fits "in-the- wild". However, depending on the level of detail, it can be hard to impossible to acquire labeled data for training 2D estimators on large scale. We propose a hybrid approach to this problem: with an extended version of the recen… ▽ More

    Submitted 24 July, 2017; v1 submitted 10 January, 2017; originally announced January 2017.

  8. arXiv:1612.05478  [pdf, other

    cs.CV

    Video Propagation Networks

    Authors: Varun Jampani, Raghudeep Gadde, Peter V. Gehler

    Abstract: We propose a technique that propagates information forward through video data. The method is conceptually simple and can be applied to tasks that require the propagation of structured information, such as semantic labels, based on video content. We propose a 'Video Propagation Network' that processes video frames in an adaptive manner. The model is applied online: it propagates information forward… ▽ More

    Submitted 11 April, 2017; v1 submitted 16 December, 2016; originally announced December 2016.

    Comments: Appearing in Computer Vision and Pattern Recognition, 2017 (CVPR'17)

  9. arXiv:1612.05062  [pdf, other

    cs.CV

    Reflectance Adaptive Filtering Improves Intrinsic Image Estimation

    Authors: Thomas Nestmeyer, Peter V. Gehler

    Abstract: Separating an image into reflectance and shading layers poses a challenge for learning approaches because no large corpus of precise and realistic ground truth decompositions exists. The Intrinsic Images in the Wild~(IIW) dataset provides a sparse set of relative human reflectance judgments, which serves as a standard benchmark for intrinsic images. A number of methods use IIW to learn statistical… ▽ More

    Submitted 12 June, 2017; v1 submitted 15 December, 2016; originally announced December 2016.

    Comments: CVPR 2017

  10. arXiv:1606.06437  [pdf, other

    cs.CV

    Efficient 2D and 3D Facade Segmentation using Auto-Context

    Authors: Raghudeep Gadde, Varun Jampani, Renaud Marlet, Peter V. Gehler

    Abstract: This paper introduces a fast and efficient segmentation technique for 2D images and 3D point clouds of building facades. Facades of buildings are highly structured and consequently most methods that have been proposed for this problem aim to make use of this strong prior information. Contrary to most prior work, we are describing a system that is almost domain independent and consists of standard… ▽ More

    Submitted 21 June, 2016; originally announced June 2016.

    Comments: 8 pages

  11. arXiv:1511.06739  [pdf, other

    cs.CV

    Superpixel Convolutional Networks using Bilateral Inceptions

    Authors: Raghudeep Gadde, Varun Jampani, Martin Kiefel, Daniel Kappler, Peter V. Gehler

    Abstract: In this paper we propose a CNN architecture for semantic image segmentation. We introduce a new 'bilateral inception' module that can be inserted in existing CNN architectures and performs bilateral filtering, at multiple feature-scales, between superpixels in an image. The feature spaces for bilateral filtering and other parameters of the module are learned end-to-end using standard backpropagati… ▽ More

    Submitted 8 August, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    Comments: European Conference on Computer Vision (ECCV), 2016

    ACM Class: I.2.10; I.2.6

  12. arXiv:1503.04949  [pdf, other

    cs.CV

    Learning Sparse High Dimensional Filters: Image Filtering, Dense CRFs and Bilateral Neural Networks

    Authors: Varun Jampani, Martin Kiefel, Peter V. Gehler

    Abstract: Bilateral filters have wide spread use due to their edge-preserving properties. The common use case is to manually choose a parametric filter type, usually a Gaussian filter. In this paper, we will generalize the parametrization and in particular derive a gradient descent algorithm so the filter parameters can be learned from data. This derivation allows to learn high dimensional linear filters th… ▽ More

    Submitted 25 November, 2015; v1 submitted 17 March, 2015; originally announced March 2015.

  13. arXiv:1412.6618  [pdf, ps, other

    cs.CV cs.LG cs.NE

    Permutohedral Lattice CNNs

    Authors: Martin Kiefel, Varun Jampani, Peter V. Gehler

    Abstract: This paper presents a convolutional layer that is able to process sparse input features. As an example, for image recognition problems this allows an efficient filtering of signals that do not lie on a dense grid (like pixel position), but of more general features (such as color values). The presented algorithm makes use of the permutohedral lattice data structure. The permutohedral lattice was in… ▽ More

    Submitted 3 May, 2015; v1 submitted 20 December, 2014; originally announced December 2014.

  14. arXiv:1402.0859  [pdf, other

    cs.CV cs.LG stat.ML

    The Informed Sampler: A Discriminative Approach to Bayesian Inference in Generative Computer Vision Models

    Authors: Varun Jampani, Sebastian Nowozin, Matthew Loper, Peter V. Gehler

    Abstract: Computer vision is hard because of a large variability in lighting, shape, and texture; in addition the image signal is non-additive due to occlusion. Generative models promised to account for this variability by accurately modelling the image formation process as a function of latent variables with prior beliefs. Bayesian posterior inference could then, in principle, explain the observation. Whil… ▽ More

    Submitted 7 March, 2015; v1 submitted 4 February, 2014; originally announced February 2014.

    Comments: Appearing in Computer Vision and Image Understanding Journal (Special Issue on Generative Models in Computer Vision)