Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–17 of 17 results for author: Bregier, R

.
  1. arXiv:2402.14654  [pdf, other

    cs.CV

    Multi-HMR: Multi-Person Whole-Body Human Mesh Recovery in a Single Shot

    Authors: Fabien Baradel, Matthieu Armando, Salma Galaaoui, Romain Brégier, Philippe Weinzaepfel, Grégory Rogez, Thomas Lucas

    Abstract: We present Multi-HMR, a strong single-shot model for multi-person 3D human mesh recovery from a single RGB image. Predictions encompass the whole body, i.e, including hands and facial expressions, using the SMPL-X parametric model and spatial location in the camera coordinate system. Our model detects people by predicting coarse 2D heatmaps of person centers, using features produced by a standard… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: https://github.com/naver/multi-hmr

  2. arXiv:2311.09104  [pdf, other

    cs.CV

    Cross-view and Cross-pose Completion for 3D Human Understanding

    Authors: Matthieu Armando, Salma Galaaoui, Fabien Baradel, Thomas Lucas, Vincent Leroy, Romain Brégier, Philippe Weinzaepfel, Grégory Rogez

    Abstract: Human perception and understanding is a major domain of computer vision which, like many other vision subdomains recently, stands to gain from the use of large models pre-trained on large datasets. We hypothesize that the most common pre-training strategy of relying on general purpose, object-centric image datasets such as ImageNet, is limited by an important domain shift. On the other hand, colle… ▽ More

    Submitted 18 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: CVPR 2024

  3. arXiv:2310.01897  [pdf, other

    cs.CV

    MFOS: Model-Free & One-Shot Object Pose Estimation

    Authors: JongMin Lee, Yohann Cabon, Romain Brégier, Sungjoo Yoo, Jerome Revaud

    Abstract: Existing learning-based methods for object pose estimation in RGB images are mostly model-specific or category based. They lack the capability to generalize to new object categories at test time, hence severely hindering their practicability and scalability. Notably, recent attempts have been made to solve this issue, but they still require accurate 3D data of the object surface at both train and… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  4. arXiv:2309.10748  [pdf, other

    cs.CV cs.AI cs.GR cs.LG cs.RO

    SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction

    Authors: Anilkumar Swamy, Vincent Leroy, Philippe Weinzaepfel, Fabien Baradel, Salma Galaaoui, Romain Bregier, Matthieu Armando, Jean-Sebastien Franco, Gregory Rogez

    Abstract: Recent hand-object interaction datasets show limited real object variability and rely on fitting the MANO parametric model to obtain groundtruth hand shapes. To go beyond these limitations and spur further research, we introduce the SHOWMe dataset which consists of 96 videos, annotated with real and detailed hand-object 3D textured meshes. Following recent work, we consider a rigid hand-object sce… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Paper and Appendix, Accepted in ACVR workshop at ICCV conference

  5. arXiv:2307.11702  [pdf, other

    cs.CV

    SACReg: Scene-Agnostic Coordinate Regression for Visual Localization

    Authors: Jerome Revaud, Yohann Cabon, Romain Brégier, JongMin Lee, Philippe Weinzaepfel

    Abstract: Scene coordinates regression (SCR), i.e., predicting 3D coordinates for every pixel of a given image, has recently shown promising potential. However, existing methods remain limited to small scenes memorized during training, and thus hardly scale to realistic datasets and scenarios. In this paper, we propose a generalized SCR model trained once to be deployed in new test scenes, regardless of the… ▽ More

    Submitted 30 November, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

  6. arXiv:2211.10408  [pdf, other

    cs.CV

    CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow

    Authors: Philippe Weinzaepfel, Thomas Lucas, Vincent Leroy, Yohann Cabon, Vaibhav Arora, Romain Brégier, Gabriela Csurka, Leonid Antsfeld, Boris Chidlovskii, Jérôme Revaud

    Abstract: Despite impressive performance for high-level downstream tasks, self-supervised pre-training methods have not yet fully delivered on dense geometric vision tasks such as stereo matching or optical flow. The application of self-supervised concepts, such as instance discrimination or masked image modeling, to geometric tasks is an active area of research. In this work, we build on the recent cross-v… ▽ More

    Submitted 18 August, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

    Comments: ICCV 2023

  7. arXiv:2211.07304  [pdf, other

    cs.RO

    Multi-Finger Grasping Like Humans

    Authors: Yuming Du, Philippe Weinzaepfel, Vincent Lepetit, Romain Brégier

    Abstract: Robots with multi-fingered grippers could perform advanced manipulation tasks for us if we were able to properly specify to them what to do. In this study, we take a step in that direction by making a robot grasp an object like a grasping demonstration performed by a human. We propose a novel optimization-based approach for transferring human grasp demonstrations to any multi-fingered grippers, wh… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: presented at IROS 2022 conference

    Journal ref: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  8. arXiv:2210.10716  [pdf, other

    cs.CV

    CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion

    Authors: Philippe Weinzaepfel, Vincent Leroy, Thomas Lucas, Romain Brégier, Yohann Cabon, Vaibhav Arora, Leonid Antsfeld, Boris Chidlovskii, Gabriela Csurka, Jérôme Revaud

    Abstract: Masked Image Modeling (MIM) has recently been established as a potent pre-training paradigm. A pretext task is constructed by masking patches in an input image, and this masked content is then predicted by a neural network using visible patches as sole input. This pre-training leads to state-of-the-art performance when finetuned for high-level semantic tasks, e.g. image classification and object d… ▽ More

    Submitted 12 January, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  9. arXiv:2208.10211  [pdf, other

    cs.CV

    PoseBERT: A Generic Transformer Module for Temporal 3D Human Modeling

    Authors: Fabien Baradel, Romain Brégier, Thibault Groueix, Philippe Weinzaepfel, Yannis Kalantidis, Grégory Rogez

    Abstract: Training state-of-the-art models for human pose estimation in videos requires datasets with annotations that are really hard and expensive to obtain. Although transformers have been recently utilized for body pose sequence modeling, related methods rely on pseudo-ground truth to augment the currently limited training data available for learning such models. In this paper, we introduce PoseBERT, a… ▽ More

    Submitted 19 October, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: Accepted to TPAMI 2022

  10. arXiv:2110.09243  [pdf, other

    cs.CV

    Leveraging MoCap Data for Human Mesh Recovery

    Authors: Fabien Baradel, Thibault Groueix, Philippe Weinzaepfel, Romain Brégier, Yannis Kalantidis, Grégory Rogez

    Abstract: Training state-of-the-art models for human body pose and shape recovery from images or videos requires datasets with corresponding annotations that are really hard and expensive to obtain. Our goal in this paper is to study whether poses from 3D Motion Capture (MoCap) data can be used to improve image-based and video-based human mesh recovery methods. We find that fine-tune image-based models with… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: 3DV 2021

  11. arXiv:2103.16317  [pdf, other

    cs.CV

    Deep Regression on Manifolds: A 3D Rotation Case Study

    Authors: Romain Brégier

    Abstract: Many machine learning problems involve regressing variables on a non-Euclidean manifold -- e.g. a discrete probability distribution, or the 6D pose of an object. One way to tackle these problems through gradient-based learning is to use a differentiable function that maps arbitrary inputs of a Euclidean space onto the manifold. In this paper, we establish a set of desirable properties for such map… ▽ More

    Submitted 12 October, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: Oral presentation at 3DV 2021

    MSC Class: 68T07

  12. arXiv:2012.02743  [pdf, other

    cs.CV

    SMPLy Benchmarking 3D Human Pose Estimation in the Wild

    Authors: Vincent Leroy, Philippe Weinzaepfel, Romain Brégier, Hadrien Combaluzier, Grégory Rogez

    Abstract: Predicting 3D human pose from images has seen great recent improvements. Novel approaches that can even predict both pose and shape from a single input image have been introduced, often relying on a parametric model of the human body such as SMPL. While qualitative results for such methods are often shown for images captured in-the-wild, a proper benchmark in such conditions is still missing, as i… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

    Comments: 3DV 2020 Oral presentation

  13. arXiv:2008.09457  [pdf, other

    cs.CV

    DOPE: Distillation Of Part Experts for whole-body 3D pose estimation in the wild

    Authors: Philippe Weinzaepfel, Romain Brégier, Hadrien Combaluzier, Vincent Leroy, Grégory Rogez

    Abstract: We introduce DOPE, the first method to detect and estimate whole-body 3D human poses, including bodies, hands and faces, in the wild. Achieving this level of details is key for a number of applications that require understanding the interactions of the people with each other or with the environment. The main challenge is the lack of in-the-wild data with labeled whole-body 3D poses. In previous wo… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: ECCV 2020

  14. arXiv:2003.13764  [pdf, other

    cs.CV

    Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction

    Authors: Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, MingXiu Chen, Boshen Zhang, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang, Haifeng Sun, Marek Hrúz, Jakub Kanis, Zdeněk Krňoul, Qingfu Wan, Shile Li, Linlin Yang, Dongheui Lee, Angela Yao, Weiguo Zhou , et al. (10 additional authors not shown)

    Abstract: We study how well different types of approaches generalise in the task of 3D hand pose estimation under single hand scenarios and hand-object interaction. We show that the accuracy of state-of-the-art methods can drop, and that they fail mostly on poses absent from the training set. Unfortunately, since the space of hand poses is highly dimensional, it is inherently not feasible to cover the whole… ▽ More

    Submitted 10 September, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: European Conference on Computer Vision (ECCV), 2020

  15. Symmetry Aware Evaluation of 3D Object Detection and Pose Estimation in Scenes of Many Parts in Bulk

    Authors: Romain Brégier, Frédéric Devernay, Laetitia Leyrit, James Crowley

    Abstract: While 3D object detection and pose estimation has been studied for a long time, its evaluation is not yet completely satisfactory. Indeed, existing datasets typically consist in numerous acquisitions of only a few scenes because of the tediousness of pose annotation, and existing evaluation protocols cannot handle properly objects with symmetries. This work aims at addressing those two points. We… ▽ More

    Submitted 21 June, 2018; originally announced June 2018.

    Journal ref: 2017 IEEE International Conference on Computer Vision Workshop (ICCVW), Oct 2017, Venice, France. IEEE

  16. arXiv:1801.01281  [pdf, other

    cs.CV cs.LG cs.RO

    Object segmentation in depth maps with one user click and a synthetically trained fully convolutional network

    Authors: Matthieu Grard, Romain Brégier, Florian Sella, Emmanuel Dellandréa, Liming Chen

    Abstract: With more and more household objects built on planned obsolescence and consumed by a fast-growing population, hazardous waste recycling has become a critical challenge. Given the large variability of household waste, current recycling platforms mostly rely on human operators to analyze the scene, typically composed of many object instances piled up in bulk. Helping them by robotizing the unitary e… ▽ More

    Submitted 24 September, 2018; v1 submitted 4 January, 2018; originally announced January 2018.

    Comments: This is a pre-print of an article published in Human Friendly Robotics, 10th International Workshop, Springer Proceedings in Advanced Robotics, vol 7. The final authenticated version is available online at: https://doi.org/10.1007/978-3-319-89327-3\_16, Springer Proceedings in Advanced Robotics, Siciliano Bruno, Khatib Oussama, In press, Human Friendly Robotics, 10th International Workshop, 7

  17. arXiv:1612.04631  [pdf, other

    cs.CV math.MG physics.class-ph

    Defining the Pose of any 3D Rigid Object and an Associated Distance

    Authors: Romain Brégier, Frédéric Devernay, Laetitia Leyrit, James Crowley

    Abstract: The pose of a rigid object is usually regarded as a rigid transformation, described by a translation and a rotation. However, equating the pose space with the space of rigid transformations is in general abusive, as it does not account for objects with proper symmetries -- which are common among man-made objects.In this article, we define pose as a distinguishable static state of an object, and eq… ▽ More

    Submitted 29 November, 2017; v1 submitted 14 December, 2016; originally announced December 2016.

    Journal ref: International Journal of Computer Vision, Springer Verlag, 2017