Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–31 of 31 results for author: Tzionas, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.16770  [pdf, other

    cs.CV cs.RO

    3D Whole-body Grasp Synthesis with Directional Controllability

    Authors: Georgios Paschalidis, Romana Wilschut, Dimitrije Antić, Omid Taheri, Dimitrios Tzionas

    Abstract: Synthesizing 3D whole-bodies that realistically grasp objects is useful for animation, mixed reality, and robotics. This is challenging, because the hands and body need to look natural w.r.t. each other, the grasped object, as well as the local scene (i.e., a receptacle supporting the object). Only recent work tackles this, with a divide-and-conquer approach; it first generates a "guiding" right-h… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  2. arXiv:2405.14869  [pdf, other

    cs.CV cs.AI cs.GR

    PuzzleAvatar: Assembling 3D Avatars from Personal Albums

    Authors: Yuliang Xiu, Yufei Ye, Zhen Liu, Dimitrios Tzionas, Michael J. Black

    Abstract: Generating personalized 3D avatars is crucial for AR/VR. However, recent text-to-3D methods that generate avatars for celebrities or fictional characters, struggle with everyday people. Methods for faithful reconstruction typically require full-body images in controlled settings. What if a user could just upload their personal "OOTD" (Outfit Of The Day) photo collection and get a faithful avatar i… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: video: https://youtu.be/0hpXH2tVPk4

  3. arXiv:2309.15273  [pdf, other

    cs.CV

    DECO: Dense Estimation of 3D Human-Scene Contact In The Wild

    Authors: Shashank Tripathi, Agniv Chatterjee, Jean-Claude Passy, Hongwei Yi, Dimitrios Tzionas, Michael J. Black

    Abstract: Understanding how humans use physical contact to interact with the world is key to enabling human-centric artificial intelligence. While inferring 3D contact is crucial for modeling realistic and physically-plausible human-object interactions, existing methods either focus on 2D, consider body joints rather than the surface, use coarse 3D body regions, or do not generalize to in-the-wild images. I… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: Accepted as Oral in ICCV'23. Project page: https://deco.is.tue.mpg.de

  4. arXiv:2308.12965  [pdf, other

    cs.CV

    POCO: 3D Pose and Shape Estimation with Confidence

    Authors: Sai Kumar Dwivedi, Cordelia Schmid, Hongwei Yi, Michael J. Black, Dimitrios Tzionas

    Abstract: The regression of 3D Human Pose and Shape (HPS) from an image is becoming increasingly accurate. This makes the results useful for downstream tasks like human action recognition or 3D graphics. Yet, no regressor is perfect, and accuracy can be affected by ambiguous image evidence or by poses and appearance that are unseen during training. Most current HPS regressors, however, do not report the con… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  5. arXiv:2308.11617  [pdf, other

    cs.CV

    GRIP: Generating Interaction Poses Using Spatial Cues and Latent Consistency

    Authors: Omid Taheri, Yi Zhou, Dimitrios Tzionas, Yang Zhou, Duygu Ceylan, Soren Pirk, Michael J. Black

    Abstract: Hands are dexterous and highly versatile manipulators that are central to how humans interact with objects and their environment. Consequently, modeling realistic hand-object interactions, including the subtle motion of individual fingers, is critical for applications in computer graphics, computer vision, and mixed reality. Prior work on capturing and modeling humans interacting with objects in 3… ▽ More

    Submitted 15 July, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: The project has been started during Omid Taheri's internship at Adobe and as a collaboration with the Max Planck Institute for Intelligent Systems

  6. arXiv:2304.10482  [pdf, other

    cs.CV cs.GR

    Reconstructing Signing Avatars From Video Using Linguistic Priors

    Authors: Maria-Paola Forte, Peter Kulits, Chun-Hao Huang, Vasileios Choutas, Dimitrios Tzionas, Katherine J. Kuchenbecker, Michael J. Black

    Abstract: Sign language (SL) is the primary method of communication for the 70 million Deaf people around the world. Video dictionaries of isolated signs are a core SL learning tool. Replacing these with 3D avatars can aid learning and enable AR/VR applications, improving access to technology and online media. However, little work has attempted to estimate expressive 3D avatars from SL video; occlusion, noi… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

  7. arXiv:2303.18246  [pdf, other

    cs.CV cs.AI cs.GR

    3D Human Pose Estimation via Intuitive Physics

    Authors: Shashank Tripathi, Lea Müller, Chun-Hao P. Huang, Omid Taheri, Michael J. Black, Dimitrios Tzionas

    Abstract: Estimating 3D humans from images often produces implausible bodies that lean, float, or penetrate the floor. Such methods ignore the fact that bodies are typically supported by the scene. A physics engine can be used to enforce physical plausibility, but these are not differentiable, rely on unrealistic proxy bodies, and are difficult to integrate into existing optimization and learning frameworks… ▽ More

    Submitted 24 July, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: Accepted in CVPR'23. Project page: https://ipman.is.tue.mpg.de

  8. arXiv:2303.03373  [pdf, other

    cs.CV

    Detecting Human-Object Contact in Images

    Authors: Yixin Chen, Sai Kumar Dwivedi, Michael J. Black, Dimitrios Tzionas

    Abstract: Humans constantly contact objects to move and perform tasks. Thus, detecting human-object contact is important for building human-centered artificial intelligence. However, there exists no robust method to detect contact between the body and the scene from an image, and there exists no dataset to learn such a detector. We fill this gap with HOT ("Human-Object conTact"), a new dataset of human-obje… ▽ More

    Submitted 4 April, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  9. arXiv:2212.07422  [pdf, other

    cs.CV cs.AI cs.GR

    ECON: Explicit Clothed humans Optimized via Normal integration

    Authors: Yuliang Xiu, Jinlong Yang, Xu Cao, Dimitrios Tzionas, Michael J. Black

    Abstract: The combination of deep learning, artist-curated scans, and Implicit Functions (IF), is enabling the creation of detailed, clothed, 3D humans from images. However, existing methods are far from perfect. IF-based methods recover free-form geometry, but produce disembodied limbs or degenerate shapes for novel poses or clothes. To increase robustness for these cases, existing work uses an explicit pa… ▽ More

    Submitted 23 March, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: Homepage: https://xiuyuliang.cn/econ Code: https://github.com/YuliangXiu/ECON

  10. arXiv:2210.13861  [pdf, other

    cs.CV

    SUPR: A Sparse Unified Part-Based Human Representation

    Authors: Ahmed A. A. Osman, Timo Bolkart, Dimitrios Tzionas, Michael J. Black

    Abstract: Statistical 3D shape models of the head, hands, and fullbody are widely used in computer vision and graphics. Despite their wide use, we show that existing models of the head and hands fail to capture the full range of motion for these parts. Moreover, existing work largely ignores the feet, which are crucial for modeling human movement and have applications in biomechanics, animation, and the foo… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted in ECCV 2022

  11. arXiv:2209.12354  [pdf, other

    cs.CV

    InterCap: Joint Markerless 3D Tracking of Humans and Objects in Interaction

    Authors: Yinghao Huang, Omid Tehari, Michael J. Black, Dimitrios Tzionas

    Abstract: Humans constantly interact with daily objects to accomplish tasks. To understand such interactions, computers need to reconstruct these from cameras observing whole-body interaction with scenes. This is challenging due to occlusion between the body and objects, motion blur, depth/scale ambiguities, and the low image resolution of hands and graspable object parts. To make the problem tractable, the… ▽ More

    Submitted 1 October, 2022; v1 submitted 25 September, 2022; originally announced September 2022.

    Comments: To appear at GCPR2022

  12. arXiv:2206.07036  [pdf, other

    cs.CV

    Accurate 3D Body Shape Regression using Metric and Semantic Attributes

    Authors: Vasileios Choutas, Lea Muller, Chun-Hao P. Huang, Siyu Tang, Dimitrios Tzionas, Michael J. Black

    Abstract: While methods that regress 3D human meshes from images have progressed rapidly, the estimated body shapes often do not capture the true human shape. This is problematic since, for many applications, accurate body shape is as important as pose. The key reason that body shape accuracy lags pose accuracy is the lack of data. While humans can label 2D joints, and these constrain 3D pose, it is not so… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: First two authors contributed equally

    Journal ref: CVPR 2022

  13. arXiv:2204.13662  [pdf, other

    cs.CV

    ARCTIC: A Dataset for Dexterous Bimanual Hand-Object Manipulation

    Authors: Zicong Fan, Omid Taheri, Dimitrios Tzionas, Muhammed Kocabas, Manuel Kaufmann, Michael J. Black, Otmar Hilliges

    Abstract: Humans intuitively understand that inanimate objects do not move by themselves, but that state changes are typically caused by human manipulation (e.g., the opening of a book). This is not yet the case for machines. In part this is because there exist no datasets with ground-truth 3D annotations for the study of physically consistent and synchronised motion of hands and articulated objects. To thi… ▽ More

    Submitted 23 April, 2023; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: Project page: https://arctic.is.tue.mpg.de

  14. arXiv:2203.03609  [pdf, other

    cs.CV cs.AI cs.GR

    Human-Aware Object Placement for Visual Environment Reconstruction

    Authors: Hongwei Yi, Chun-Hao P. Huang, Dimitrios Tzionas, Muhammed Kocabas, Mohamed Hassan, Siyu Tang, Justus Thies, Michael J. Black

    Abstract: Humans are in constant contact with the world as they move through it and interact with it. This contact is a vital source of information for understanding 3D humans, 3D scenes, and the interactions between them. In fact, we demonstrate that these human-scene interactions (HSIs) can be leveraged to improve the 3D reconstruction of a scene from a monocular RGB video. Our key idea is that, as a pers… ▽ More

    Submitted 28 March, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: Accepted by CVPR2022

  15. Embodied Hands: Modeling and Capturing Hands and Bodies Together

    Authors: Javier Romero, Dimitrios Tzionas, Michael J. Black

    Abstract: Humans move their hands and bodies together to communicate and solve tasks. Capturing and replicating such coordinated activity is critical for virtual characters that behave realistically. Surprisingly, most methods treat the 3D modeling and tracking of bodies and hands separately. Here we formulate a model of hands and bodies interacting together and fit it to full-body 4D sequences. When scanni… ▽ More

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: SIGGRAPH ASIA 2017

    Journal ref: ACM Transactions on Graphics, Vol. 36, No. 6, Article 245. Publication date: November 2017

  16. arXiv:2112.11454  [pdf, other

    cs.CV

    GOAL: Generating 4D Whole-Body Motion for Hand-Object Grasping

    Authors: Omid Taheri, Vasileios Choutas, Michael J. Black, Dimitrios Tzionas

    Abstract: Generating digital humans that move realistically has many applications and is widely studied, but existing methods focus on the major limbs of the body, ignoring the hands and head. Hands have been separately studied, but the focus has been on generating realistic static grasps of objects. To synthesize virtual characters that interact with the world, we need to generate full-body motions and rea… ▽ More

    Submitted 16 March, 2023; v1 submitted 21 December, 2021; originally announced December 2021.

  17. arXiv:2112.09127  [pdf, other

    cs.CV cs.AI cs.GR

    ICON: Implicit Clothed humans Obtained from Normals

    Authors: Yuliang Xiu, Jinlong Yang, Dimitrios Tzionas, Michael J. Black

    Abstract: Current methods for learning realistic and animatable 3D clothed avatars need either posed 3D scans or 2D images with carefully controlled user poses. In contrast, our goal is to learn an avatar from only 2D images of people in unconstrained poses. Given a set of images, our method estimates a detailed 3D surface from each image and then combines these into an animatable avatar. Implicit functions… ▽ More

    Submitted 28 March, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: Project page: https://icon.is.tue.mpg.de/. Accepted by CVPR 2022

  18. arXiv:2105.05301  [pdf, other

    cs.CV

    Collaborative Regression of Expressive Bodies using Moderation

    Authors: Yao Feng, Vasileios Choutas, Timo Bolkart, Dimitrios Tzionas, Michael J. Black

    Abstract: Recovering expressive humans from images is essential for understanding human behavior. Methods that estimate 3D bodies, faces, or hands have progressed significantly, yet separately. Face methods recover accurate 3D shape and geometric details, but need a tight crop and struggle with extreme views and low resolution. Whole-body methods are robust to a wide range of poses and resolutions, but prov… ▽ More

    Submitted 15 October, 2021; v1 submitted 11 May, 2021; originally announced May 2021.

    Comments: 21 pages. The first two authors contributed equally to this work

  19. arXiv:2012.11581  [pdf, other

    cs.CV

    Populating 3D Scenes by Learning Human-Scene Interaction

    Authors: Mohamed Hassan, Partha Ghosh, Joachim Tesch, Dimitrios Tzionas, Michael J. Black

    Abstract: Humans live within a 3D space and constantly interact with it to perform tasks. Such interactions involve physical contact between surfaces that is semantically meaningful. Our goal is to learn how humans interact with scenes and leverage this to enable virtual characters to do the same. To that end, we introduce a novel Human-Scene Interaction (HSI) model that encodes proximal relationships, call… ▽ More

    Submitted 5 April, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Journal ref: CVPR2021

  20. GRAB: A Dataset of Whole-Body Human Grasping of Objects

    Authors: Omid Taheri, Nima Ghorbani, Michael J. Black, Dimitrios Tzionas

    Abstract: Training computers to understand, model, and synthesize human grasping requires a rich dataset containing complex 3D object shapes, detailed contact information, hand pose and shape, and the 3D body motion over time. While "grasping" is commonly thought of as a single hand stably lifting an object, we capture the motion of the entire body and adopt the generalized notion of "whole-body grasps". Th… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

    Comments: ECCV 2020

  21. arXiv:2008.09062  [pdf, other

    cs.CV cs.GR

    Monocular Expressive Body Regression through Body-Driven Attention

    Authors: Vasileios Choutas, Georgios Pavlakos, Timo Bolkart, Dimitrios Tzionas, Michael J. Black

    Abstract: To understand how people look, interact, or perform tasks, we need to quickly and accurately capture their 3D body, face, and hands together from an RGB image. Most existing methods focus only on parts of the body. A few recent approaches reconstruct full expressive 3D humans from images using 3D body models that include the face and hands. These methods are optimization-based and thus slow, prone… ▽ More

    Submitted 20 August, 2020; originally announced August 2020.

    Comments: Accepted in ECCV'20. Project page: http://expose.is.tue.mpg.de

  22. Learning Multi-Human Optical Flow

    Authors: Anurag Ranjan, David T. Hoffmann, Dimitrios Tzionas, Siyu Tang, Javier Romero, Michael J. Black

    Abstract: The optical flow of humans is well known to be useful for the analysis of human action. Recent optical flow methods focus on training deep networks to approach the problem. However, the training data used by them does not cover the domain of human motion. Therefore, we develop a dataset of multi-human optical flow and train optical flow networks on this dataset. We use a 3D model of the human body… ▽ More

    Submitted 4 December, 2019; v1 submitted 24 October, 2019; originally announced October 2019.

    Comments: arXiv admin note: text overlap with arXiv:1806.05666

    Report number: 2019

    Journal ref: International Journal of Computer Vision (IJCV) 2019

  23. arXiv:1908.06963  [pdf, other

    cs.CV

    Resolving 3D Human Pose Ambiguities with 3D Scene Constraints

    Authors: Mohamed Hassan, Vasileios Choutas, Dimitrios Tzionas, Michael J. Black

    Abstract: To understand and analyze human behavior, we need to capture humans moving in, and interacting with, the world. Most existing methods perform 3D human pose estimation without explicitly considering the scene. We observe however that the world constrains the body and vice-versa. To motivate this, we show that current 3D human pose estimation methods produce results that are not consistent with the… ▽ More

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: To appear in ICCV 2019

  24. arXiv:1908.00967  [pdf, other

    cs.CV

    Learning to Train with Synthetic Humans

    Authors: David T. Hoffmann, Dimitrios Tzionas, Micheal J. Black, Siyu Tang

    Abstract: Neural networks need big annotated datasets for training. However, manual annotation can be too expensive or even unfeasible for certain tasks, like multi-person 2D pose estimation with severe occlusions. A remedy for this is synthetic data with perfect ground truth. Here we explore two variations of synthetic data for this challenging problem; a dataset with purely synthetic humans and a real dat… ▽ More

    Submitted 2 August, 2019; originally announced August 2019.

    Comments: In German Conference on Pattern Recognition (GCPR)

  25. arXiv:1904.05866  [pdf, other

    cs.CV

    Expressive Body Capture: 3D Hands, Face, and Body from a Single Image

    Authors: Georgios Pavlakos, Vasileios Choutas, Nima Ghorbani, Timo Bolkart, Ahmed A. A. Osman, Dimitrios Tzionas, Michael J. Black

    Abstract: To facilitate the analysis of human actions, interactions and emotions, we compute a 3D model of human body pose, hand pose, and facial expression from a single monocular image. To achieve this, we use thousands of 3D scans to train a new, unified, 3D model of the human body, SMPL-X, that extends SMPL with fully articulated hands and an expressive face. Learning to regress the parameters of SMPL-X… ▽ More

    Submitted 11 April, 2019; originally announced April 2019.

    Comments: To appear in CVPR 2019

  26. arXiv:1904.05767  [pdf, other

    cs.CV

    Learning joint reconstruction of hands and manipulated objects

    Authors: Yana Hasson, Gül Varol, Dimitrios Tzionas, Igor Kalevatykh, Michael J. Black, Ivan Laptev, Cordelia Schmid

    Abstract: Estimating hand-object manipulations is essential for interpreting and imitating human actions. Previous work has made significant progress towards reconstruction of hand poses and object shapes in isolation. Yet, reconstructing hands and objects during manipulation is a more challenging task due to significant occlusions of both the hand and object. While presenting challenges, manipulations may… ▽ More

    Submitted 11 April, 2019; originally announced April 2019.

    Comments: CVPR 2019

  27. 3D Object Reconstruction from Hand-Object Interactions

    Authors: Dimitrios Tzionas, Juergen Gall

    Abstract: Recent advances have enabled 3d object reconstruction approaches using a single off-the-shelf RGB-D camera. Although these approaches are successful for a wide range of object classes, they rely on stable and distinctive geometric or texture features. Many objects like mechanical parts, toys, household or decorative articles, however, are textureless and characterized by minimalistic shapes that a… ▽ More

    Submitted 3 April, 2017; originally announced April 2017.

    Comments: International Conference on Computer Vision (ICCV) 2015, http://files.is.tue.mpg.de/dtzionas/In-Hand-Scanning

  28. Capturing Hand Motion with an RGB-D Sensor, Fusing a Generative Model with Salient Points

    Authors: Dimitrios Tzionas, Abhilash Srikantha, Pablo Aponte, Juergen Gall

    Abstract: Hand motion capture has been an active research topic in recent years, following the success of full-body pose tracking. Despite similarities, hand tracking proves to be more challenging, characterized by a higher dimensionality, severe occlusions and self-similarity between fingers. For this reason, most approaches rely on strong assumptions, like hands in isolation or expensive multi-camera syst… ▽ More

    Submitted 3 April, 2017; originally announced April 2017.

    Comments: German Conference on Pattern Recognition (GCPR) 2014, http://files.is.tue.mpg.de/dtzionas/GCPR_2014.html

  29. A Comparison of Directional Distances for Hand Pose Estimation

    Authors: Dimitrios Tzionas, Juergen Gall

    Abstract: Benchmarking methods for 3d hand tracking is still an open problem due to the difficulty of acquiring ground truth data. We introduce a new dataset and benchmarking protocol that is insensitive to the accumulative error of other protocols. To this end, we create testing frame pairs of increasing difficulty and measure the pose estimation error separately for each of them. This approach gives new i… ▽ More

    Submitted 3 April, 2017; originally announced April 2017.

    Comments: German Conference on Pattern Recognition (GCPR) 2013, http://files.is.tue.mpg.de/dtzionas/GCPR_2013.html

  30. arXiv:1609.01371  [pdf, other

    cs.CV

    Reconstructing Articulated Rigged Models from RGB-D Videos

    Authors: Dimitrios Tzionas, Juergen Gall

    Abstract: Although commercial and open-source software exist to reconstruct a static object from a sequence recorded with an RGB-D sensor, there is a lack of tools that build rigged models of articulated objects that deform realistically and can be used for tracking or animation. In this work, we fill this gap and propose a method that creates a fully rigged model of an articulated object from depth data of… ▽ More

    Submitted 9 September, 2016; v1 submitted 5 September, 2016; originally announced September 2016.

    Comments: Accepted for publication - European Conference on Computer Vision Workshops 2016 (ECCVW'16) - Workshop on Recovering 6D Object Pose (R6D'16)

  31. Capturing Hands in Action using Discriminative Salient Points and Physics Simulation

    Authors: Dimitrios Tzionas, Luca Ballan, Abhilash Srikantha, Pablo Aponte, Marc Pollefeys, Juergen Gall

    Abstract: Hand motion capture is a popular research field, recently gaining more attention due to the ubiquity of RGB-D sensors. However, even most recent approaches focus on the case of a single isolated hand. In this work, we focus on hands that interact with other hands or objects and present a framework that successfully captures motion in such interaction scenarios for both rigid and articulated object… ▽ More

    Submitted 7 March, 2016; v1 submitted 6 June, 2015; originally announced June 2015.

    Comments: Accepted for publication by the International Journal of Computer Vision (IJCV) on 16.02.2016 (submitted on 17.10.14). A combination into a single framework of an ECCV'12 multicamera-RGB and a monocular-RGBD GCPR'14 hand tracking paper with several extensions, additional experiments and details