Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–37 of 37 results for author: Aberman, K

.
  1. arXiv:2407.11966  [pdf, other

    cs.CV cs.AI cs.LG

    Efficient Training with Denoised Neural Weights

    Authors: Yifan Gong, Zheng Zhan, Yanyu Li, Yerlan Idelbayev, Andrey Zharkov, Kfir Aberman, Sergey Tulyakov, Yanzhi Wang, Jian Ren

    Abstract: Good weight initialization serves as an effective measure to reduce the training cost of a deep neural network (DNN) model. The choice of how to initialize parameters is challenging and may require manual tuning, which can be time-consuming and prone to human error. To overcome such limitations, this work takes a novel step towards building a weight generator to synthesize the neural weights for i… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: ECCV 2024. Project Page: https://yifanfanfanfan.github.io/denoised-weights/

  2. arXiv:2406.09413  [pdf, other

    cs.CV cs.GR cs.LG

    Interpreting the Weight Space of Customized Diffusion Models

    Authors: Amil Dravid, Yossi Gandelsman, Kuan-Chieh Wang, Rameen Abdal, Gordon Wetzstein, Alexei A. Efros, Kfir Aberman

    Abstract: We investigate the space of weights spanned by a large collection of customized diffusion models. We populate this space by creating a dataset of over 60,000 models, each of which is a base model fine-tuned to insert a different person's visual identity. We model the underlying manifold of these weights as a subspace, which we term weights2weights. We demonstrate three immediate applications of th… ▽ More

    Submitted 17 July, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: Project Page: https://snap-research.github.io/weights2weights

  3. arXiv:2404.11565  [pdf, other

    cs.CV cs.AI cs.GR

    MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation

    Authors: Kuan-Chieh Wang, Daniil Ostashev, Yuwei Fang, Sergey Tulyakov, Kfir Aberman

    Abstract: We introduce a new architecture for personalization of text-to-image diffusion models, coined Mixture-of-Attention (MoA). Inspired by the Mixture-of-Experts mechanism utilized in large language models (LLMs), MoA distributes the generation workload between two attention pathways: a personalized branch and a non-personalized prior branch. MoA is designed to retain the original model's prior by fixi… ▽ More

    Submitted 6 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: Project Website: https://snap-research.github.io/mixture-of-attention, Same as previous version, only updated metadata because bib was missing an author name

  4. arXiv:2403.16990  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation

    Authors: Omer Dahary, Or Patashnik, Kfir Aberman, Daniel Cohen-Or

    Abstract: Text-to-image diffusion models have an unprecedented ability to generate diverse and high-quality images. However, they often struggle to faithfully capture the intended semantics of complex input prompts that include multiple subjects. Recently, numerous layout-to-image extensions have been introduced to improve user control, aiming to localize subjects represented by specific tokens. Yet, these… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Project page: https://omer11a.github.io/bounded-attention/

  5. arXiv:2403.14599  [pdf, other

    cs.CV

    MyVLM: Personalizing VLMs for User-Specific Queries

    Authors: Yuval Alaluf, Elad Richardson, Sergey Tulyakov, Kfir Aberman, Daniel Cohen-Or

    Abstract: Recent large-scale vision-language models (VLMs) have demonstrated remarkable capabilities in understanding and generating textual descriptions for visual content. However, these models lack an understanding of user-specific concepts. In this work, we take a first step toward the personalization of VLMs, enabling them to learn and reason over user-provided concepts. For example, we explore whether… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Project page: https://snap-research.github.io/MyVLM/

  6. arXiv:2402.00867  [pdf, other

    cs.CV

    AToM: Amortized Text-to-Mesh using 2D Diffusion

    Authors: Guocheng Qian, Junli Cao, Aliaksandr Siarohin, Yash Kant, Chaoyang Wang, Michael Vasilkovsky, Hsin-Ying Lee, Yuwei Fang, Ivan Skorokhodov, Peiye Zhuang, Igor Gilitschenski, Jian Ren, Bernard Ghanem, Kfir Aberman, Sergey Tulyakov

    Abstract: We introduce Amortized Text-to-Mesh (AToM), a feed-forward text-to-mesh framework optimized across multiple text prompts simultaneously. In contrast to existing text-to-3D methods that often entail time-consuming per-prompt optimization and commonly output representations other than polygonal meshes, AToM directly generates high-quality textured meshes in less than 1 second with around 10 times re… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 19 pages with appendix and references. Webpage: https://snap-research.github.io/AToM/

  7. arXiv:2401.06127  [pdf, other

    cs.CV cs.AI cs.LG

    E$^{2}$GAN: Efficient Training of Efficient GANs for Image-to-Image Translation

    Authors: Yifan Gong, Zheng Zhan, Qing Jin, Yanyu Li, Yerlan Idelbayev, Xian Liu, Andrey Zharkov, Kfir Aberman, Sergey Tulyakov, Yanzhi Wang, Jian Ren

    Abstract: One highly promising direction for enabling flexible real-time on-device image editing is utilizing data distillation by leveraging large-scale text-to-image diffusion models to generate paired datasets used for training generative adversarial networks (GANs). This approach notably alleviates the stringent requirements typically imposed by high-end commercial GPUs for performing image editing with… ▽ More

    Submitted 2 June, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: ICML 2024. Project Page: https://yifanfanfanfan.github.io/e2gan/

  8. arXiv:2312.17234  [pdf, other

    cs.CV

    Personalized Restoration via Dual-Pivot Tuning

    Authors: Pradyumna Chari, Sizhuo Ma, Daniil Ostashev, Achuta Kadambi, Gurunandan Krishnan, Jian Wang, Kfir Aberman

    Abstract: Generative diffusion models can serve as a prior which ensures that solutions of image restoration systems adhere to the manifold of natural images. However, for restoring facial images, a personalized prior is necessary to accurately represent and reconstruct unique facial features of a given individual. In this paper, we propose a simple, yet effective, method for personalized restoration, calle… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  9. arXiv:2312.02432  [pdf, other

    cs.CV

    Orthogonal Adaptation for Modular Customization of Diffusion Models

    Authors: Ryan Po, Guandao Yang, Kfir Aberman, Gordon Wetzstein

    Abstract: Customization techniques for text-to-image models have paved the way for a wide range of previously unattainable applications, enabling the generation of specific concepts across diverse contexts and styles. While existing methods facilitate high-fidelity customization for individual concepts or a limited, pre-defined set of them, they fall short of achieving scalability, where a single model can… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Project page: https://ryanpo.com/ortha/

  10. arXiv:2311.09571  [pdf, other

    cs.GR cs.CV

    3D Paintbrush: Local Stylization of 3D Shapes with Cascaded Score Distillation

    Authors: Dale Decatur, Itai Lang, Kfir Aberman, Rana Hanocka

    Abstract: In this work we develop 3D Paintbrush, a technique for automatically texturing local semantic regions on meshes via text descriptions. Our method is designed to operate directly on meshes, producing texture maps which seamlessly integrate into standard graphics pipelines. We opt to simultaneously produce a localization map (to specify the edit region) and a texture map which conforms to it. This s… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: Project page: https://threedle.github.io/3d-paintbrush

  11. arXiv:2310.07204  [pdf, other

    cs.AI cs.CV cs.GR cs.LG

    State of the Art on Diffusion Models for Visual Computing

    Authors: Ryan Po, Wang Yifan, Vladislav Golyanik, Kfir Aberman, Jonathan T. Barron, Amit H. Bermano, Eric Ryan Chan, Tali Dekel, Aleksander Holynski, Angjoo Kanazawa, C. Karen Liu, Lingjie Liu, Ben Mildenhall, Matthias Nießner, Björn Ommer, Christian Theobalt, Peter Wonka, Gordon Wetzstein

    Abstract: The field of visual computing is rapidly advancing due to the emergence of generative artificial intelligence (AI), which unlocks unprecedented capabilities for the generation, editing, and reconstruction of images, videos, and 3D scenes. In these domains, diffusion models are the generative AI architecture of choice. Within the last year alone, the literature on diffusion-based tools and applicat… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  12. arXiv:2309.16668  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    RealFill: Reference-Driven Generation for Authentic Image Completion

    Authors: Luming Tang, Nataniel Ruiz, Qinghao Chu, Yuanzhen Li, Aleksander Holynski, David E. Jacobs, Bharath Hariharan, Yael Pritch, Neal Wadhwa, Kfir Aberman, Michael Rubinstein

    Abstract: Recent advances in generative imagery have brought forth outpainting and inpainting models that can produce high-quality, plausible image content in unknown regions. However, the content these models hallucinate is necessarily inauthentic, since they are unaware of the true scene. In this work, we propose RealFill, a novel generative approach for image completion that fills in missing regions of a… ▽ More

    Submitted 14 May, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: SIGGRAPH 2024 (Journal Track). Project page: https://realfill.github.io

  13. arXiv:2307.15042  [pdf, other

    cs.CV cs.GR

    TEDi: Temporally-Entangled Diffusion for Long-Term Motion Synthesis

    Authors: Zihan Zhang, Richard Liu, Kfir Aberman, Rana Hanocka

    Abstract: The gradual nature of a diffusion process that synthesizes samples in small increments constitutes a key ingredient of Denoising Diffusion Probabilistic Models (DDPM), which have presented unprecedented quality in image synthesis and been recently explored in the motion domain. In this work, we propose to adapt the gradual diffusion concept (operating along a diffusion time-axis) into the temporal… ▽ More

    Submitted 29 July, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: Project page: https://threedle.github.io/TEDi/

  14. arXiv:2307.06949  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models

    Authors: Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Wei Wei, Tingbo Hou, Yael Pritch, Neal Wadhwa, Michael Rubinstein, Kfir Aberman

    Abstract: Personalization has emerged as a prominent aspect within the field of generative AI, enabling the synthesis of individuals in diverse contexts and styles, while retaining high-fidelity to their identities. However, the process of personalization presents inherent challenges in terms of time and memory requirements. Fine-tuning each personalized model needs considerable GPU time investment, and sto… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: project page: https://hyperdreambooth.github.io

  15. arXiv:2305.16311  [pdf, other

    cs.CV cs.GR cs.LG

    Break-A-Scene: Extracting Multiple Concepts from a Single Image

    Authors: Omri Avrahami, Kfir Aberman, Ohad Fried, Daniel Cohen-Or, Dani Lischinski

    Abstract: Text-to-image model personalization aims to introduce a user-provided concept to the model, allowing its synthesis in diverse contexts. However, current methods primarily focus on the case of learning a single concept from multiple images with variations in backgrounds and poses, and struggle when adapted to a different scenario. In this work, we introduce the task of textual scene decomposition:… ▽ More

    Submitted 4 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: SIGGRAPH Asia 2023. Project page: at: https://omriavrahami.com/break-a-scene/ Video: https://www.youtube.com/watch?v=-9EA-BhizgM

  16. arXiv:2304.07090  [pdf, other

    cs.CV cs.GR cs.LG

    Delta Denoising Score

    Authors: Amir Hertz, Kfir Aberman, Daniel Cohen-Or

    Abstract: We introduce Delta Denoising Score (DDS), a novel scoring function for text-based image editing that guides minimal modifications of an input image towards the content described in a target prompt. DDS leverages the rich generative prior of text-to-image diffusion models and can be used as a loss term in an optimization problem to steer an image towards a desired direction dictated by a text. DDS… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: Project page: https://delta-denoising-score.github.io/

  17. arXiv:2303.13508  [pdf, other

    cs.CV cs.AI cs.GR

    DreamBooth3D: Subject-Driven Text-to-3D Generation

    Authors: Amit Raj, Srinivas Kaza, Ben Poole, Michael Niemeyer, Nataniel Ruiz, Ben Mildenhall, Shiran Zada, Kfir Aberman, Michael Rubinstein, Jonathan Barron, Yuanzhen Li, Varun Jampani

    Abstract: We present DreamBooth3D, an approach to personalize text-to-3D generative models from as few as 3-6 casually captured images of a subject. Our approach combines recent advances in personalizing text-to-image models (DreamBooth) with text-to-3D generation (DreamFusion). We find that naively combining these methods fails to yield satisfactory subject-specific 3D assets due to personalized text-to-im… ▽ More

    Submitted 27 March, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Project page at https://dreambooth3d.github.io/ Video Summary at https://youtu.be/kKVDrbfvOoA

  18. arXiv:2303.09522  [pdf, other

    cs.CV cs.CL cs.GR cs.LG

    P+: Extended Textual Conditioning in Text-to-Image Generation

    Authors: Andrey Voynov, Qinghao Chu, Daniel Cohen-Or, Kfir Aberman

    Abstract: We introduce an Extended Textual Conditioning space in text-to-image models, referred to as $P+$. This space consists of multiple textual conditions, derived from per-layer prompts, each corresponding to a layer of the denoising U-net of the diffusion model. We show that the extended space provides greater disentangling and control over image synthesis. We further introduce Extended Textual Inve… ▽ More

    Submitted 15 July, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

  19. arXiv:2211.13752  [pdf, other

    cs.CV cs.GR cs.LG

    Sketch-Guided Text-to-Image Diffusion Models

    Authors: Andrey Voynov, Kfir Aberman, Daniel Cohen-Or

    Abstract: Text-to-Image models have introduced a remarkable leap in the evolution of machine learning, demonstrating high-quality synthesis of images from a given text-prompt. However, these powerful pretrained models still lack control handles that can guide spatial properties of the synthesized images. In this work, we introduce a universal approach to guide a pretrained text-to-image diffusion model, wit… ▽ More

    Submitted 24 November, 2022; originally announced November 2022.

  20. arXiv:2211.09794  [pdf, other

    cs.CV

    Null-text Inversion for Editing Real Images using Guided Diffusion Models

    Authors: Ron Mokady, Amir Hertz, Kfir Aberman, Yael Pritch, Daniel Cohen-Or

    Abstract: Recent text-guided diffusion models provide powerful image generation capabilities. Currently, a massive effort is given to enable the modification of these images using text only as means to offer intuitive and versatile editing. To edit a real image using these state-of-the-art tools, one must first invert the image with a meaningful text prompt into the pretrained model's domain. In this paper,… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  21. arXiv:2208.12242  [pdf, other

    cs.CV cs.GR cs.LG

    DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

    Authors: Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Yael Pritch, Michael Rubinstein, Kfir Aberman

    Abstract: Large text-to-image models achieved a remarkable leap in the evolution of AI, enabling high-quality and diverse synthesis of images from a given text prompt. However, these models lack the ability to mimic the appearance of subjects in a given reference set and synthesize novel renditions of them in different contexts. In this work, we present a new approach for "personalization" of text-to-image… ▽ More

    Submitted 15 March, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

    Comments: Published at CVPR 2023. Project page: https://dreambooth.github.io/

  22. arXiv:2208.01626  [pdf, other

    cs.CV cs.CL cs.GR cs.LG

    Prompt-to-Prompt Image Editing with Cross Attention Control

    Authors: Amir Hertz, Ron Mokady, Jay Tenenbaum, Kfir Aberman, Yael Pritch, Daniel Cohen-Or

    Abstract: Recent large-scale text-driven synthesis models have attracted much attention thanks to their remarkable capabilities of generating highly diverse images that follow given text prompts. Such text-based synthesis methods are particularly appealing to humans who are used to verbally describe their intent. Therefore, it is only natural to extend the text-driven image synthesis to text-driven image ed… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

  23. arXiv:2206.08010  [pdf, other

    cs.GR cs.AI cs.CV cs.LG

    MoDi: Unconditional Motion Synthesis from Diverse Data

    Authors: Sigal Raab, Inbal Leibovitch, Peizhuo Li, Kfir Aberman, Olga Sorkine-Hornung, Daniel Cohen-Or

    Abstract: The emergence of neural networks has revolutionized the field of motion synthesis. Yet, learning to unconditionally synthesize motions from a given distribution remains challenging, especially when the motions are highly diverse. In this work, we present MoDi -- a generative model trained in an unsupervised setting from an extremely diverse, unstructured and unlabeled dataset. During inference, Mo… ▽ More

    Submitted 18 December, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Video: https://youtu.be/O1sVzwrsNUg, Project page: https://sigal-raab.github.io/MoDi, Code: https://github.com/sigal-raab/MoDi

  24. arXiv:2205.02625  [pdf, other

    cs.GR cs.AI cs.CV cs.LG

    GANimator: Neural Motion Synthesis from a Single Sequence

    Authors: Peizhuo Li, Kfir Aberman, Zihan Zhang, Rana Hanocka, Olga Sorkine-Hornung

    Abstract: We present GANimator, a generative model that learns to synthesize novel motions from a single, short motion sequence. GANimator generates motions that resemble the core elements of the original motion, while simultaneously synthesizing novel and diverse movements. Existing data-driven techniques for motion synthesis require a large motion dataset which contains the desired and specific skeletal s… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: SIGGRAPH 2022. Project page: https://peizhuoli.github.io/ganimator/ , Video: https://www.youtube.com/watch?v=OV9VoHMEeyI

  25. arXiv:2203.17272  [pdf, other

    cs.CV cs.GR cs.LG

    MyStyle: A Personalized Generative Prior

    Authors: Yotam Nitzan, Kfir Aberman, Qiurui He, Orly Liba, Michal Yarom, Yossi Gandelsman, Inbar Mosseri, Yael Pritch, Daniel Cohen-or

    Abstract: We introduce MyStyle, a personalized deep generative prior trained with a few shots of an individual. MyStyle allows to reconstruct, enhance and edit images of a specific person, such that the output is faithful to the person's key facial characteristics. Given a small reference set of portrait images of a person (~100), we tune the weights of a pretrained StyleGAN face generator to form a local,… ▽ More

    Submitted 6 October, 2022; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: SIGGRAPH ASIA 2022, Project webpage: https://mystyle-personalized-prior.github.io/, Video: https://youtu.be/QvOdQR3tlOc

  26. Rhythm is a Dancer: Music-Driven Motion Synthesis with Global Structure

    Authors: Andreas Aristidou, Anastasios Yiannakidis, Kfir Aberman, Daniel Cohen-Or, Ariel Shamir, Yiorgos Chrysanthou

    Abstract: Synthesizing human motion with a global structure, such as a choreography, is a challenging task. Existing methods tend to concentrate on local smooth pose transitions and neglect the global context or the theme of the motion. In this work, we present a music-driven motion synthesis framework that generates long-term sequences of human motions which are synchronized with the input beats, and joint… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

    ACM Class: I.3.7; I.2

    Journal ref: IEEE Transactions on Visualization and Computer Graphics, Volume 29, Issue 8, August 2023

  27. arXiv:2109.01980  [pdf, other

    cs.CV cs.GR cs.LG

    Deep Saliency Prior for Reducing Visual Distraction

    Authors: Kfir Aberman, Junfeng He, Yossi Gandelsman, Inbar Mosseri, David E. Jacobs, Kai Kohlhoff, Yael Pritch, Michael Rubinstein

    Abstract: Using only a model that was trained to predict where people look at images, and no additional training data, we can produce a range of powerful editing effects for reducing distraction in images. Given an image and a mask specifying the region to edit, we backpropagate through a state-of-the-art saliency model to parameterize a differentiable editing operator, such that the saliency within the mas… ▽ More

    Submitted 4 September, 2021; originally announced September 2021.

    Comments: https://deep-saliency-prior.github.io/

  28. arXiv:2105.02451  [pdf, other

    cs.GR cs.CV cs.LG

    Learning Skeletal Articulations with Neural Blend Shapes

    Authors: Peizhuo Li, Kfir Aberman, Rana Hanocka, Libin Liu, Olga Sorkine-Hornung, Baoquan Chen

    Abstract: Animating a newly designed character using motion capture (mocap) data is a long standing problem in computer animation. A key consideration is the skeletal structure that should correspond to the available mocap data, and the shape deformation in the joint regions, which often requires a tailored, pose-specific refinement. In this work, we develop a neural technique for articulating 3D characters… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: SIGGRAPH 2021. Project page: https://peizhuoli.github.io/neural-blend-shapes/ , Video: https://youtu.be/antc20EFh6k

  29. arXiv:2012.09401  [pdf, other

    cs.CV

    Zoom-to-Inpaint: Image Inpainting with High-Frequency Details

    Authors: Soo Ye Kim, Kfir Aberman, Nori Kanazawa, Rahul Garg, Neal Wadhwa, Huiwen Chang, Nikhil Karnad, Munchurl Kim, Orly Liba

    Abstract: Although deep learning has enabled a huge leap forward in image inpainting, current methods are often unable to synthesize realistic high-frequency details. In this paper, we propose applying super-resolution to coarsely reconstructed outputs, refining them at high resolution, and then downscaling the output to the original resolution. By introducing high-resolution images to the refinement networ… ▽ More

    Submitted 29 June, 2022; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: Accepted to CVPRW 2022

  30. arXiv:2009.13856  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Neural Alignment for Face De-pixelization

    Authors: Maayan Shuvi, Noa Fish, Kfir Aberman, Ariel Shamir, Daniel Cohen-Or

    Abstract: We present a simple method to reconstruct a high-resolution video from a face-video, where the identity of a person is obscured by pixelization. This concealment method is popular because the viewer can still perceive a human face figure and the overall head motion. However, we show in our experiments that a fairly good approximation of the original video can be reconstructed in a way that comprom… ▽ More

    Submitted 29 September, 2020; originally announced September 2020.

    ACM Class: I.3.3; I.3.8; I.4.3; I.4.7; I.4.5; I.4.9; I.4.10; I.2.6; I.2.10

  31. arXiv:2006.12075  [pdf, other

    cs.CV cs.GR cs.LG

    MotioNet: 3D Human Motion Reconstruction from Monocular Video with Skeleton Consistency

    Authors: Mingyi Shi, Kfir Aberman, Andreas Aristidou, Taku Komura, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen

    Abstract: We introduce MotioNet, a deep neural network that directly reconstructs the motion of a 3D human skeleton from monocular video.While previous methods rely on either rigging or inverse kinematics (IK) to associate a consistent skeleton with temporally coherent joint rotations, our method is the first data-driven approach that directly outputs a kinematic skeleton, which is a complete, commonly used… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: Accepted to Transactions on Graphics (ToG) 2020. Project page: {https://rubbly.cn/publications/motioNet} Video: {https://youtu.be/8YubchlzvFA}

    ACM Class: I.4.5

    Journal ref: ACM Transaction on Graphics, 40(1), Article 1, 2020

  32. arXiv:2005.05751  [pdf, other

    cs.GR cs.CV cs.LG

    Unpaired Motion Style Transfer from Video to Animation

    Authors: Kfir Aberman, Yijia Weng, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen

    Abstract: Transferring the motion style from one animation clip to another, while preserving the motion content of the latter, has been a long-standing problem in character animation. Most existing data-driven approaches are supervised and rely on paired data, where motions with the same content are performed in different styles. In addition, these approaches are limited to transfer of styles that were seen… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

    Comments: SIGGRAPH 2020. Project page: https://deepmotionediting.github.io/style_transfer , Video: https://www.youtube.com/watch?v=m04zuBSdGrc , Code: https://github.com/DeepMotionEditing/deep-motion-editing

  33. arXiv:2005.05732  [pdf, other

    cs.CV cs.GR cs.LG

    Skeleton-Aware Networks for Deep Motion Retargeting

    Authors: Kfir Aberman, Peizhuo Li, Dani Lischinski, Olga Sorkine-Hornung, Daniel Cohen-Or, Baoquan Chen

    Abstract: We introduce a novel deep learning framework for data-driven motion retargeting between skeletons, which may have different structure, yet corresponding to homeomorphic graphs. Importantly, our approach learns how to retarget without requiring any explicit pairing between the motions in the training set. We leverage the fact that different homeomorphic skeletons may be reduced to a common primal s… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

    Comments: SIGGRAPH 2020. Project page: https://deepmotionediting.github.io/retargeting , Video: https://www.youtube.com/watch?v=ym8Tnmiz5N8

  34. Learning Character-Agnostic Motion for Motion Retargeting in 2D

    Authors: Kfir Aberman, Rundi Wu, Dani Lischinski, Baoquan Chen, Daniel Cohen-Or

    Abstract: Analyzing human motion is a challenging task with a wide variety of applications in computer vision and in graphics. One such application, of particular importance in computer animation, is the retargeting of motion from one performer to another. While humans move in three dimensions, the vast majority of human motions are captured using video, requiring 2D-to-3D pose and camera recovery, before e… ▽ More

    Submitted 5 May, 2019; originally announced May 2019.

    Comments: SIGGRAPH 2019. arXiv admin note: text overlap with arXiv:1804.05653 by other authors

  35. arXiv:1808.06847  [pdf, other

    cs.CV cs.GR

    Deep Video-Based Performance Cloning

    Authors: Kfir Aberman, Mingyi Shi, Jing Liao, Dani Lischinski, Baoquan Chen, Daniel Cohen-Or

    Abstract: We present a new video-based performance cloning technique. After training a deep generative network using a reference video capturing the appearance and dynamics of a target actor, we are able to generate videos where this actor reenacts other performances. All of the training data and the driving performances are provided as ordinary video segments, without motion capture or depth information. O… ▽ More

    Submitted 21 August, 2018; originally announced August 2018.

  36. Neural Best-Buddies: Sparse Cross-Domain Correspondence

    Authors: Kfir Aberman, Jing Liao, Mingyi Shi, Dani Lischinski, Baoquan Chen, Daniel Cohen-Or

    Abstract: Correspondence between images is a fundamental problem in computer vision, with a variety of graphics applications. This paper presents a novel method for sparse cross-domain correspondence. Our method is designed for pairs of images where the main objects of interest may belong to different semantic categories and differ drastically in shape and appearance, yet still contain semantically related… ▽ More

    Submitted 21 August, 2018; v1 submitted 10 May, 2018; originally announced May 2018.

    Comments: SIGGRAPH 2018

  37. Sub-Nyquist SAR via Fourier Domain Range Doppler Processing

    Authors: Kfir Aberman, Yonina C. Eldar

    Abstract: Conventional Synthetic Aperture Radar (SAR) systems are limited in their ability to satisfy the increasing requirement for improved spatial resolution and wider coverage. The demand for high resolution requires high sampling rates, while coverage is limited by the pulse repetition frequency. Consequently, sampling rate reduction is of high practical value in SAR imaging. In this paper, we introduc… ▽ More

    Submitted 9 May, 2017; v1 submitted 14 August, 2016; originally announced August 2016.

    Comments: Submitted to IEEE transactions on geoscience and remote sensing