Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3474085.3475531acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

SOGAN: 3D-Aware Shadow and Occlusion Robust GAN for Makeup Transfer

Published: 17 October 2021 Publication History

Abstract

In recent years, virtual makeup applications have become more and more popular. However, it is still challenging to propose a robust makeup transfer method in the real-world environment. Current makeup transfer methods mostly work well on good-conditioned clean makeup images, but transferring makeup that exhibits shadow and occlusion is not satisfying. To alleviate it, we propose a novel makeup transfer method, called 3D-Aware Shadow and Occlusion Robust GAN (SOGAN). Given the source and the reference faces, we first fit a 3D face model and then disentangle the faces into shape and texture. In the texture branch, we map the texture to the UV space and design a UV texture generator to transfer the makeup. Since human faces are symmetrical in the UV space, we can conveniently remove the undesired shadow and occlusion from the reference image by carefully designing a Flip Attention Module (FAM). After obtaining cleaner makeup features from the reference image, a Makeup Transfer Module (MTM) is introduced to perform accurate makeup transfer. The qualitative and quantitative experiments demonstrate that our SOGAN not only achieves superior results in shadow and occlusion situations but also performs well in large pose and expression variations.

References

[1]
Volker Blanz, Curzio Basso, Tomaso Poggio, and Thomas Vetter. 2003. Reanimating faces in images and video. In Computer graphics forum.
[2]
Volker Blanz and Thomas Vetter. 1999. A morphable model for the synthesis of 3D faces. In International Conference on Computer Graphics and Interactive Techniques.
[3]
Volker Blanz and Thomas Vetter. 2003. Face recognition based on fitting a 3D morphable model. IEEE Transactions on Pattern Analysis and Machine Intelligence (2003).
[4]
Huiwen Chang, Jingwan Lu, Fisher Yu, and Adam Finkelstein. 2018. Pairedcyclegan: Asymmetric style transfer for applying and removing makeup. In IEEE Conference on Computer Vision and Pattern Recognition.
[5]
Hung-Jen Chen, Ka-Ming Hui, Szu-Yu Wang, Li-Wu Tsao, Hong-Han Shuai, and Wen-Huang Cheng. 2019. Beautyglow: On-demand makeup transfer framework with reversible generative network. In IEEE Conference on Computer Vision and Pattern Recognition.
[6]
Qiyao Deng, Jie Cao, Yunfan Liu, Zhenhua Chai, Qi Li, and Zhenan Sun. 2020 a. Reference guided face component editing. International Joint Conferences on Artificial Intelligence (2020).
[7]
Yu Deng, Jiaolong Yang, Dong Chen, Fang Wen, and Xin Tong. 2020 b. Disentangled and controllable face image generation via 3D imitative-contrastive learning. In IEEE Conference on Computer Vision and Pattern Recognition.
[8]
Yao Feng, Fan Wu, Xiaohu Shao, Yanfeng Wang, and Xi Zhou. 2018. Joint 3D face reconstruction and dense alignment with position map regression network. In European Conference on Computer Vision.
[9]
Chaoyou Fu, Xiang Wu, Yibo Hu, Huaibo Huang, and Ran He. 2019. Dual variational generation for low shot heterogeneous face recognition. In Advances in Neural Information Processing Systems.
[10]
Chaoyou Fu, Xiang Wu, Yibo Hu, Huaibo Huang, and Ran He. 2021. Dvg-face: Dual variational generation for heterogeneous face recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021).
[11]
Baris Gecer, Binod Bhattarai, Josef Kittler, and Tae-Kyun Kim. 2018. Semi-supervised adversarial learning to generate photorealistic face images of new identities from 3D morphable model. In European Conference on Computer Vision.
[12]
Zhenglin Geng, Chen Cao, and Sergey Tulyakov. 2019. 3D guided fine-grained face manipulation. In IEEE Conference on Computer Vision and Pattern Recognition.
[13]
Qiao Gu, Guanzhi Wang, Mang Tik Chiu, Yu-Wing Tai, and Chi-Keung Tang. 2019. Ladn: Local adversarial disentangling network for facial makeup and de-makeup. In International Conference on Computer Vision.
[14]
Jianzhu Guo, Xiangyu Zhu, Yang Yang, Fan Yang, Zhen Lei, and Stan Z Li. 2020. Towards fast, accurate and stable 3D dense face alignment. In European Conference on Computer Vision.
[15]
Xiaobin Hu, Wenqi Ren, John LaMaster, Xiaochun Cao, Xiaoming Li, Zechao Li, Bjoern Menze, and Wei Liu. 2020. Face super-resolution guided by 3D facial priors. In European Conference on Computer Vision.
[16]
Wentao Jiang, Si Liu, Chen Gao, Jie Cao, Ran He, Jiashi Feng, and Shuicheng Yan. 2020. Psgan: Pose and expression robust spatial-aware gan for customizable makeup transfer. In IEEE Conference on Computer Vision and Pattern Recognition.
[17]
Hyeongwoo Kim, Pablo Garrido, Ayush Tewari, Weipeng Xu, Justus Thies, Matthias Niessner, Patrick Pérez, Christian Richardt, Michael Zollhöfer, and Christian Theobalt. 2018. Deep video portraits. ACM Transactions on Graphics (2018).
[18]
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. International Conference on Learning Representations.
[19]
Peipei Li, Yibo Hu, Ran He, and Zhenan Sun. 2019. Global and local consistent wavelet-domain age synthesis. IEEE Transactions on Information Forensics and Security (2019).
[20]
Peipei Li, Huaibo Huang, Yibo Hu, Xiang Wu, Ran He, and Zhenan Sun. 2020 a. Hierarchical face aging through disentangled latent characteristics. In European Conference on Computer Vision.
[21]
Peipei Li, Yinglu Liu, Hailin Shi, Xiang Wu, Yibo Hu, Ran He, and Zhenan Sun. 2020 b. Dual-Structure Disentangling Variational Generation for Data-Limited Face Parsing. In ACM International Conference on Multimedia.
[22]
Tingting Li, Ruihe Qian, Chao Dong, Si Liu, Qiong Yan, Wenwu Zhu, and Liang Lin. 2018. Beautygan: Instance-level facial makeup transfer with deep generative adversarial network. In ACM International Conference on Multimedia.
[23]
Koki Nagano, Jaewoo Seo, Jun Xing, Lingyu Wei, Zimo Li, Shunsuke Saito, Aviral Agarwal, Jens Fursund, and Hao Li. 2018. paGAN: real-time avatars using dynamic textures. ACM Transactions on Graphics (2018).
[24]
Pascal Paysan, Reinhard Knothe, Brian Amberg, Sami Romdhani, and Thomas Vetter. 2009. A 3D face model for pose and illumination invariant face recognition. In IEEE International Conference on Advanced Video and Signal based Surveillance.
[25]
Wenqi Ren, Jiaolong Yang, Senyou Deng, David Wipf, Xiaochun Cao, and Xin Tong. 2019. Face video deblurring using 3D facial priors. In International Conference on Computer Vision.
[26]
Ayush Tewari, Mohamed Elgharib, Gaurav Bharaj, Florian Bernard, Hans-Peter Seidel, Patrick Pérez, Michael Zollhofer, and Christian Theobalt. 2020. Stylerig: Rigging stylegan for 3D control over portrait images. In IEEE Conference on Computer Vision and Pattern Recognition.
[27]
Justus Thies, Michael Zollhöfer, and Matthias Nießner. 2019. Deferred neural rendering: Image synthesis using neural textures. ACM Transactions on Graphics (2019).
[28]
Justus Thies, Michael Zollhofer, Marc Stamminger, Christian Theobalt, and Matthias Nießner. 2016. Face2face: Real-time face capture and reenactment of rgb videos. In IEEE Conference on Computer Vision and Pattern Recognition.
[29]
Xiaolong Wang, Ross Girshick, Abhinav Gupta, and Kaiming He. 2018. Non-local neural networks. In IEEE Conference on Computer Vision and Pattern Recognition.
[30]
Shangzhe Wu, Christian Rupprecht, and Andrea Vedaldi. 2020. Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild. In IEEE Conference on Computer Vision and Pattern Recognition.
[31]
Sicheng Xu, Jiaolong Yang, Dong Chen, Fang Wen, Yu Deng, Yunde Jia, and Xin Tong. 2020. Deep 3D portrait from a single image. In IEEE Conference on Computer Vision and Pattern Recognition.
[32]
Xi Yin, Xiang Yu, Kihyuk Sohn, Xiaoming Liu, and Manmohan Chandraker. 2017. Towards large-pose face frontalization in the wild. In International Conference on Computer Vision.
[33]
Han Zhang, Ian Goodfellow, Dimitris Metaxas, and Augustus Odena. 2019. Self-attention generative adversarial networks. In International Conference on Machine Learning.
[34]
Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A Efros. 2017. Unpaired image-to-image translation using cycle-consistent adversarial networks. In International Conference on Computer Vision.
[35]
Xiangyu Zhu, Zhen Lei, Xiaoming Liu, Hailin Shi, and Stan Z Li. 2016. Face alignment across large poses: A 3D solution. In IEEE Conference on Computer Vision and Pattern Recognition.
[36]
Xiangyu Zhu, Fan Yang, Di Huang, Chang Yu, Hao Wang, Jianzhu Guo, Zhen Lei, and Stan Z Li. 2020. Beyond 3DMM space: Towards fine-grained 3D face reconstruction. In European Conference on Computer Vision.

Cited By

View all
  • (2025)MuNeRF: Robust Makeup Transfer in Neural Radiance FieldsIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2024.336844331:3(1746-1757)Online publication date: Mar-2025
  • (2025)SSAT++: A Semantic-Aware and Versatile Makeup Transfer Network With Local Color Consistency ConstraintIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.333206536:1(1287-1301)Online publication date: Jan-2025
  • (2024)Language-Driven Interactive Shadow DetectionProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681192(5527-5536)Online publication date: 28-Oct-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '21: Proceedings of the 29th ACM International Conference on Multimedia
October 2021
5796 pages
ISBN:9781450386517
DOI:10.1145/3474085
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. facial makeup transfer
  2. generative models

Qualifiers

  • Research-article

Funding Sources

Conference

MM '21
Sponsor:
MM '21: ACM Multimedia Conference
October 20 - 24, 2021
Virtual Event, China

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)29
  • Downloads (Last 6 weeks)3
Reflects downloads up to 08 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2025)MuNeRF: Robust Makeup Transfer in Neural Radiance FieldsIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2024.336844331:3(1746-1757)Online publication date: Mar-2025
  • (2025)SSAT++: A Semantic-Aware and Versatile Makeup Transfer Network With Local Color Consistency ConstraintIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.333206536:1(1287-1301)Online publication date: Jan-2025
  • (2024)Language-Driven Interactive Shadow DetectionProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681192(5527-5536)Online publication date: 28-Oct-2024
  • (2024)High Fidelity Makeup via 2D and 3D Identity Preservation NetACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365647520:8(1-24)Online publication date: 13-Jun-2024
  • (2024)LipAT: Beyond Style Transfer for Controllable Neural Simulation of Lipstick using Cosmetic Attributes2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00786(8031-8040)Online publication date: 3-Jan-2024
  • (2024)DRAN: Detailed Region-Adaptive Normalization for Conditional Image SynthesisIEEE Transactions on Multimedia10.1109/TMM.2023.329048126(1969-1982)Online publication date: 1-Jan-2024
  • (2024)FaceCLIP: Facial Image-to-Video Translation via a Brief Text DescriptionIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.333092034:6(4270-4284)Online publication date: Jun-2024
  • (2024)Hybrid Transformers With Attention-Guided Spatial Embeddings for Makeup Transfer and RemovalIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.331279034:4(2876-2890)Online publication date: Apr-2024
  • (2024)Shadow-Aware Makeup Transfer with Lighting Adaptation2024 IEEE International Conference on Image Processing (ICIP)10.1109/ICIP51287.2024.10647290(2271-2277)Online publication date: 27-Oct-2024
  • (2024)A Fine Rendering High-Resolution Makeup Transfer network via inversion-editing strategyEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.109346138(109346)Online publication date: Dec-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media