Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Open access

Generative Adversarial Networks in Computer Vision: A Survey and Taxonomy

Published: 09 February 2021 Publication History
  • Get Citation Alerts
  • Abstract

    Generative adversarial networks (GANs) have been extensively studied in the past few years. Arguably their most significant impact has been in the area of computer vision where great advances have been made in challenges such as plausible image generation, image-to-image translation, facial attribute manipulation, and similar domains. Despite the significant successes achieved to date, applying GANs to real-world problems still poses significant challenges, three of which we focus on here. These are as follows: (1) the generation of high quality images, (2) diversity of image generation, and (3) stabilizing training. Focusing on the degree to which popular GAN technologies have made progress against these challenges, we provide a detailed review of the state-of-the-art in GAN-related research in the published scientific literature. We further structure this review through a convenient taxonomy we have adopted based on variations in GAN architectures and loss functions. While several reviews for GANs have been presented to date, none have considered the status of this field based on their progress toward addressing practical challenges relevant to computer vision. Accordingly, we review and critically discuss the most popular architecture-variant, and loss-variant GANs, for tackling these challenges. Our objective is to provide an overview as well as a critical analysis of the status of GAN research in terms of relevant progress toward critical computer vision application requirements. As we do this we also discuss the most compelling applications in computer vision in which GANs have demonstrated considerable success along with some suggestions for future research directions. Codes related to the GAN-variants studied in this work is summarized on https://github.com/sheqi/GAN_Review.

    Supplementary Material

    a37-wang-suppl.pdf (wang.zip)
    Supplemental movie, appendix, image and software files for, Generative Adversarial Networks in Computer Vision: A Survey and Taxonomy

    References

    [1]
    Darius Afchar, Vincent Nozick, Junichi Yamagishi, and Isao Echizen. 2018. MesoNet: A compact facial video forgery detection network. In Proceedings of the 2018 IEEE International Workshop on Information Forensics and Security (WIFS’18). IEEE, 1--7.
    [2]
    Jeongyoun Ahn and J. S. Marron. 2010. The maximal data piling direction for discrimination. Biometrika 97, 1 (2010), 254--259.
    [3]
    Martin Arjovsky and Léon Bottou. 2017. Towards Principled Methods for Training Generative Adversarial Networks. arXiv:1701.04862. Retrieved from https://arxiv.org/abs/1701.04862.
    [4]
    Martin Arjovsky, Soumith Chintala, and Léon Bottou. 2017. Wasserstein GAN. arXiv:1701.07875. Retrieved from https://arxiv.org/abs/1701.07875.
    [5]
    Larry Armijo. 1966. Minimization of functions having Lipschitz continuous first partial derivatives. Pac. J. Math. 16, 1 (1966), 1--3.
    [6]
    Mathieu Aubry, Daniel Maturana, Alexei A. Efros, Bryan C. Russell, and Josef Sivic. 2014. Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 3762--3769.
    [7]
    Jimmy Lei Ba, Jamie Ryan Kiros, and Geoffrey E. Hinton. 2016. Layer Normalization. arXiv:1607.06450. Retrieved from https://arxiv.org/abs/1607.06450.
    [8]
    Dzmitry Bahdanau, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron Courville, and Yoshua Bengio. 2016. An Actor-critic Algorithm for Sequence Prediction. arXiv:1607.07086. Retrieved from https://arxiv.org/abs/1607.07086.
    [9]
    Connelly Barnes, Eli Shechtman, Adam Finkelstein, and Dan B. Goldman. 2009. PatchMatch: A randomized correspondence algorithm for structural image editing. ACM Trans. Graph. 28, 3 (2009), 24.
    [10]
    Shane Barratt and Rishi Sharma. 2018. A Note on the Inception Score. arXiv:1801.01973. Retrieved from https://arxiv.org/abs/1801.01973.
    [11]
    David Berthelot, Thomas Schumm, and Luke Metz. 2017. BEGAN: Boundary Equilibrium Generative Adversarial Networks. arXiv:1703.10717. Retrieved from https://arxiv.org/abs/1703.10717.
    [12]
    Ali Borji. 2019. Pros and cons of GAN evaluation measures. Comput. Vis. Image Understand. 179 (2019), 41--65.
    [13]
    Andrew Brock, Jeff Donahue, and Karen Simonyan. 2018. Large Scale GAN Training for High Fidelity Natural Image Synthesis. arXiv:1809.11096. Retrieved from https://arxiv.org/abs/1809.11096.
    [14]
    Andrew Brock, Theodore Lim, James M. Ritchie, and Nick Weston. 2016. Neural Photo Editing with Introspective Adversarial Networks. arXiv:1609.07093. Retrieved from https://arxiv.org/abs/1609.07093.
    [15]
    Eoin Brophy, Zhengwei Wang, and Tomas E. Ward. 2019. Quick and Easy Time Series Generation with Established Image-based GANs. arXiv:1902.05624. Retrieved from https://arxiv.org/abs/1902.05624.
    [16]
    Peter Burt and Edward Adelson. 1983. The laplacian pyramid as a compact image code. IEEE Trans. Commun. 31, 4 (1983), 532--540.
    [17]
    Iain Carmichael and J. S. Marron. 2017. Geometric Insights into Support Vector Machine Behavior Using the KKT Conditions. arXiv:1704.00767. Retrieved from https://arxiv.org/abs/1704.00767.
    [18]
    Tong Che, Yanran Li, Athul Paul Jacob, Yoshua Bengio, and Wenjie Li. 2016. Mode Regularized Generative Adversarial Networks. arXiv:1612.02136. Retrieved from https://arxiv.org/abs/1612.02136.
    [19]
    Ting Chen, Xiaohua Zhai, Marvin Ritter, Mario Lucic, and Neil Houlsby. 2019. Self-supervised GANs via auxiliary rotation loss. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’19).
    [20]
    Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, and Pieter Abbeel. 2016. InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets. In Advances in Neural Information Processing Systems. 2172--2180.
    [21]
    Zeyuan Chen, Shaoliang Nie, Tianfu Wu, and Christopher G. Healey. 2018. High Resolution Face Completion with Multiple Controllable Attributes via Fully End-to-end Progressive Generative Adversarial Networks. arXiv:1801.07632. Retrieved from https://arxiv.org/abs/1801.07632.
    [22]
    Junsuk Choe, Song Park, Kyungmin Kim, Joo Hyun Park, Dongseob Kim, and Hyunjung Shim. 2017. Face generation for low-shot learning using generative adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision. 1940--1948.
    [23]
    Yunjey Choi, Minje Choi, Munyoung Kim, Jung-Woo Ha, Sunghun Kim, and Jaegul Choo. 2018. StarGAN: Unified generative adversarial networks for multi-domain image-to-image translation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 8789--8797.
    [24]
    A. Clark, J. Donahue, and K. Simonyan. 2019. Adversarial Video Generation on Complex Datasets. arXiv:1907.06571. Retrieved from https://arxiv.org/abs/1907.0657.
    [25]
    Adam Coates, Andrew Ng, and Honglak Lee. 2011. An analysis of single-layer networks in unsupervised feature learning. In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics. 215--223.
    [26]
    Antonia Creswell, Tom White, Vincent Dumoulin, Kai Arulkumaran, Biswa Sengupta, and Anil A. Bharath. 2018. Generative adversarial networks: An overview. IEEE Sign. Process. Mag. 35, 1 (2018), 53--65.
    [27]
    Quanyu Dai, Qiang Li, Jian Tang, and Dan Wang. 2018. Adversarial network embedding. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence.
    [28]
    Zihang Dai, Zhilin Yang, Fan Yang, William W. Cohen, and Ruslan R. Salakhutdinov. 2017. Good semi-supervised learning that requires a bad GAN. In Advances in Neural Information Processing Systems. 6510--6520.
    [29]
    Giannis Daras, Augustus Odena, Han Zhang, and Alexandros G. Dimakis. 2020. Your local GAN: Designing two dimensional local attention mechanisms for generative models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20).
    [30]
    Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. ImageNet: A large-scale hierarchical image database. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 248--255.
    [31]
    Emily L. Denton, Soumith Chintala, Rob Fergus, et al. 2015. Deep generative image models using a laplacian pyramid of adversarial networks. In Advances in Neural Information Processing Systems. 1486--1494.
    [32]
    Carl Doersch. 2016. Tutorial on Variational Autoencoders. arXiv:1606.05908. Retrieved from https://arxiv.org/abs/1606.05908.
    [33]
    Brian Dolhansky and Cristian Canton Ferrer. 2018. Eye in-painting with exemplar generative adversarial networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 7902--7911.
    [34]
    Chris Donahue, Julian McAuley, and Miller Puckette. 2018. Synthesizing Audio with Generative Adversarial Networks. arXiv:1802.04208. Retrieved from https://arxiv.orb/abs/1802.04208.
    [35]
    Jeff Donahue, Philipp Krähenbühl, and Trevor Darrell. 2016. Adversarial Feature Learning. arXiv:1605.09782. Retrieved from https://arxiv.org/abs/1605.0978.
    [36]
    Tzanko Donchev and Elza Farkhi. 1998. Stability and euler approximation of one-sided lipschitz differential inclusions. SIAM J. Contr. Optim. 36, 2 (1998), 780--796.
    [37]
    Hao Dong, Simiao Yu, Chao Wu, and Yike Guo. 2017. Semantic image synthesis via adversarial learning. In Proceedings of the IEEE International Conference on Computer Vision. 5706--5714.
    [38]
    Gintare Karolina Dziugaite, Daniel M. Roy, and Zoubin Ghahramani. 2015. Training Generative Neural Networks via Maximum Mean Discrepancy Optimization. arXiv:1505.03906. Retrieved from https://arxiv.org/abs/1505.03906.
    [39]
    Cristóbal Esteban, Stephanie L. Hyland, and Gunnar Rätsch. 2017. Real-valued (Medical) Time Series Generation with Recurrent Conditional GANs. arXiv:1706.02633. Retrieved from https://arxiv.org/abs/1706.02633.
    [40]
    William Fedus, Ian Goodfellow, and Andrew M. Dai. 2018. MaskGAN: Better Text Generation via Filling in the _. arXiv:1801.07736. Retrieved from https://arxiv.org/abs/1801.07736.
    [41]
    Xavier Glorot, Antoine Bordes, and Yoshua Bengio. 2011. Deep sparse rectifier neural networks. In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics. 315--323.
    [42]
    A. A. Goldstein. 1977. Optimization of Lipschitz continuous functions. Math. Program. 13, 1 (1977), 14--22.
    [43]
    Xinyu Gong, Shiyu Chang, Yifan Jiang, and Zhangyang Wang. 2019. AutoGAN: Neural architecture search for generative adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’19).
    [44]
    Ian Goodfellow. 2016. NIPS 2016 Tutorial: Generative Adversarial Networks. arXiv:1701.00160. Retrieved from https://arxiv.orb/abs/1701.00160.
    [45]
    Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in Neural Information Processing Systems. 2672--2680.
    [46]
    Arthur Gretton, Karsten M. Borgwardt, Malte J. Rasch, Bernhard Schölkopf, and Alexander Smola. 2012. A kernel two-sample test. J. Mach. Learn. Res. 13, 1 (Mar. 2012), 723--773.
    [47]
    Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin, and Aaron C. Courville. 2017. Improved training of wasserstein GANs. In Advances in Neural Information Processing Systems. 5767--5777.
    [48]
    Kay Gregor Hartmann, Robin Tibor Schirrmeister, and Tonio Ball. 2018. EEG-GAN: Generative Adversarial Networks for Electroencephalographic Brain Signals. arXiv:1806.01875. Retrieved from https://arxiv.org/abs/1806.01875.
    [49]
    Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, and Sepp Hochreiter. 2017. GANs trained by a two time-scale update rule converge to a local nash equilibrium. In Advances in Neural Information Processing Systems. 6626--6637.
    [50]
    Jean-Baptiste Hiriart-Urruty and Claude Lemaréchal. 2012. Fundamentals of Convex Analysis. Springer Science 8 Business Media.
    [51]
    Saifuddin Hitawala. 2018. Comparative Study on Generative Adversarial Networks. arXiv:1801.04271 (2). Retrieved from https://arxiv.org/abs/1801.04271.
    [52]
    Yongjun Hong, Uiwon Hwang, Jaeyoon Yoo, and Sungroh Yoon. 2019. How generative adversarial networks and their variants work: An overview. ACM Comput. Surv. 52, 1 (2019), 10.
    [53]
    Chih-Chung Hsu, Chia-Yen Lee, and Yi-Xiu Zhuang. 2018. Learning to detect fake face images in the wild. In Proceedings of the 2018 International Symposium on Computer, Consumer and Control (IS3C’18). IEEE, 388--391.
    [54]
    Jia-Bin Huang, Sing Bing Kang, Narendra Ahuja, and Johannes Kopf. 2014. Image completion using planar structure guidance. ACM Trans. Graph. 33, 4 (2014), 1--10.
    [55]
    Satoshi Iizuka, Edgar Simo-Serra, and Hiroshi Ishikawa. 2017. Globally and locally consistent image completion. ACM Trans. Graph. 36, 4 (2017), 107.
    [56]
    Daniel Jiwoong Im, He Ma, Graham Taylor, and Kristin Branson. 2018. Quantitatively Evaluating GANs with Divergences Proposed for Training. arXiv:1803.01045. Retrieved from https://arxiv.org/abs/1803.01045.
    [57]
    Sergey Ioffe and Christian Szegedy. 2015. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. arXiv:1502.03167. Retrieved from https://arxiv.org/abs/1502.03167.
    [58]
    Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, and Alexei A. Efros. 2017. Image-to-image translation with conditional adversarial networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1125--1134.
    [59]
    Nikolay Jetchev, Urs Bergmann, and Roland Vollgraf. 2016. Texture Synthesis with Spatial Generative Adversarial Networks. arXiv:1611.08207. Retrieved from https://arxiv.org/abs/1611.08207.
    [60]
    Alexia Jolicoeur-Martineau. 2018. The Relativistic Discriminator: A Key Element Missing from Standard GAN. arXiv:1807.00734. Retrieved from https://arxiv.org/abs/1807.00734.
    [61]
    Emmanuel Kahembwe and Subramanian Ramamoorthy. 2019. Lower Dimensional Kernels for Video Discriminators. arXiv:1912.08860. Retrieved from https://arxiv.org/abs/1912.08860.
    [62]
    Takuhiro Kaneko, Yoshitaka Ushiku, and Tatsuya Harada. 2019. Label-noise robust generative adversarial networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2467--2476.
    [63]
    Animesh Karnewar and Oliver Wang. 2020. MSG-GAN: Multi-scale gradients for generative adversarial networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 7799--7808.
    [64]
    Tero Karras, Timo Aila, Samuli Laine, and Jaakko Lehtinen. 2017. Progressive Growing of GANs for Improved Quality, Stability, and Variation. arXiv:1710.10196. Retrieved from https://arxiv.org/abs/1710.10196.
    [65]
    Tero Karras, Samuli Laine, and Timo Aila. 2018. A Style-based Generator Architecture for Generative Adversarial Networks. arXiv:1812.04948. Retrieved from https://arxiv.org/abs/1812.04948.
    [66]
    Taeksoo Kim, Moonsu Cha, Hyunsoo Kim, Jung Kwon Lee, and Jiwon Kim. 2017. Learning to discover cross-domain relations with generative adversarial networks. In Proceedings of the 34th International Conference on Machine Learning, Volume 70. 1857--1865.
    [67]
    Diederik P. Kingma and Max Welling. 2013. Auto-encoding Variational Bayes. arXiv:1312.6114. Retrieved from https://arxiv.org/abs/1312.6114.
    [68]
    Naveen Kodali, Jacob Abernethy, James Hays, and Zsolt Kira. 2017. On Convergence and Stability of GANs. arXiv:1705.07215. Retrieved from https://arxiv.org/abs/1705.07215.
    [69]
    Jean Kossaifi, Linh Tran, Yannis Panagakis, and Maja Pantic. 2018. GANGAN: Geometry-aware generative adversarial networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 878--887.
    [70]
    Alex Krizhevsky and Geoffrey Hinton. 2009. Learning Multiple Layers of Features from Tiny Images. Technical Report. Citeseer.
    [71]
    Karol Kurach, Mario Lucic, Xiaohua Zhai, Marcin Michalski, and Sylvain Gelly. 2018. The GAN Landscape: Losses, Architectures, Regularization, and Normalization. arXiv:1807.04720. Retrieved from https://arxiv.org/abs/1807.04720.
    [72]
    Christoph Lassner, Gerard Pons-Moll, and Peter V. Gehler. 2017. A generative model of people in clothing. In Proceedings of the IEEE International Conference on Computer Vision. 853--862.
    [73]
    Yann LeCun, Léon Bottou, Yoshua Bengio, Patrick Haffner, et al. 1998. Gradient-based learning applied to document recognition. Proc. IEEE 86, 11 (1998), 2278--2324.
    [74]
    Christian Ledig, Lucas Theis, Ferenc Huszár, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, et al. 2017. Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 105--114.
    [75]
    Dan Li, Dacheng Chen, Lei Shi, Baihong Jin, Jonathan Goh, and See-Kiong Ng. 2019. MAD-GAN: Multivariate Anomaly Detection for Time Series Data with Generative Adversarial Networks. arXiv:1901.04997. Retrieved from https://arxiv.org/abs/1901.04997.
    [76]
    Jiwei Li, Will Monroe, Alan Ritter, Michel Galley, Jianfeng Gao, and Dan Jurafsky. 2016. Deep Reinforcement Learning for Dialogue Generation. arXiv:1606.01541. Retrieved from https://arxiv.org/abs/1606.01541.
    [77]
    Yijun Li, Sifei Liu, Jimei Yang, and Ming-Hsuan Yang. 2017. Generative face completion. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3911--3919.
    [78]
    Yujia Li, Alexander Schwing, Kuan-Chieh Wang, and Richard Zemel. 2017. Dualing GANs. In Advances in Neural Information Processing Systems. 5606--5616.
    [79]
    Yujia Li, Kevin Swersky, and Rich Zemel. 2015. Generative moment matching networks. In Proceedings of the International Conference on Machine Learning. 1718--1727.
    [80]
    Jae Hyun Lim and Jong Chul Ye. 2017. Geometric GAN. arXiv:1705.02894. Retrieved from https://arxiv.org/abs/1705.02894.
    [81]
    Cheng-Lin Liu, Fei Yin, Da-Han Wang, and Qiu-Feng Wang. 2013. Online and offline handwritten chinese character recognition: Benchmarking on new databases. Pattern Recogn. 46, 1 (2013), 155--162.
    [82]
    Ming-Yu Liu, Thomas Breuel, and Jan Kautz. 2017. Unsupervised image-to-image translation networks. In Advances in Neural Information Processing Systems. 700--708.
    [83]
    Ming-Yu Liu and Oncel Tuzel. 2016. Coupled generative adversarial networks. In Advances in Neural Information Processing Systems. 469--477.
    [84]
    Ziwei Liu, Ping Luo, Xiaogang Wang, and Xiaoou Tang. 2018. Deep learning face attributes in the wild. In Proceedings of International Conference on Computer Vision (ICCV'15).
    [85]
    Jonathan Long, Evan Shelhamer, and Trevor Darrell. 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3431--3440.
    [86]
    Pauline Luc, Camille Couprie, Soumith Chintala, and Jakob Verbeek. 2016. Semantic Segmentation Using Adversarial Networks. arXiv:1611.08408. Retrieved from https://arxiv.org/abs/1611.08408.
    [87]
    Yonghong Luo, Xiangrui Cai, Ying Zhang, Jun Xu, et al. 2018. Multivariate time series imputation with generative adversarial networks. In Advances in Neural Information Processing Systems. 1596--1607.
    [88]
    Sebastian Lutz, Konstantinos Amplianitis, and Aljosa Smolic. 2018. AlphaGAN: Generative Adversarial Networks for Natural Image Matting. arXiv:1807.10088. Retrieved from https://arxiv.org/abs/1807.10088.
    [89]
    Liqian Ma, Xu Jia, Qianru Sun, Bernt Schiele, Tinne Tuytelaars, and Luc Van Gool. 2017. Pose guided person image generation. In Advances in Neural Information Processing Systems. 406--416.
    [90]
    Shuang Ma, Jianlong Fu, Chang Wen Chen, and Tao Mei. 2018. DA-GAN: Instance-level image translation by deep attention generative adversarial networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5657--5666.
    [91]
    Dwarikanath Mahapatra, Behzad Bozorgtabar, and Rahil Garnavi. 2019. Image super-resolution using progressive generative adversarial networks for medical image analysis. Comput. Med. Imag. Graph. 71 (2019), 30--39.
    [92]
    Dwarikanath Mahapatra, Behzad Bozorgtabar, Sajini Hewavitharanage, and Rahil Garnavi. 2017. Image super resolution using generative adversarial networks and local saliency maps for retinal image analysis. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 382--390.
    [93]
    Devraj Mandal, Sanath Narayan, Sai Kumar Dwivedi, Vikram Gupta, Shuaib Ahmed, Fahad Shahbaz Khan, and Ling Shao. 2019. Out-of-distribution detection for generalized zero-shot action recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 9985--9993.
    [94]
    Xudong Mao, Qing Li, Haoran Xie, Raymond Y. K. Lau, Zhen Wang, and Stephen Paul Smolley. 2017. Least squares generative adversarial networks. In Proceedings of the 2017 IEEE International Conference on Computer Vision. IEEE, 2813--2821.
    [95]
    Xudong Mao, Qing Li, Haoran Xie, Raymond Y. K. Lau, Zhen Wang, and Stephen Paul Smolley. 2018. On the effectiveness of least squares generative adversarial networks. IEEE Trans. Pattern Anal. Mach. Intell. 41, 12 (2018), 2947--2960.
    [96]
    James Stephen Marron, Michael J. Todd, and Jeongyoun Ahn. 2007. Distance-weighted discrimination. J. Am. Stat. Assoc. 102, 480 (2007), 1267--1271.
    [97]
    Michael McCloskey and Neal J. Cohen. 1989. Catastrophic interference in connectionist networks: The sequential learning problem. In Psychology of Learning and Motivation. Vol. 24. Elsevier, 109--165.
    [98]
    Luke Metz, Ben Poole, David Pfau, and Jascha Sohl-Dickstein. 2016. Unrolled Generative Adversarial Networks. arXiv:1611.02163. Retrieved from http://arxiv.org/abs/1611.02163.
    [99]
    Mehdi Mirza and Simon Osindero. 2014. Conditional Generative Adversarial Nets. arXiv:1411.1784. Retrieved from https://arxiv.org/abs/1411.1784.
    [100]
    Takeru Miyato, Toshiki Kataoka, Masanori Koyama, and Yuichi Yoshida. 2018. Spectral Normalization for Generative Adversarial Networks. arXiv:1802.05957. Retrieved from https://arxiv.org/abs/1802.05957.
    [101]
    Alfred Müller. 1997. Integral probability metrics and their generating classes of functions. Adv. Appl. Probab. 29, 2 (1997), 429--443.
    [102]
    Yuval Netzer, Tao Wang, Adam Coates, Alessandro Bissacco, Bo Wu, and Andrew Y. Ng. 2011. Reading digits in natural images with unsupervised feature learning. In NIPS Workshop on Deep Learning and Unsupervised Feature Learning.
    [103]
    Frank Nielsen and Richard Nock. 2013. On the chi square and higher-order chi distances for approximating f-divergences. IEEE Sign. Process. Lett. 21, 1 (2013), 10--13.
    [104]
    Sebastian Nowozin, Botond Cseke, and Ryota Tomioka. 2016. f-GAN: Training generative neural samplers using variational divergence minimization. In Advances in Neural Information Processing Systems. 271--279.
    [105]
    Augustus Odena. 2016. Semi-supervised Learning with Generative Adversarial Networks. arXiv:1606.01583. Retrieved from https://arxiv.org/abs/1606.01583.
    [106]
    Augustus Odena, Christopher Olah, and Jonathon Shlens. 2017. Conditional image synthesis with auxiliary classifier gans. In Proceedings of the 34th International Conference on Machine Learning, Vol. 70. 2642--2651.
    [107]
    Junting Pan, Cristian Canton Ferrer, Kevin McGuinness, Noel E. O’Connor, Jordi Torres, Elisa Sayrol, and Xavier Giro-i Nieto. 2017. SalGAN: Visual Saliency Prediction with Generative Adversarial Networks. arXiv:1701.01081. Retrieved from https://arxiv.org/abs/1701.01081.
    [108]
    Sung Woo Park and Junseok Kwon. 2019. Sphere generative adversarial network based on geometric moment matching. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’19).
    [109]
    Pascal Paysan, Reinhard Knothe, Brian Amberg, Sami Romdhani, and Thomas Vetter. 2009. A 3D face model for pose and illumination invariant face recognition. In Proceedings of the 2009 6th IEEE International Conference on Advanced Video and Signal Based Surveillance. IEEE, 296--301.
    [110]
    Ben Poole, Alexander A. Alemi, Jascha Sohl-Dickstein, and Anelia Angelova. 2016. Improved Generator Objectives for GANs. arXiv:1612.02780. Retrieved from https://arxiv.org/abs/1612.02780.
    [111]
    Guo-Jun Qi. 2017. Loss-sensitive Generative Adversarial Networks on Lipschitz Densities. arXiv:1701.06264. Retrieved from https://arxiv.org/abs/1701.06264.
    [112]
    Zhaofan Qiu, Yingwei Pan, Ting Yao, and Tao Mei. 2017. Deep semantic hashing with generative adversarial networks. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 225--234.
    [113]
    Alec Radford, Luke Metz, and Soumith Chintala. 2015. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks. arXiv:1511.06434. Retrieved from https://arxiv.org/abs/1511.06434.
    [114]
    Scott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, and Honglak Lee. 2016. Generative Adversarial Text to Image Synthesis. arXiv:1605.05396. Retrieved from https://arxiv.org/abs/1605.05396.
    [115]
    Yossi Rubner, Carlo Tomasi, and Leonidas J. Guibas. 2000. The earth mover’s distance as a metric for image retrieval. Int. J. Comput. Vis. 40, 2 (2000), 99--121.
    [116]
    Andrei A. Rusu, Neil C. Rabinowitz, Guillaume Desjardins, Hubert Soyer, James Kirkpatrick, Koray Kavukcuoglu, Razvan Pascanu, and Raia Hadsell. 2016. Progressive Neural Networks. arXiv:1606.04671. Retrieved from https://arxiv.org/abs/1606.04671.
    [117]
    Tim Salimans, Ian Goodfellow, Wojciech Zaremba, Vicki Cheung, Alec Radford, and Xi Chen. 2016. Improved techniques for training GANs. In Advances in Neural Information Processing Systems. 2234--2242.
    [118]
    Tim Salimans and Durk P. Kingma. 2016. Weight normalization: A simple reparameterization to accelerate training of deep neural networks. In Advances in Neural Information Processing Systems. 901--909.
    [119]
    Hanul Shin, Jung Kwon Lee, Jaehong Kim, and Jiwon Kim. 2017. Continual learning with deep generative replay. In Advances in Neural Information Processing Systems. 2990--2999.
    [120]
    Konstantin Shmelkov, Cordelia Schmid, and Karteek Alahari. 2018. How good is my GAN? In Proceedings of the European Conference on Computer Vision (ECCV’18). 213--229.
    [121]
    Jiaming Song and Stefano Ermon. 2019. Bridging the Gap between f-GANs and Wasserstein GANs. arXiv:1910.09779. Retrieved from https://arxiv.org/abs/1910.09779.
    [122]
    Nasim Souly, Concetto Spampinato, and Mubarak Shah. 2017. Semi supervised semantic segmentation using generative adversarial network. In Proceedings of the IEEE International Conference on Computer Vision. 5688--5696.
    [123]
    Bharath K. Sriperumbudur, Kenji Fukumizu, Arthur Gretton, Bernhard Schölkopf, and Gert R. G. Lanckriet. 2009. On Integral Probability Metrics, ϕ-divergences and Binary Classification. arXiv:0901.2698. Retrieved from https://arxiv.org/abs/0901.2698.
    [124]
    Lucas Theis, Aäron van den Oord, and Matthias Bethge. 2015. A Note on the Evaluation of Generative Models. arXiv:1511.01844. Retrieved from https://arxiv.org/abs/1511.01844.
    [125]
    Matteo Tomei, Marcella Cornia, Lorenzo Baraldi, and Rita Cucchiara. 2018. Art2Real: Unfolding the Reality of Artworks via Semantically-aware Image-to-image Translation. arXiv:1811.10666. Retrieved from https://arxiv.org/abs/1811.10666.
    [126]
    Sergey Tulyakov, Ming-Yu Liu, Xiaodong Yang, and Jan Kautz. 2018. MoCoGAN: Decomposing motion and content for video generation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1526--1535.
    [127]
    Alan M. Turing. 2009. Computing machinery and intelligence. In Parsing the Turing Test. Springer, 23--65.
    [128]
    Mehmet Ozgur Turkoglu, Luuk Spreeuwers, William Thong, and Berkay Kicanaoglu. 2019. A layer-based sequential framework for scene generation with GANs. In Proceedings of the 33rd AAAI Conference on Artificial Intelligence.
    [129]
    Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems. 5998--6008.
    [130]
    Carl Vondrick, Hamed Pirsiavash, and Antonio Torralba. 2016. Generating videos with scene dynamics. In Advances in Neural Information Processing Systems. 613--621.
    [131]
    Kunfeng Wang, Chao Gou, Yanjie Duan, Yilun Lin, Xinhu Zheng, and Fei-Yue Wang. 2017. Generative adversarial networks: Introduction and outlook. IEEE/CAA J. Autom. Sin. 4, 4 (2017), 588--598.
    [132]
    Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, and Bryan Catanzaro. 2018. High-resolution image synthesis and semantic manipulation with conditional GANs. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 8798--8807.
    [133]
    Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, Yu Qiao, and Chen Change Loy. 2018. ESRGAN: Enhanced super-resolution generative adversarial networks. In Proceedings of the European Conference on Computer Vision Workshop.
    [134]
    Zhengwei Wang, Graham Healy, Alan F. Smeaton, and Tomas E. Ward. 2020. Use of neural signals to evaluate the quality of generative adversarial network performance in facial image generation. Cogn. Comput. 12, 1 (2020), 13--24.
    [135]
    Zhengwei Wang, Qi She, Alan F. Smeaton, Tomas E. Ward, and Graham Healy. 2020. Synthetic-neuroscore: Using a neuro-ai interface for evaluating generative adversarial networks. Neurocomputing 405 (2020), 26--36.
    [136]
    Huikai Wu, Shuai Zheng, Junge Zhang, and Kaiqi Huang. 2017. GP-GAN: Towards Realistic High-resolution Image Blending. arXiv:1703.07195. Retrieved from https://arxiv.org/abs/1703.07195.
    [137]
    Yuanbo Xiangli, Yubin Deng, Bo Dai, Chen Change Loy, and Dahua Lin. 2020. Real or Not Real, That Is the Question. arXiv:2002.05512. Retrieved from https://arxiv.org/abs/2002.05512.
    [138]
    Qiantong Xu, Gao Huang, Yang Yuan, Chuan Guo, Yu Sun, Felix Wu, and Kilian Weinberger. 2018. An Empirical Study on Evaluation Metrics of Generative Adversarial Networks. arXiv:1806.07755. Retrieved from https://arxiv.org/abs/1806.07755.
    [139]
    Chao Yang, Xin Lu, Zhe Lin, Eli Shechtman, Oliver Wang, and Hao Li. 2017. High-resolution image inpainting using multi-scale neural patch synthesis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 6721--6729.
    [140]
    Jimei Yang, Brian Price, Scott Cohen, Honglak Lee, and Ming-Hsuan Yang. 2016. Object contour detection with a fully convolutional encoder-decoder network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 193--202.
    [141]
    Zhilin Yang, Junjie Hu, Ruslan Salakhutdinov, and William Cohen. 2017. Semi-supervised QA with generative domain-adaptive nets. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 1040--1050.
    [142]
    Raymond A. Yeh, Chen Chen, Teck Yian Lim, Alexander G. Schwing, Mark Hasegawa-Johnson, and Minh N. Do. 2017. Semantic image inpainting with deep generative models. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5485--5493.
    [143]
    Zili Yi, Hao Zhang, Ping Tan, and Minglun Gong. 2017. DualGAN: Unsupervised dual learning for image-to-image translation. In Proceedings of the IEEE International Conference On Computer Vision. 2849--2857.
    [144]
    Yuichi Yoshida and Takeru Miyato. 2017. Spectral Norm Regularization for Improving the Generalizability of Deep Learning. arXiv:1705.10941. Retrieved from https://arxiv.org/abs/1705.10941.
    [145]
    Fisher Yu, Ari Seff, Yinda Zhang, Shuran Song, Thomas Funkhouser, and Jianxiong Xiao. 2015. Lsun: Construction of a Large-scale Image Dataset Using Deep Learning with Humans in the Loop. arXiv:1506.03365. Retrieved from https://arxiv.org/abs/1506.03365.
    [146]
    Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, and Thomas S. Huang. 2018. Generative Image Inpainting with Contextual Attention. arXiv:1801.07892. Retrieved from https://arxiv.org/abs/1801.07892.
    [147]
    Lantao Yu, Weinan Zhang, Jun Wang, and Yong Yu. 2017. SeqGAN: Sequence generative adversarial nets with policy gradient. In Proceedings of the 31st AAAI Conference on Artificial Intelligence.
    [148]
    Matthew D. Zeiler and Rob Fergus. 2014. Visualizing and understanding convolutional networks. In Proceedings of the European Conference on Computer Vision. Springer, 818--833.
    [149]
    Han Zhang, Ian Goodfellow, Dimitris Metaxas, and Augustus Odena. 2018. Self-attention Generative Adversarial Networks. arXiv:1805.08318. Retrieved from https://arxiv.org/abs/1805.08318.
    [150]
    Han Zhang, Tao Xu, Hongsheng Li, Shaoting Zhang, Xiaogang Wang, Xiaolei Huang, and Dimitris N. Metaxas. 2017. Stackgan: Text to photo-realistic image synthesis with stacked generative adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision. 5907--5915.
    [151]
    Junbo Zhao, Michael Mathieu, and Yann LeCun. 2016. Energy-based Generative Adversarial Network. arXiv:1609.03126. Retrieved from https://arxiv.org/abs/1609.03126.
    [152]
    Jun-Yan Zhu, Philipp Krähenbühl, Eli Shechtman, and Alexei A. Efros. 2016. Generative visual manipulation on the natural image manifold. In Proceedings of the European Conference on Computer Vision. Springer, 597--613.
    [153]
    Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A. Efros. 2017. Unpaired Image-to-image Translation Using Cycle-consistent Adversarial Networks. arXiv:1703.10593v6. Retrieved from https://arxiv.org/abs/1703.10593v6.
    [154]
    Jun-Yan Zhu, Richard Zhang, Deepak Pathak, Trevor Darrell, Alexei A. Efros, Oliver Wang, and Eli Shechtman. 2017. Toward multimodal image-to-image translation. In Advances in Neural Information Processing Systems. 465--476.
    [155]
    Wentao Zhu, Xiang Xiang, Trac D. Tran, and Xiaohui Xie. 2016. Adversarial Deep Structural Networks for Mammographic Mass Segmentation. arXiv:1612.05970. Retrieved from https://arxiv.org/abs/1612.05970.

    Cited By

    View all
    • (2024)Application of Gaussian Model and Deep Learning Encoder-Decoder Algorithm for Single-Image Reflection RemovalAdvanced Journal of Science, Technology and Engineering10.52589/AJSTE-GWXJPEN44:2(64-80)Online publication date: 22-Jul-2024
    • (2024)G-GANS for Adaptive Learning in Dynamic Network SlicesEngineering, Technology & Applied Science Research10.48084/etasr.704614:3(14327-14341)Online publication date: 1-Jun-2024
    • (2024)REVIEW AND EXPERIMENTAL COMPARISON OF GENERATIVE ADVERSARIAL NETWORKS FOR SYNTHETIC IMAGE GENERATIONNew Trends in Computer Sciences10.3846/ntcs.2024.205162:1(1-18)Online publication date: 30-May-2024
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Computing Surveys
    ACM Computing Surveys  Volume 54, Issue 2
    March 2022
    800 pages
    ISSN:0360-0300
    EISSN:1557-7341
    DOI:10.1145/3450359
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 09 February 2021
    Accepted: 01 November 2020
    Revised: 01 September 2020
    Received: 01 November 2019
    Published in CSUR Volume 54, Issue 2

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Generative adversarial networks
    2. architecture-variants
    3. computer vision
    4. loss-variants
    5. stabilizing training

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3,260
    • Downloads (Last 6 weeks)210
    Reflects downloads up to 09 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Application of Gaussian Model and Deep Learning Encoder-Decoder Algorithm for Single-Image Reflection RemovalAdvanced Journal of Science, Technology and Engineering10.52589/AJSTE-GWXJPEN44:2(64-80)Online publication date: 22-Jul-2024
    • (2024)G-GANS for Adaptive Learning in Dynamic Network SlicesEngineering, Technology & Applied Science Research10.48084/etasr.704614:3(14327-14341)Online publication date: 1-Jun-2024
    • (2024)REVIEW AND EXPERIMENTAL COMPARISON OF GENERATIVE ADVERSARIAL NETWORKS FOR SYNTHETIC IMAGE GENERATIONNew Trends in Computer Sciences10.3846/ntcs.2024.205162:1(1-18)Online publication date: 30-May-2024
    • (2024)Performance Optimization of GAN-based Image Style Transfer on Indoor Geometric Shaped Data2024 11th International Conference on Computing for Sustainable Global Development (INDIACom)10.23919/INDIACom61295.2024.10498547(936-940)Online publication date: 28-Feb-2024
    • (2024)AdvNF: Reducing mode collapse in conditional normalising flows using adversarial learningSciPost Physics10.21468/SciPostPhys.16.5.13216:5Online publication date: 24-May-2024
    • (2024)Creativity and Machine Learning: A SurveyACM Computing Surveys10.1145/3664595Online publication date: 11-May-2024
    • (2024)Deep Neural Networks and Tabular Data: A SurveyIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.322916135:6(7499-7519)Online publication date: Jun-2024
    • (2024)Multiobjective Evolutionary Generative Adversarial Network Compression for Image TranslationIEEE Transactions on Evolutionary Computation10.1109/TEVC.2023.326113528:3(798-809)Online publication date: Jun-2024
    • (2024)Few-Shot Synthetic Online Transfer Learning for Cross-Site Neurological Disease DiagnosisIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.327056911:2(2201-2209)Online publication date: Apr-2024
    • (2024)SalDA: DeepConvNet Greets Attention for Visual Saliency PredictionIEEE Transactions on Cognitive and Developmental Systems10.1109/TCDS.2023.327417916:1(319-331)Online publication date: Feb-2024
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Get Access

    Login options

    Full Access

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media