VHS to HDTV Video Translation Using Multi-task Adversarial Learning

Luo, Hongming; Liao, Guangsen; Hou, Xianxu; Liu, Bozhi; Zhou, Fei; Qiu, Guoping

doi:10.1007/978-3-030-37731-1_7

Hongming Luo^16,17,18,19,
Guangsen Liao^16,17,18,19,
Xianxu Hou^16,17,18,19,
Bozhi Liu^16,17,18,19,
Fei Zhou^16,17,18,19 &
…
Guoping Qiu^{16,17,18,19,20}

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11961))

Included in the following conference series:

International Conference on Multimedia Modeling

2863 Accesses
1 Citations

Abstract

There are large amount of valuable video archives in Video Home System (VHS) format. However, due to the analog nature, their quality is often poor. Compared to High-definition television (HDTV), VHS video not only has a dull color appearance but also has a lower resolution and often appears blurry. In this paper, we focus on the problem of translating VHS video to HDTV video and have developed a solution based on a novel unsupervised multi-task adversarial learning model. Inspired by the success of generative adversarial network (GAN) and CycleGAN, we employ cycle consistency loss, adversarial loss and perceptual loss together to learn a translation model. An important innovation of our work is the incorporation of super-resolution model and color transfer model that can solve unsupervised multi-task problem. To our knowledge, this is the first work that dedicated to the study of the relation between VHS and HDTV and the first computational solution to translate VHS to HDTV. We present experimental results to demonstrate the effectiveness of our solution qualitatively and quantitatively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Unsupervised video-to-video translation with preservation of frame modification tendency

Article 22 July 2020

Two-Channel VAE-GAN Based Image-To-Video Translation

Multi-hop Video Super Resolution with Long-Term Consistency (MVSRGAN)

Article 22 May 2023

References

Cui, Z., Chang, H., Shan, S., Zhong, B., Chen, X.: Deep network cascade for image super-resolution. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 49–64. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_4
Chapter Google Scholar
Dong, C., Loy, C.C., He, K., Tang, X.: Image super-resolution using deep convolutional networks. IEEE Trans. Pattern Anal. Mach. Intell. 38(2), 295–307 (2015)
Article Google Scholar
Dong, C., Loy, C.C., Tang, X.: Accelerating the super-resolution convolutional neural network. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 391–407. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_25
Chapter Google Scholar
Gatys, L.A., Ecker, A.S., Bethge, M.: Image style transfer using convolutional neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2414–2423 (2016)
Google Scholar
Isola, P., Zhu, J.Y., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1125–1134 (2017)
Google Scholar
Johnson, J., Alahi, A., Fei-Fei, L.: Perceptual losses for real-time style transfer and super-resolution. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 694–711. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_43
Chapter Google Scholar
Kim, J., Kwon Lee, J., Mu Lee, K.: Accurate image super-resolution using very deep convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1646–1654 (2016)
Google Scholar
Ledig, C., et al.: Photo-realistic single image super-resolution using a generative adversarial network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4681–4690 (2017)
Google Scholar
Li, Y., Liu, M.-Y., Li, X., Yang, M.-H., Kautz, J.: A closed-form solution to photorealistic image stylization. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11207, pp. 468–483. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01219-9_28
Chapter Google Scholar
Luan, F., Paris, S., Shechtman, E., Bala, K.: Deep photo style transfer. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4990–4998 (2017)
Google Scholar
Mittal, A., Moorthy, A.K., Bovik, A.C.: No-reference image quality assessment in the spatial domain. IEEE Trans. Image Process. 21(12), 4695–4708 (2012)
Article MathSciNet Google Scholar
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Chapter Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Tai, Y., Yang, J., Liu, X.: Image super-resolution via deep recursive residual network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3147–3155 (2017)
Google Scholar
Venkatanath, N., Praneeth, D., Bh, M.C., Channappayya, S.S., Medasani, S.S.: Blind image quality evaluation using perception based features. In: 2015 Twenty First National Conference on Communications (NCC), pp. 1–6. IEEE (2015)
Google Scholar
Wang, T.C., et al.: Video-to-video synthesis. arXiv preprint arXiv:1808.06601 (2018)
Zhang, Y., Tian, Y., Kong, Y., Zhong, B., Fu, Y.: Residual dense network for image super-resolution. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2472–2481 (2018)
Google Scholar
Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE International Conference On Computer Vision, pp. 2223–2232 (2017)
Google Scholar

Download references

Acknowledgement

This work was supported by initial funding of newly-introduced teacher in Shenzhen University with No. 2019121. The authors would like to thank the editors and reviewers for their constructive suggestions on our work. The corresponding author of this paper is Fei Zhou.

Author information

Authors and Affiliations

College of Electronics and Information Engineering, Shenzhen University, Shenzhen, China
Hongming Luo, Guangsen Liao, Xianxu Hou, Bozhi Liu, Fei Zhou & Guoping Qiu
Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen, China
Hongming Luo, Guangsen Liao, Xianxu Hou, Bozhi Liu, Fei Zhou & Guoping Qiu
Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), Shenzhen, China
Hongming Luo, Guangsen Liao, Xianxu Hou, Bozhi Liu, Fei Zhou & Guoping Qiu
Shenzhen Institute of Artificial Intelligence and Robotics for Society, Shenzhen, China
Hongming Luo, Guangsen Liao, Xianxu Hou, Bozhi Liu, Fei Zhou & Guoping Qiu
School of Computer Science, University of Nottingham, Nottingham, UK
Guoping Qiu

Authors

Hongming Luo
View author publications
You can also search for this author in PubMed Google Scholar
Guangsen Liao
View author publications
You can also search for this author in PubMed Google Scholar
Xianxu Hou
View author publications
You can also search for this author in PubMed Google Scholar
Bozhi Liu
View author publications
You can also search for this author in PubMed Google Scholar
Fei Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Guoping Qiu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fei Zhou .

Editor information

Editors and Affiliations

Korea Advanced Institute of Science and, Daejeon, Korea (Republic of)
Yong Man Ro
National Chiao Tung University, Hsinchu, Taiwan
Wen-Huang Cheng
Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Junmo Kim
National Cheng Kung University, Tainan City, Taiwan
Wei-Ta Chu
Tsinghua University, Beijing, China
Peng Cui
Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Jung-Woo Choi
National Tsing Hua University, Hsinchu, Taiwan
Min-Chun Hu
Ghent University, Ghent, Belgium
Wesley De Neve

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Luo, H., Liao, G., Hou, X., Liu, B., Zhou, F., Qiu, G. (2020). VHS to HDTV Video Translation Using Multi-task Adversarial Learning. In: Ro, Y., et al. MultiMedia Modeling. MMM 2020. Lecture Notes in Computer Science(), vol 11961. Springer, Cham. https://doi.org/10.1007/978-3-030-37731-1_7

Download citation

DOI: https://doi.org/10.1007/978-3-030-37731-1_7
Published: 24 December 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37730-4
Online ISBN: 978-3-030-37731-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

VHS to HDTV Video Translation Using Multi-task Adversarial Learning

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Unsupervised video-to-video translation with preservation of frame modification tendency

Two-Channel VAE-GAN Based Image-To-Video Translation

Multi-hop Video Super Resolution with Long-Term Consistency (MVSRGAN)

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

VHS to HDTV Video Translation Using Multi-task Adversarial Learning

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Unsupervised video-to-video translation with preservation of frame modification tendency

Two-Channel VAE-GAN Based Image-To-Video Translation

Multi-hop Video Super Resolution with Long-Term Consistency (MVSRGAN)

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation