Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3669754.3669758acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccaiConference Proceedingsconference-collections
research-article

Artistic Outpainting through Adaptive Image-to-Text and Text-to-Image Generation

Published: 30 August 2024 Publication History

Abstract

Artistic heritage often confronts the challenges of degradation, particularly in museum environments where valuable art paintings may exhibit missing regions along their borders. This research addresses the urgent need to restore and revitalize damaged art paintings, utilizing advanced computational methods to harmoniously fill these gaps while preserving the original aesthetic. This paper introduces an innovative approach, employing adaptive Image-to-Text and Text-to-Image Generation for the completion of damaged art paintings, referred to as Artistic Outpainting. Our proposed methodology unfolds in a carefully structured three-step process. Commencing with a pixel-wise network, we employ sophisticated image inpainting techniques to restore art paintings with missing border regions, ensuring a detailed reconstruction that seamlessly integrates additional content. This sets the foundation for subsequent transformations. Building upon the restored art painting, our approach integrates an Image-to-Text Generation model in the second step. The generated output from the first step serves as input, enabling the model to extract rich contextual information and translate the visual composition into a comprehensive textual prompt. This transition from visual to semantic understanding enhances art painting interpretation, fostering a deeper connection and laying the groundwork for the final transformation. In the last step, our Text-to-Image Generation synthesis applies the textual prompt to create a visually cohesive outpainting. This synthesis process employs state-of-the-art techniques, allowing for adaptive and imaginative extensions guided by the intrinsic creativity embedded within the textual descriptions. The experimental results underscore the efficacy of our method, demonstrating superior performance compared to other existing approaches. In the realm of art restoration, our approach not only contributes to the preservation of cultural heritage but also serves as evidence of the harmonious integration of computer vision and artistic conservation.

References

[1]
[1] Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros. 2016. Image-to-image translation with conditional adversarial networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[2]
[2] Tao Xu, Pengchuan Zhang, Qiuyuan Huang, Han Zhang, Zhe Gan, Xiaolei Huang, Xiaodong He. 2017. Attngan: Fine-grained text to image generation with attentional generative adversarial networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[3]
[3] Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei A. Efros. 2017. Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision (ICCV).
[4]
[4] Weihuang Liu, Xiaodong Cun, Chi-Man Pun, Menghan Xia, Yong Zhang, Jue Wang. 2023. CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying. In Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR).
[5]
[5] Jiahui Yu, Zirui Wang, Vijay Vasudevan, Legg Yeung, Mojtaba Seyedhosseini, Yonghui Wu. 2022. CoCa: Contrastive Captioners are Image-Text Foundation Models. In Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR).
[6]
[6] Dominic Rampas, Pablo Pernias, Marc Aubreville. 2022. A Novel Sampling Scheme for Text- and Image-Conditional Image Synthesis in Quantized Latent Spaces. In Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR).
[7]
[7] Patrick Esser, Robin Rombach, Björn Ommer. 2020. Taming Transformers for High-Resolution Image Synthesis. In Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR).
[8]
[8] Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer. 2022. High-Resolution Image Synthesis with Latent Diffusion Models. In Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR).
[9]
[9] Aditya Ramesh, Prafulla Dhariwal, Alex Nichol, Casey Chu, Mark Chen. 2022. Hierarchical Text-Conditional Image Generation with CLIP Latents. In Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR).
[10]
[10] Zhou Wang, Alan C. Bovik, Ligang Lu. 2002. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[11]
[11] Muhammad Ali Qureshi, Mohamed Deriche, Azeddine Beghdadi, Asjad Amin. 2017. Journal of Visual Communication and Image Representation, Volume 49, pp. 177–191.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICCAI '24: Proceedings of the 2024 10th International Conference on Computing and Artificial Intelligence
April 2024
491 pages
ISBN:9798400717055
DOI:10.1145/3669754
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 August 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Artistic Outpainting
  2. Image-to-Text Generation
  3. Pixel-wise Network
  4. Text-to-Image Generation.

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICCAI 2024

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 30 Aug 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media