research-article

RepFinder: finding approximately repeated scene elements for image editing

Authors:

Ming-Ming Cheng,

Fang-Lue Zhang,

Niloy J. Mitra,

Shi-Min HuAuthors Info & Claims

ACM Transactions on Graphics (TOG), Volume 29, Issue 4

Article No.: 83, Pages 1 - 8

https://doi.org/10.1145/1778765.1778820

Published: 26 July 2010 Publication History

Abstract

Repeated elements are ubiquitous and abundant in both manmade and natural scenes. Editing such images while preserving the repetitions and their relations is nontrivial due to overlap, missing parts, deformation across instances, illumination variation, etc. Manually enforcing such relations is laborious and error-prone. We propose a novel framework where user scribbles are used to guide detection and extraction of such repeated elements. Our detection process, which is based on a novel boundary band method, robustly extracts the repetitions along with their deformations. The algorithm only considers the shape of the elements, and ignores similarity based on color, texture, etc. We then use topological sorting to establish a partial depth ordering of overlapping repeated instances. Missing parts on occluded instances are completed using information from other instances. The extracted repeated instances can then be seamlessly edited and manipulated for a variety of high level tasks that are otherwise difficult to perform. We demonstrate the versatility of our framework on a large set of inputs of varying complexity, showing applications to image rearrangement, edit transfer, deformation propagation, and instance replacement.

Supplementary Material

JPG File (tp016-10.jpg)

Download
10.47 KB

MP4 File (tp016-10.mp4)

Download
25.74 MB

References

[1]

Adams, A., Gelfand, N., Dolson, J., and Levoy, M. 2009. Gaussian KD-trees for fast high-dimensional filtering. ACM Trans. Graph. 28, 3, 21:1--12.

Digital Library

[2]

Ahuja, N., and Todorovic, S. 2007. Extracting texels in 2.1D natural textures. In Proc. of ICCV, 1--8.

[3]

An, X., and Pellacini, F. 2008. Appprop: all-pairs appearance-space edit propagation. ACM Trans. Graph. 27, 3, 40: 1--9.

Digital Library

[4]

Bai, X., Li, Q. N., Latecki, L. J., Liu, W. Y., and Tu, Z. W. 2009. Shape band: A deformable object detection approach. In Proc. of CVPR, 1335--1342.

[5]

Bai, X., Wang, J., Simons, D., and Sapiro, G. 2009. Video SnapCut: robust video object cutout using localized classifiers. In ACM Trans. Graph., ACM, 70.

Digital Library

[6]

Barnes, C., Shechtman, E., Finkelstein, A., and Goldman, D. B. 2009. Patchmatch: A randomized correspondence algorithm for structural image editing. ACM Trans. Graph. 28, 3, 24:1--11.

Digital Library

[7]

Bay, H., Ess, A., Tuytelaars, T., and Gool, L. J. V. 2008. Speeded-up robust features (SURF). Computer Vision and Image Understanding 110, 3, 346--359.

Digital Library

[8]

Belongie, S., Malik, J., and Puzicha, J. 2002. Shape matching and object recognition using shape contexts. IEEE TPAMI 24, 4, 509--522.

Digital Library

[9]

Berg, A. C., Berg, T. L., and Malik, J. 2005. Shape matching and object recognition using low distortion correspondences. In Proc. of CVPR, I: 26--33.

Digital Library

[10]

Bookstein, F. 1989. Principal warps: Thin-plate splines and the decomposition of deformations. IEEE TPAMI 11, 6, 567--585.

Digital Library

[11]

Boykov, Y. Y., and Lea, G. F. 2006. Graph cuts and efficient N-D image segmentation. IJCV 70, 2, 109--131.

Digital Library

[12]

Brox, T., Kleinschmidt, O., and Cremers, D. 2008. Efficient nonlocal means for denoising of textural patterns. IEEE Trans. Image Processing 17, 7, 1083--1092.

Digital Library

[13]

Chen, T., Cheng, M., Tan, P., Shamir, A., and Hu, S. 2009. Sketch2Photo: internet image montage. ACM Trans. Graph. 28, 5, 124: 1--10.

Digital Library

[14]

Cho, T. S., Butman, M., Avidan, S., and Freeman, W. T. 2008. The patch transform and its applications to image editing. In Proc. of CVPR, 1--8.

[15]

Criminisi, A., Perez, P., and Toyama, K. 2004. Region filling and object removal by exemplar-based image inpainting. IEEE Trans. Image Processing 13, 9, 1200--1212.

Digital Library

[16]

Eisemann, E., and Durand, F. 2004. Flash photography enhancement via intrinsic relighting. ACM Trans. Graph. 23, 3, 673--678.

Digital Library

[17]

Ho, J., Peter, A., Rangarajan, A., and Yang, M.-H. 2009. An algebraic approach to affine registration of point sets. In Proc. of ICCV, 1--8.

[18]

Hoiem, D., Efros, A. A., and Hebert, M. 2005. Automatic photo pop-up. ACM Trans. Graph. 24, 3, 577--584.

Digital Library

[19]

Igarashi, T., Moscovich, T., and Hughes, J. F. 2005. Asrigid-as-possible shape manipulation. ACM Trans. Graph. 24, 3, 1134--1141.

Digital Library

[20]

Jia, Y., Hu, S., and Martin, R. 2005. Video completion using tracking and fragment merging. The Visual Computer 21, 8, 601--610.

[21]

Karni, Z., Freedman, D., and Gotsman, C. 2009. Energy-based image deformation. Comput. Graph. Forum 28, 5, 1257--1268.

Digital Library

[22]

Kilthau, S. L., Drew, M. S., and Moller, T. 2002. Full search content independent block matching based on the fast fourier transform. In Proc. of ICIP, I: 669--672.

[23]

Koffka, K. 1935. Principles of Gestalt Psychology. Lund Humphries.

[24]

Lalonde, J.-F., Hoiem, D., Efros, A. A., Rother, C., Winn, J. M., and Criminisi, A. 2007. Photo clip art. ACM Trans. Graph. 26, 3, 3:1--10.

Digital Library

[25]

Landes, P.-E., and Soler, C. 2009. Content-Aware Texture Synthesis. Research Report RR-6959, INRIA.

[26]

Lempitsky, V., Kohli, P., Rother, C., and Sharp, T. 2009. Image segmentation with a bounding box prior. In Proc. of ICCV, 1--8.

[27]

Leung, T., and Malik, J. 1996. Detecting, localizing and grouping repeated scene elements from an image. In Proc. of ECCV, I:546--555.

Digital Library

[28]

Levin, A., Lischinski, D., and Weiss, Y. 2008. A closed-form solution to natural image matting. IEEE TPAMI 30, 2, 228--242.

Digital Library

[29]

Liu, Y., Collins, R. T., and Tsin, Y. 2003. A computational model for periodic pattern perception based on frieze and wallpaper groups. IEEE TPAMI 26, 3, 354--371.

Digital Library

[30]

Lowe, D. G. 2004. Distinctive image features from scale-invariant keypoints. IJCV 60, 2, 91--110.

Digital Library

[31]

McCann, J., and Pollard, N. S. 2009. Local layering. ACM Trans. Graph. 28, 3, 84:1--7.

Digital Library

[32]

Paris, S., and Durand, F. 2007. A topological approach to hierarchical segmentation using mean shift. In Proc. of CVPR, 1--8.

[33]

Pauly, M., Mitra, N. J., Wallner, J., Pottmann, H., and Guibas, L. J. 2008. Discovering structural regularity in 3D geometry. ACM Trans. Graph. 27, 3, 43:1--11.

Digital Library

[34]

Rother, C., Kolmogorov, V., and Blake, A. 2004. Grab-Cut: Interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. 23, 3, 309--314.

Digital Library

[35]

Sapiro, G., Kimmel, R., and Caselles, V. 1995. Geodesic active contours. In Proc. of ICCV, 694--699.

Digital Library

[36]

Schaefer, S., McPhail, T., and Warren, J. 2006. Image deformation using moving least squares. ACM Trans. Graph. 25, 3, 533--540.

Digital Library

[37]

Shamir, A., and Avidan, S. 2009. Seam carving for media retargeting. Commun. ACM 52, 1, 77--85.

Digital Library

[38]

Shi, J., and Malik, J. 2000. Normalized cuts and image segmentation. IEEE TPAMI 22, 8, 888--905.

Digital Library

[39]

Simakov, D., Caspi, Y., Shechtman, E., and Irani, M. 2008. Summarizing visual data using bidirectional similarity. In Proc. of CVPR, 1--8.

[40]

Sun, J., Yuan, L., Jia, J., and Shum, H.-Y. 2005. Image completion with structure propagation. ACM Trans. Graph. 24, 3, 861--868.

Digital Library

[41]

Thayananthan, A., Stenger, B., Torr, P. H. S., and Cipolla, R. 2003. Shape context and chamfer matching in cluttered scenes. In Proc. of CVPR, I: 127--133.

Digital Library

[42]

Xu, K., Li, Y., Ju, T., Hu, S., and Liu, T. 2009. Efficient affinity-based edit propagation using KD tree. In ACM Trans. Graph., ACM, 118: 1--6.

Digital Library

[43]

Zhang, G.-X., Cheng, M.-M., Hu, S.-M., and Martin, R. R. 2009. A shape-preserving approach to image resizing. Comput. Graph. Forum 28, 7, 1897--1906.

[44]

Zheng, Q., Sharf, A., Wan, G., Li, Y., Mitra, N. J., Cohen-Or, D., and Chen, B. 2010. Non-local scan consolidation for 3d urban scene. ACM Trans. Graph. 29, 3, to appear.

Digital Library

Cited By

Han JWang MWu WJia X(2025)Perceptual localization and focus refinement network for RGB-D salient object detectionExpert Systems with Applications10.1016/j.eswa.2024.125278259(125278)Online publication date: Jan-2025
https://doi.org/10.1016/j.eswa.2024.125278
Xu BLiang HLiang RChen P(2024)Synthesize Boundaries: A Boundary-Aware Self-Consistent Framework for Weakly Supervised Salient Object DetectionIEEE Transactions on Multimedia10.1109/TMM.2023.332139326(4194-4205)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TMM.2023.3321393
Wu JHao FLiang WXu J(2024)Transformer Fusion and Pixel-Level Contrastive Learning for RGB-D Salient Object DetectionIEEE Transactions on Multimedia10.1109/TMM.2023.327530826(1011-1026)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TMM.2023.3275308
Show More Cited By

Index Terms

RepFinder: finding approximately repeated scene elements for image editing
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Shape inference
  2. Computer graphics
    1. Graphics systems and interfaces
    2. Shape modeling

Recommendations

RepFinder: finding approximately repeated scene elements for image editing
SIGGRAPH '10: ACM SIGGRAPH 2010 papers

Repeated elements are ubiquitous and abundant in both manmade and natural scenes. Editing such images while preserving the repetitions and their relations is nontrivial due to overlap, missing parts, deformation across instances, illumination variation, ...
WYSIWYG computational photography via viewfinder editing

Digital cameras with electronic viewfinders provide a relatively faithful depiction of the final image, providing a WYSIWYG experience. If, however, the image is created from a burst of differently captured images, or non-linear interactive edits ...
Plenoptic Image Editing

This paper presents a new class of interactive image editing operations designed to maintain consistency between multiple images of a physical 3D scene. The distinguishing feature of these operations is that edits to any one image propagate ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Graphics

ACM Transactions on Graphics Volume 29, Issue 4

July 2010

942 pages

ISSN:0730-0301

EISSN:1557-7368

DOI:10.1145/1778765

Issue’s Table of Contents

Copyright © 2010 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 July 2010

Published in TOG Volume 29, Issue 4

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Ministry of Science and Technology of the People's Republic of China
Development Program of China
Intel-MoE Joint Research Fund

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

133
Total Citations
View Citations
1,794
Total Downloads

Downloads (Last 12 months)34
Downloads (Last 6 weeks)3

Reflects downloads up to 25 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

Han JWang MWu WJia X(2025)Perceptual localization and focus refinement network for RGB-D salient object detectionExpert Systems with Applications10.1016/j.eswa.2024.125278259(125278)Online publication date: Jan-2025
https://doi.org/10.1016/j.eswa.2024.125278
Xu BLiang HLiang RChen P(2024)Synthesize Boundaries: A Boundary-Aware Self-Consistent Framework for Weakly Supervised Salient Object DetectionIEEE Transactions on Multimedia10.1109/TMM.2023.332139326(4194-4205)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TMM.2023.3321393
Wu JHao FLiang WXu J(2024)Transformer Fusion and Pixel-Level Contrastive Learning for RGB-D Salient Object DetectionIEEE Transactions on Multimedia10.1109/TMM.2023.327530826(1011-1026)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TMM.2023.3275308
Qiu YLiu YZhang LLu HXu J(2024)Boosting Salient Object Detection with Transformer-based Asymmetric Bilateral U-NetIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.3307693(1-1)Online publication date: 2024
https://doi.org/10.1109/TCSVT.2023.3307693
Xu BLiang HGong WLiang RChen P(2024)A Visual Representation-Guided Framework With Global Affinity for Weakly Supervised Salient Object DetectionIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.328407634:1(248-259)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TCSVT.2023.3284076
Yang XDuan LZhou Q(2024)PileNet: A high-and-low pass complementary filter with multi-level feature refinement for salient object detectionJournal of Visual Communication and Image Representation10.1016/j.jvcir.2024.104186102(104186)Online publication date: Jun-2024
https://doi.org/10.1016/j.jvcir.2024.104186
Shao FWang KLiu Y(2023)Salient object detection algorithm based on diversity features and global guidance informationInnovation & Technology Advances10.61187/ita.v1i1.141:1(12-20)Online publication date: 30-Jun-2023
https://doi.org/10.61187/ita.v1i1.14
Yu MLin XLiu YGuo Y(2023)Multi-scale attention and boundary enhancement with long-range dependency for salient object detectionJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-22372644:6(8957-8969)Online publication date: 1-Jan-2023
https://dl.acm.org/doi/10.3233/JIFS-223726
Wang YQiu YXu J(2023)Edge Refinement Exploration for Salient Object Detection2023 42nd Chinese Control Conference (CCC)10.23919/CCC58697.2023.10240569(7805-7810)Online publication date: 24-Jul-2023
https://doi.org/10.23919/CCC58697.2023.10240569
Luo YJin XPan ZWu KKou QYang XGao X(2023)Texture Atlas Compression Based on Repeated Content RemovalSIGGRAPH Asia 2023 Conference Papers10.1145/3610548.3618150(1-11)Online publication date: 10-Dec-2023
https://dl.acm.org/doi/10.1145/3610548.3618150
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Issue’s Table of Contents