Open access

Photo tourism: exploring photo collections in 3D

Authors:

Noah Snavely,

Steven M. Seitz,

Richard Szeliski

ACM Transactions on Graphics (TOG), Volume 25, Issue 3

Pages 835 - 846

https://doi.org/10.1145/1141911.1141964

Published: 01 July 2006

Abstract

We present a system for interactively browsing and exploring large unstructured collections of photographs of a scene using a novel 3D interface. Our system consists of an image-based modeling front end that automatically computes the viewpoint of each photograph, a sparse 3D model of the scene, and image-to-model correspondences. Our photo explorer uses image-based rendering techniques to transition smoothly between photographs, while also enabling full 3D navigation and exploration of the set of images and world geometry, along with auxiliary information such as overhead maps. Our system also makes it easy to construct photo tours of scenic or historic locations and to annotate image details, which are automatically transferred to other relevant images. We demonstrate our system on several large personal photo collections as well as images gathered from Internet photo-sharing sites.
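
The abstract does not spell out the camera model or the data layout, so the following is only a rough illustration of how a per-photo viewpoint, a sparse 3D point, and an image-to-model correspondence fit together. It assumes a basic pinhole camera that looks along the +z axis with the principal point at the image origin; the names `Camera`, `project`, and `reprojection_error` are hypothetical and not taken from the paper.

```python
from dataclasses import dataclass
from typing import List, Tuple

Vec3 = Tuple[float, float, float]
Vec2 = Tuple[float, float]


@dataclass
class Camera:
    """Hypothetical viewpoint record for one photograph."""
    R: List[List[float]]  # 3x3 world-to-camera rotation, row-major
    t: Vec3               # world-to-camera translation
    focal: float          # focal length in pixels (assumed known)


def project(cam: Camera, X: Vec3) -> Vec2:
    """Project a world-space point into the image (pinhole assumption)."""
    # Move the point into camera coordinates: Xc = R * X + t
    xc = [sum(cam.R[i][j] * X[j] for j in range(3)) + cam.t[i] for i in range(3)]
    if xc[2] <= 0:
        raise ValueError("point is behind the camera")
    # Perspective divide, then scale by the focal length
    return (cam.focal * xc[0] / xc[2], cam.focal * xc[1] / xc[2])


def reprojection_error(cam: Camera, X: Vec3, observed: Vec2) -> float:
    """Pixel distance between the projected point and its observed image location."""
    u, v = project(cam, X)
    return ((u - observed[0]) ** 2 + (v - observed[1]) ** 2) ** 0.5


# One correspondence: a sparse 3D point seen at a given pixel in one photo.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
cam = Camera(R=identity, t=(0.0, 0.0, 0.0), focal=500.0)
point = (1.0, 2.0, 10.0)
print(project(cam, point))
print(reprojection_error(cam, point, (53.0, 104.0)))
```

Under these assumptions, a reconstruction is consistent when each 3D point projects close to the pixel where it was observed in every photo that sees it; the same projection is what would let an annotation attached to a 3D point be located in other images.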

Supplementary Material

High Resolution (p835-snavely-high.mov), 76.99 MB

Low Resolution (p835-snavely-low.mov), 28.37 MB

Index Terms

Photo tourism: exploring photo collections in 3D
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Reconstruction
  2. Computer graphics
    1. Graphics systems and interfaces
      1. Virtual reality
2. Human-centered computing
  1. Human computer interaction (HCI)
    1. Interaction paradigms
      1. Mixed / augmented reality
      2. Virtual reality

Published In

ACM Transactions on Graphics, Volume 25, Issue 3
July 2006
742 pages
ISSN: 0730-0301
EISSN: 1557-7368
DOI: 10.1145/1141911

Seminal Graphics Papers: Pushing the Boundaries, Volume 2
August 2023
893 pages
ISBN:9798400708978
DOI:10.1145/3596711
Editor:
Mary C. Whitton
Department of Computer Science, UNC Chapel Hill, USA

Copyright © 2006 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States


Badges

Seminal Paper
