Photo tourism: exploring photo collections in 3D

Published: 01 July 2006

Abstract

We present a system for interactively browsing and exploring large unstructured collections of photographs of a scene using a novel 3D interface. Our system consists of an image-based modeling front end that automatically computes the viewpoint of each photograph as well as a sparse 3D model of the scene and image-to-model correspondences. Our photo explorer uses image-based rendering techniques to smoothly transition between photographs, while also enabling full 3D navigation and exploration of the set of images and world geometry, along with auxiliary information such as overhead maps. Our system also makes it easy to construct photo tours of scenic or historic locations, and to annotate image details, which are automatically transferred to other relevant images. We demonstrate our system on several large personal photo collections as well as images gathered from Internet photo-sharing sites.

Supplementary Material

High Resolution (p835-snavely-high.mov)
Low Resolution (p835-snavely-low.mov)

Published In

ACM Transactions on Graphics, Volume 25, Issue 3
July 2006
742 pages
ISSN:0730-0301
EISSN:1557-7368
DOI:10.1145/1141911
  • ACM Overlay Books
    Seminal Graphics Papers: Pushing the Boundaries, Volume 2
    August 2023
    893 pages
    ISBN:9798400708978
    DOI:10.1145/3596711
    Editor: Mary C. Whitton
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Badges

  • Seminal Paper

Author Tags

  1. image-based modeling
  2. image-based rendering
  3. photo browsing
  4. structure from motion

Qualifiers

  • Article
