Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2047196.2047270acmconferencesArticle/Chapter ViewAbstractPublication PagesuistConference Proceedingsconference-collections
research-article

KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera

Published: 16 October 2011 Publication History

Abstract

KinectFusion enables a user holding and moving a standard Kinect camera to rapidly create detailed 3D reconstructions of an indoor scene. Only the depth data from Kinect is used to track the 3D pose of the sensor and reconstruct, geometrically precise, 3D models of the physical scene in real-time. The capabilities of KinectFusion, as well as the novel GPU-based pipeline are described in full. Uses of the core system for low-cost handheld scanning, and geometry-aware augmented reality and physics-based interactions are shown. Novel extensions to the core GPU pipeline demonstrate object segmentation and user interaction directly in front of the sensor, without degrading camera tracking or reconstruction. These extensions are used to enable real-time multi-touch interactions anywhere, allowing any planar or non-planar reconstructed physical surface to be appropriated for touch.

References

[1]
P. J. Besl and N. D. McKay. A method for registration of 3D shapes. IEEE Trans. Pattern Anal. Mach. Intell., 14:239--256, February 1992.
[2]
X. Cao and R. Balakrishnan. Interacting with dynamically defined information spaces using a handheld projector and a pen. In UIST, pages 225--234, 2006.
[3]
Y. Chen and G. Medioni. Object modeling by registration of multiple range images. Image and Vision Computing (IVC), 10(3):145--155, 1992.
[4]
Y. Cui et al. 3d shape scanning with a time-of-flight camera. In Computer Vision and Pattern Recognition (CVPR), pages 1173 --1180, June 2010.
[5]
B. Curless and M. Levoy. A volumetric method for building complex models from range images. ACM Trans. Graph., 1996.
[6]
S. Farsiu et al. Fast and robust multiframe super resolution. IEEE Transactions on Image Processing, 13(10):1327--1344, 2004.
[7]
J. Frahm et al. Building Rome on a cloudless day. In Proc. Europ. Conf. on Computer Vision (ECCV), 2010.
[8]
B. Freedman, A. Shpunt, M. Machline, and Y. Arieli. Depth Mapping Using Projected Patterns. Patent Application, 10 2008. WO 2008/120217 A2.
[9]
S. L. Grand. Broad-phase collision detection with CUDA. In GPU Gems 3. Addison-Wesley, 2007.
[10]
T. Harada. Real-time rigid body simulation on gpus. In GPU Gems 3. Addison-Wesley Professional, 2007.
[11]
R. Hartley and A. Zisserman. Multiple View Geometry in Computer Vision. Cambridge University Press, second edition, 2004.
[12]
P. Henry et al. RGB-D mapping: Using depth cameras for dense 3D modeling of indoor environments. In Proc. of the Int. Symposium on Experimental Robotics (ISER), 2010.
[13]
B. Huhle et al. Fusion of range and color images for denoising and resolution enhancement with a non-local filter. Computer Vision and Image Understanding, 114(12):1336--1345, 2010.
[14]
M. Kazhdan, M. Bolitho, and H. Hoppe. Poisson surface reconstruction. In Proc. of the Eurographics Symposium on Geometry Processing, 2006.
[15]
G. Klein and D. W. Murray. Parallel tracking and mapping for small AR workspaces. In ISMAR, 2007.
[16]
M. Levoy et al. The digital Michelangelo Project: 3D scanning of large statues. ACM Trans. Graph., 2000.
[17]
K. Low. Linear least-squares optimization for point-to-plane icp surface registration. Technical report, TR04-004, University of North Carolina, 2004.
[18]
P. Merrell et al. Real-time visibility-based fusion of depth maps. In Proc. of the Int. Conf. on Computer Vision (ICCV), 2007.
[19]
R. A. Newcombe and A. J. Davison. Live dense reconstruction with a single moving camera. In Proc. of the IEEE (CVPR), 2010.
[20]
R. A. Newcombe, S. Lovegrove, and A. J. Davison. Dense tracking and mapping in real-time. In Proc. of the Int. Conf. on Computer Vision (ICCV), 2011.
[21]
R. A. Newcombe et al. Real-Time Dense Surface Mapping and Tracking with Kinect. In ISMAR, 2011.
[22]
S. Osher and R. Fedkiw. Level Set Methods and Dynamic Implicit Surfaces. Springer, 2002.
[23]
S. Rusinkiewicz, O. Hall-Holt, and M. Levoy. Real-time 3D model acquisition. ACM Trans. Graph., 2002.
[24]
S. Rusinkiewicz and M. Levoy. Efficient variants of the ICP algorithm. 3D Digital Imaging and Modeling, Int. Conf. on, 0:145, 2001.
[25]
S. Thrun. Robotic mapping: A survey. In Exploring Artificial Intelligence in the New Millenium. 2002.
[26]
D. Vlasic et al. Dynamic shape capture using multi-view photometric stereo. ACM Trans. Graph., 28(5), 2009.
[27]
D. Wagner, T. Langlotz, and D. Schmalstieg. Robust and unobtrusive marker tracking on mobile phones. In ISMAR, pages 121--124, 2008.
[28]
T. Weise, T. Wismer, B. Leibe, and L. V. Gool. In-hand scanning with online loop closure. In IEEE Int. Workshop on 3-D Digital Imaging and Modeling, 2009.
[29]
K. Zhou, M. Gong, X. Huang, and B. Guo. Data-parallel octrees for surface reconstruction. IEEE Trans. on Visualization and Computer Graphics, 17, 2011.

Cited By

View all
  • (2024)СЕНСОРИ ТА МЕТОДИ ВІЗУАЛЬНОЇ НАВІГАЦІЇ ДЛЯ АВТОНОМНИХ БПЛАGrail of Science10.36074/grail-of-science.02.08.2024.046(333-335)Online publication date: 7-Aug-2024
  • (2024)Applications of 3D Reconstruction in Virtual Reality-Based Teleoperation: A Review in the Mining IndustryTechnologies10.3390/technologies1203004012:3(40)Online publication date: 15-Mar-2024
  • (2024)Three-Dimensional Dense Reconstruction: A Review of Algorithms and DatasetsSensors10.3390/s2418586124:18(5861)Online publication date: 10-Sep-2024
  • Show More Cited By

Index Terms

  1. KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      UIST '11: Proceedings of the 24th annual ACM symposium on User interface software and technology
      October 2011
      654 pages
      ISBN:9781450307161
      DOI:10.1145/2047196
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 16 October 2011

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. 3D
      2. AR
      3. GPU
      4. depth cameras
      5. geometry-aware interactions
      6. physics
      7. surface reconstruction
      8. tracking

      Qualifiers

      • Research-article

      Conference

      UIST '11

      Acceptance Rates

      UIST '11 Paper Acceptance Rate 67 of 262 submissions, 26%;
      Overall Acceptance Rate 561 of 2,567 submissions, 22%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)671
      • Downloads (Last 6 weeks)67
      Reflects downloads up to 17 Oct 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)СЕНСОРИ ТА МЕТОДИ ВІЗУАЛЬНОЇ НАВІГАЦІЇ ДЛЯ АВТОНОМНИХ БПЛАGrail of Science10.36074/grail-of-science.02.08.2024.046(333-335)Online publication date: 7-Aug-2024
      • (2024)Applications of 3D Reconstruction in Virtual Reality-Based Teleoperation: A Review in the Mining IndustryTechnologies10.3390/technologies1203004012:3(40)Online publication date: 15-Mar-2024
      • (2024)Three-Dimensional Dense Reconstruction: A Review of Algorithms and DatasetsSensors10.3390/s2418586124:18(5861)Online publication date: 10-Sep-2024
      • (2024)Virtual Experience Toolkit: An End-to-End Automated 3D Scene Virtualization Framework Implementing Computer Vision TechniquesSensors10.3390/s2412383724:12(3837)Online publication date: 13-Jun-2024
      • (2024)Visual SLAM for Unmanned Aerial Vehicles: Localization and PerceptionSensors10.3390/s2410298024:10(2980)Online publication date: 8-May-2024
      • (2024)HALNet: Partial Point Cloud Registration Based on Hybrid Attention and Deep Local FeaturesSensors10.3390/s2409276824:9(2768)Online publication date: 26-Apr-2024
      • (2024)A Comprehensive Review of Vision-Based 3D Reconstruction MethodsSensors10.3390/s2407231424:7(2314)Online publication date: 5-Apr-2024
      • (2024)RGBTSDF: An Efficient and Simple Method for Color Truncated Signed Distance Field (TSDF) Volume Fusion Based on RGB-D ImagesRemote Sensing10.3390/rs1617318816:17(3188)Online publication date: 29-Aug-2024
      • (2024)Research Advances and Prospects of Underwater Terrain-Aided NavigationRemote Sensing10.3390/rs1614256016:14(2560)Online publication date: 12-Jul-2024
      • (2024)Interactive Mesh Sculpting with Arbitrary Topologies in Head-Mounted VR EnvironmentsMathematics10.3390/math1215242812:15(2428)Online publication date: 5-Aug-2024
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media