Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3123266.3123348acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Real-time Monocular Dense Mapping for Augmented Reality

Published: 19 October 2017 Publication History

Abstract

Monocular simultaneous localization and mapping (SLAM) is a key enabling technique for many augmented reality (AR) applications. However, conventional methods for monocular SLAM can obtain only sparse or semi-dense maps in highly-textured image areas. Poorly-textured regions which widely exist in indoor and man-made urban environments can be hardly reconstructed, impeding interactions between virtual objects and real scenes in AR apps. In this paper,we present a novel method for real-time monocular dense mapping based on the piecewise planarity assumption for poorly textured regions. Specifically, a semi-dense map for highly-textured regions is first calculated by pixel matching and triangulation [6, 7]. Large textureless regions extracted by Maximally Stable Color Regions (MSCR) [11], which is a homogeneous-color region detector, are approximated using piecewise planar models which are estimated by the corresponding semi-dense 3D points and the proposed multi-plane segmentation algorithm. Plane models associated with the same 3D area across multiple overlapping views are linked and fused to ensure a consistent and accurate 3D reconstruction. Experimental results on two public datasets [15, 23] demonstrate that our method is 2.3X~2.9X faster than the state-of-the-art method DPPTAM [2], and meanwhile achieves better reconstruction accuracy and completeness. We also apply our method to a real AR application and live experiments with a hand-held camera demonstrate the effectiveness and efficiency of our method in practical scenario.

References

[1]
Alejo Concha and Javier Civera. 2014. Using superpixels in monocular SLAM. In Robotics and Automation (ICRA), 2014 IEEE International Conference on. IEEE, 365--372.
[2]
Alejo Concha and Javier Civera. 2015. DPPTAM: Dense piecewise planar tracking and mapping from a monocular sequence. In Intelligent Robots and Systems (IROS), 2015 IEEE/RSJ International Conference on. IEEE, 5686--5693.
[3]
Amaury Dame, Victor A Prisacariu, Carl Y Ren, and Ian Reid. 2013. Dense reconstruction using 3D object shape priors. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1288--1295.
[4]
David Eigen, Christian Puhrsch, and Rob Fergus. 2014. Depth map prediction from a single image using a multi-scale deep network. In Advances in neural information processing systems. 2366--2374.
[5]
Jakob Engel, Vladlen Koltun, and Daniel Cremers. 2016. Direct sparse odometry.arXiv preprint arXiv:1607.02565 (2016).
[6]
Jakob Engel, Thomas Schops, and Daniel Cremers. 2014. LSD-SLAM: Large-scale direct monocular SLAM. In European Conference on Computer Vision. Springer,834--849.
[7]
Jakob Engel, Jurgen Sturm, and Daniel Cremers. 2013. Semi-dense visual odometry for a monocular camera. In Proceedings of the IEEE international conference on computer vision. 1449--1456.
[8]
Pedro F Felzenszwalb and Daniel P Huttenlocher. 2004. Efficient graph-based image segmentation. International journal of computer vision 59, 2 (2004), 167--181.
[9]
Martin A Fischler and Robert C Bolles. 1981. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24, 6 (1981), 381--395.
[10]
Alex Flint, David Murray, and Ian Reid. 2011. Manhattan scene understanding using monocular, stereo, and 3d features. In Computer Vision (ICCV), 2011 IEEE International Conference on. IEEE, 2228--2235.
[11]
Per-Erik Forssén. 2007. Maximally stable colour regions for recognition and matching. In Computer Vision and Pattern Recognition, 2007. CVPR'07. IEEE Conference on. IEEE, 1--8.
[12]
Christian Forster, Matia Pizzoli, and Davide Scaramuzza. 2014. SVO: Fast semidirect monocular visual odometry. In Robotics and Automation (ICRA), 2014 IEEE International Conference on. IEEE, 15--22.
[13]
Ravi Garg, Gustavo Carneiro, and Ian Reid. 2016. Unsupervised CNN for single view depth estimation: Geometry to the rescue. In European Conference on Computer Vision. Springer, 740--56.
[14]
W Nicholas Greene, Kyel Ok, Peter Lommel, and Nicholas Roy. 2016. Multi-level mapping: Real-time dense monocular SLAM. In Robotics and Automation (ICRA), 2016 IEEE International Conference on. IEEE, 833--840.
[15]
Ankur Handa, Richard Newcombe, Adrien Angeli, and Andrew Davison. 2012. Real-time camera tracking: When is high frame-rate best? Computer Vision? ECCV 2012 (2012), 222--235.
[16]
Christian Kerl, Jügen Sturm, and Daniel Cremers. 2013. Robust odometry estimation for RGB-D cameras. In Robotics and Automation (ICRA), 2013 IEEE International Conference on. IEEE, 3748--3754.
[17]
Georg Klein and David Murray. 2007. Parallel tracking and mapping for small AR workspaces. In Mixed and Augmented Reality, 2007. ISMAR 2007. 6th IEEE and ACM International Symposium on. IEEE, 225--234.
[18]
Abhijit Kundu, Yin Li, Frank Dellaert, Fuxin Li, and James M Rehg. 2014. Joint semantic segmentation and 3d reconstruction from monocular video. In European Conference on Computer Vision. Springer, 703--718.
[19]
Fayao Liu, Chunhua Shen, Guosheng Lin, and Ian Reid. 2016. Learning depth from single monocular images using deep convolutional neural fields. IEEE transactions on pattern analysis and machine intelligence 38, 10 (2016), 2024--2039.
[20]
Raul Mur-Artal, Jose Maria Martinez Montiel, and Juan D Tardos. 2015. ORB-SLAM: a versatile and accurate monocular SLAM system. IEEE Transactions on Robotics 31, 5 (2015), 1147--1163.
[21]
Richard A Newcombe, Steven J Lovegrove, and Andrew J Davison. 2011. DTAM: Dense tracking and mapping in real-time. In Computer Vision (ICCV), 2011 IEEE International Conference on. IEEE, 2320--2327.
[22]
Sunando Sengupta and Paul Sturgess. 2015. Semantic octree: Unifying recognition, reconstruction and representation via an octree constrained higher order MRF. In Robotics and Automation (ICRA), 2015 IEEE International Conference on. IEEE, 1874--1879.
[23]
Jügen Sturm, Nikolas Engelhard, Felix Endres, Wolfram Burgard, and Daniel Cremers. 2012. A benchmark for the evaluation of RGB-D SLAM systems. In Intelligent Robots and Systems (IROS), 2012 IEEE/RSJ International Conference on. IEEE, 573--580.

Cited By

View all
  • (2021)Digitalization as solution to environmental problems? When users rely on augmented reality-recommendation agentsJournal of Business Research10.1016/j.jbusres.2021.02.019128(510-523)Online publication date: May-2021
  • (2021)Monocular Dense SLAM with Consistent Deep Depth PredictionAdvances in Computer Graphics10.1007/978-3-030-89029-2_9(113-124)Online publication date: 11-Oct-2021
  • (2020)DENAO: Monocular Depth Estimation Network with Auxiliary Optical FlowIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2020.2977021(1-1)Online publication date: 2020
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '17: Proceedings of the 25th ACM international conference on Multimedia
October 2017
2028 pages
ISBN:9781450349062
DOI:10.1145/3123266
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 October 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. augmented reality
  2. monocular dense mapping
  3. multiplane segmentation
  4. plane model

Qualifiers

  • Research-article

Funding Sources

Conference

MM '17
Sponsor:
MM '17: ACM Multimedia Conference
October 23 - 27, 2017
California, Mountain View, USA

Acceptance Rates

MM '17 Paper Acceptance Rate 189 of 684 submissions, 28%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)14
  • Downloads (Last 6 weeks)1
Reflects downloads up to 08 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2021)Digitalization as solution to environmental problems? When users rely on augmented reality-recommendation agentsJournal of Business Research10.1016/j.jbusres.2021.02.019128(510-523)Online publication date: May-2021
  • (2021)Monocular Dense SLAM with Consistent Deep Depth PredictionAdvances in Computer Graphics10.1007/978-3-030-89029-2_9(113-124)Online publication date: 11-Oct-2021
  • (2020)DENAO: Monocular Depth Estimation Network with Auxiliary Optical FlowIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2020.2977021(1-1)Online publication date: 2020
  • (2019)Real-Time Dense Monocular SLAM With Online Adapted Depth Prediction NetworkIEEE Transactions on Multimedia10.1109/TMM.2018.285903421:2(470-483)Online publication date: 1-Feb-2019
  • (2018)Monocular Camera Based Real-Time Dense Mapping Using Generative Adversarial NetworkProceedings of the 26th ACM international conference on Multimedia10.1145/3240508.3240564(896-904)Online publication date: 15-Oct-2018
  • (2018)A Method of Registering Virtual Objects in Monocular Augmented Reality SystemImage and Graphics Technologies and Applications10.1007/978-981-13-1702-6_48(483-493)Online publication date: 12-Aug-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media