Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Learning from Synthetic Point Cloud Data for Historical Buildings Semantic Segmentation

Published: 03 December 2020 Publication History
  • Get Citation Alerts
  • Abstract

    Historical heritage is demanding robust pipelines for obtaining Heritage Building Information Modeling models that are fully interoperable and rich in their informative content. The definition of efficient Scan-to-BIM workflows represent a very important step toward a more efficient management of the historical real estate, as creating structured three-dimensional (3D) models from point clouds is complex and time-consuming. In this scenario, semantic segmentation of 3D Point Clouds is gaining more and more attention, since it might help to automatically recognize historical architectural elements. The way paved by recent Deep Learning approaches proved to provide reliable and affordable degrees of automation in other contexts, as road scenes understanding. However, semantic segmentation is particularly challenging in historical and classical architecture, due to the shapes complexity and the limited repeatability of elements across different buildings, which makes it difficult to define common patterns within the same class of elements. Furthermore, as Deep Learning models requires a considerably large amount of annotated data to be trained and tuned to properly handle unseen scenes, the lack of (big) publicly available annotated point clouds in the historical building domain is a huge problem, which in fact blocks the research in this direction. However, creating a critical mass of annotated point clouds by manual annotation is very time-consuming and impractical. To tackle this issue, in this work we explore the idea of leveraging synthetic point cloud data to train Deep Learning models to perform semantic segmentation of point clouds obtained via Terrestrial Laser Scanning. The aim is to provide a first assessment of the use of synthetic data to drive Deep Learning--based semantic segmentation in the context of historical buildings. To achieve this purpose, we present an improved version of the Dynamic Graph CNN (DGCNN) named RadDGCNN. The main improvement consists on exploiting the radius distance. In our experiments, we evaluate the trained models on synthetic dataset (publicly available) about two different historical buildings: the Ducal Palace in Urbino, Italy, and Palazzo Ferretti in Ancona, Italy. RadDGCNN yields good results, demonstrating improved segmentation performances on the TLS real datasets.

    References

    [1]
    Fabrizio Ivan Apollonio, Simone Baldissini, Paolo Clini, Marco Gaiani, Caterina Palestini, and Camillo Trevisan. 2013. The palladiolibrary geo-models: An open 3D archive to manage and visualize information-communication resources about palladio. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives 40, 5W2 (2013), 49--54.
    [2]
    Fabrizio Ivan Apollonio, Marco Gaiani, and Zheng Sun. 2012. BIM-based modeling and data enrichment of classical architectural buildings. Sci. Res. Inf. Technol. 2, 2 (2012), 41--62.
    [3]
    Iro Armeni, Ozan Sener, Amir R. Zamir, Helen Jiang, Ioannis Brilakis, Martin Fischer, and Silvio Savarese. 2016. 3D semantic parsing of large-scale indoor spaces. In Proceedings of the 29th IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16). IEEE, Los Alamitos, CA, 1534--1543.
    [4]
    Matan Atzmon, Haggai Maron, and Yaron Lipman. 2018. Point convolutional neural networks by extension operators. arxiv:1803.10091. Retrieved from http://arxiv.org/abs/1803.10091.
    [5]
    Paul F. Aubin. 2014. Renaissance Revit: Creating Classical Architecture with Modern Software. G3B Press, CreateSpace Independent Publishing Platform.
    [6]
    Carlo Bianchini, Carlo Inglese, Alfonso Ippolito, Daniele Maiorino, and Luca James Senatore. 2017. Building information modeling (BIM): Great misunderstanding or potential opportunities for the design disciplines? In Handbook of Research on Emerging Technologies for Digital Preservation and Information Modeling. IGI Global, 67--90.
    [7]
    Fabio Bianconi, Marco Filippucci, and Federica Magi Meconi. 2018. Parametrical vitruvius. generative modeling of the architectural orders. SCIRES-IT 8, 2 (2018), 29--48.
    [8]
    Paolo Clini, Ramona Quattrini, Paolo Bonvini, Romina Nespeca, Renato Angeloni, Raissa Mammoli, Aldo Franco Dragoni, Christian Morbidoni, Paolo Sernani, Maura Mengoni, Alma Leopardi, Mauro Silvestrini, Danilo Gambelli, Enrico Cori, Marco Gallegati, Massimo Tamberi, Fabio Fraticelli, Maria Cristina Acciarri, and Serena Mandolesi. 2020. Digit(al)isation in museums: Civitas project -- AR, VR, multisensorial and multiuser experiences at the Urbino’s Ducal palace. In Virtual and Augmented Reality in Education, Art, and Museums. 194--228.
    [9]
    Haoqiang Fan, Hao Su, and Leonidas J. Guibas. 2017. A point set generation network for 3d object reconstruction from a single image. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Institute of Electrical and Electronics Engineers Inc., 605--613.
    [10]
    Marco Gaiani, Fabrizio I. Apollonio, Paolo Clini, and Ramona Quattrini. 2015. A mono-instrumental approach to high-quality 3D reality-based semantic models application on the PALLADIO library. In Proceedings of the 2nd Digital Heritage International Congress Digital Heritage, Vol. 2. Institute of Electrical and Electronics Engineers Inc., 29--36.
    [11]
    Andreas Geiger, Philip Lenz, and Raquel Urtasun. 2012. Are we ready for autonomous driving? the KITTI vision benchmark suite. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Institute of Electrical and Electronics Engineers Inc., 3354--3361.
    [12]
    David Griffiths and Jan Boehm. 2019. A review on deep learning techniques for 3D sensed data classification. Remote Sens. 11, 12 (2019), 1499.
    [13]
    David Griffiths and Jan Boehm. 2019. SynthCity: A large scale synthetic point cloud. arxiv:1907.04758. Retrieved from https://arxiv.org/abs/1907.04758.
    [14]
    Pedro Hermosilla, Tobias Ritschel, Pere-Pau Vázquez, Àlvar Vinacua, and Timo Ropinski. 2018. Monte carlo convolution for learning on non-uniformly sampled point clouds. ACM Trans. Graph. 37, 6 (2018).
    [15]
    Max Jaderberg, Karen Simonyan, Andrea Vedaldi, and Andrew Zisserman. 2014. Synthetic data and artificial neural networks for natural scene text recognition. arxiv:1406.2227. Retrieved from https://arxiv.org/abs/1406.2227.
    [16]
    Vladimir Kniaz, V. S. Gorbatsevich, and Vladimir Mizginov. 2017. Thermalnet: A deep convolutional network for synthetic thermal image generation. The International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences 42, W4 (2017), 41.
    [17]
    Zhuqiang Li, Liqiang Zhang, Ruofei Zhong, Tian Fang, Liang Zhang, and Zhenxin Zhang. 2016. Classification of urban point clouds: A robust supervised approach with automatically generating training data. IEEE J. Select. Top. Appl. Earth Observ. Remote Sens. 10, 3 (2016), 1207--1220.
    [18]
    Eva Savina Malinverni, Roberto Pierdicca, Marina Paolanti, Massimo Martini, Christian Morbidoni, Francesca Matrone, and Andrea Lingua. 2019. Deep learning for semantic segmentation of 3D point cloud. International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives 42, 2/W15 (2019), 735--742.
    [19]
    Jiří Martínek, Ladislav Lenc, and Pavel Král. 2019. Training strategies for OCR systems for historical documents. In Proceedings of the IFIP International Conference on Artificial Intelligence Applications and Innovations. Springer, 362--373.
    [20]
    Edward F. Miller and Richard A. Melton. 1975. Automated generation of testcase datasets. ACM SIGPLAN Not. 10, 6 (1975), 51--58.
    [21]
    A. Moreira, R. Quattrini, G. Maggiolo, and R. Mammoli. 2018. HBIM methodology as a bridge between italy and argentina. ISPRS Arch. 42, 2 (2018).
    [22]
    G. Myers. 1999. A dataset generator for whole genome shotgun sequencing. In Proceedings of the International Conference on Intelligent Systems for Molecular Biology, ISMB. AAAI, 202--210.
    [23]
    R. Nespeca. 2018. Towards a 3D digital model for management and fruition of ducal palace at urbino. an integrated survey with mobile mapping. SCIRES-IT 8, 2 (2018), 1--14.
    [24]
    Yaling Pei and Osmar Zaïane. 2006. A Synthetic Data Generator for Clustering and Outlier Analysis. Technical Report. University of Alberta, Edmonton, Canada.
    [25]
    Roberto Pierdicca, Marco Mameli, Eva Savina Malinverni, Marina Paolanti, and Emanuele Frontoni. 2019. Automatic generation of point cloud synthetic dataset for historical building representation. In Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 11613 (2019), 203--219.
    [26]
    Roberto Pierdicca, Marina Paolanti, Francesca Matrone, Massimo Martini, Christian Morbidoni, Eva Savina Malinverni, Emanuele Frontoni, and Andrea Lingua. 2020. Point cloud semantic segmentation using a deep learning framework for cultural heritage. Remote Sens. 12, 6 (2020).
    [27]
    Charles R. Qi, Hao Su, Kaichun Mo, and Leonidas J. Guibas. 2017. Pointnet: Deep learning on point sets for 3d classification and segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Institute of Electrical and Electronics Engineers Inc., 652--660.
    [28]
    Charles Ruizhongtai Qi, Li Yi, Hao Su, and Leonidas J. Guibas. 2017. Pointnet++: Deep hierarchical feature learning on point sets in a metric space. In Advances in Neural Information Processing Systems. 5099--5108.
    [29]
    Ramona Quattrini, Paolo Clini, Romina Nespeca, and Ludovico Ruggeri. 2016. Measurement and historical information building: Challenges and opportunities in the representation of semantically structured 3D content. Disegnarecon 9, 16 (2016), 14--1.
    [30]
    Ramona Quattrini, Roberto Pierdicca, Christian Morbidoni, and Eva Savina Malinverni. 2017. Conservation-OrientedHBIM. The BIMEXPLORER Web Tool. International Archives of the Photogrammetry, Remote Sensing 8 Spatial Information Sciences 42, 5W1 (2017), 275--28.
    [31]
    Ramona Quattrini, Roberto Pierdicca, Christian Morbidoni, and Eva Savina Malinverni. 2017. Conservation-oriented HBIM. The BIMEXPLORER web tool. ISPRS Arch. 42, 5W1 (2017), 275--281.
    [32]
    German Ros, Laura Sellart, Joanna Materzynska, David Vazquez, and Antonio M. Lopez. 2016. The SYNTHIA Dataset: A large collection of synthetic images for semantic segmentation of urban scenes. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 2016. IEEE Computer Society, Los Alamitos, CA, 3234--3243.
    [33]
    Ha Tran and Kourosh Khoshelham. 2019. Building change detection through comparison of a lidar scan with a building information model. ISPRS Arch. 42, 2/W13 (2019), 889--893.
    [34]
    Fei Wang, Yan Zhuang, Hong Gu, and Huosheng Hu. 2019. Automatic generation of synthetic LiDAR point clouds for 3-D data analysis. IEEE Trans. Instrum. Measure. 68, 7 (2019), 2671--2673.
    [35]
    Tao Wang, David J. Wu, Adam Coates, and Andrew Y. Ng. 2012. End-to-end text recognition with convolutional neural networks. In Proceedings of the 21st International Conference on Pattern Recognition (ICPR’12). IEEE, 3304--3308.
    [36]
    Yue Wang, Yongbin Sun, Ziwei Liu, Sanjay E. Sarma, Michael M. Bronstein, and Justin M. Solomon. 2019. Dynamic graph cnn for learning on point clouds. ACM Trans. Graph. 38, 5 (2019).
    [37]
    Bichen Wu, Xuanyu Zhou, Sicheng Zhao, Xiangyu Yue, and Kurt Keutzer. 2019. SqueezeSegV2: Improved model structure and unsupervised domain adaptation for road-object segmentation from a LiDAR point cloud. In Proceedings of the IEEE International Conference on Robotics and Automation, Vol. 2019-May. 4376--4382.
    [38]
    Zexiang Xu, Kalyan Sunkavalli, Sunil Hadap, and Ravi Ramamoorthi. 2018. Deep image-based relighting from optimal sparse samples. ACM Trans. Graph. 37, 4 (2018), 1--13.

    Cited By

    View all
    • (2024)Enhancing point cloud semantic segmentation in the data‐scarce domain of industrial plants through synthetic dataComputer-Aided Civil and Infrastructure Engineering10.1111/mice.13153Online publication date: 9-Jan-2024
    • (2024)Semantic segmentation of satellite images with different building types using deep learning methodsRemote Sensing Applications: Society and Environment10.1016/j.rsase.2024.10117634(101176)Online publication date: Apr-2024
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Journal on Computing and Cultural Heritage
    Journal on Computing and Cultural Heritage   Volume 13, Issue 4
    Special Issue on Culture Games and Regular Papers
    December 2020
    208 pages
    ISSN:1556-4673
    EISSN:1556-4711
    DOI:10.1145/3441387
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 03 December 2020
    Accepted: 01 May 2020
    Received: 01 February 2020
    Published in JOCCH Volume 13, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Deep learning
    2. cultural heritage
    3. dynamic graph convolutional neural network
    4. historical building
    5. point cloud semantic segmentation
    6. radius distance
    7. scan-to-BIM
    8. synthetic point cloud

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    • CIVITAS

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)128
    • Downloads (Last 6 weeks)9

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Enhancing point cloud semantic segmentation in the data‐scarce domain of industrial plants through synthetic dataComputer-Aided Civil and Infrastructure Engineering10.1111/mice.13153Online publication date: 9-Jan-2024
    • (2024)Semantic segmentation of satellite images with different building types using deep learning methodsRemote Sensing Applications: Society and Environment10.1016/j.rsase.2024.10117634(101176)Online publication date: Apr-2024
    • (2024)Semantic segmentation of large-scale point clouds by integrating attention mechanisms and transformer modelsImage and Vision Computing10.1016/j.imavis.2024.105019(105019)Online publication date: Apr-2024
    • (2024)Automatic generation of synthetic heritage point clouds: Analysis and segmentation based on shape grammar for historical vaultsJournal of Cultural Heritage10.1016/j.culher.2023.10.00366(37-47)Online publication date: Mar-2024
    • (2024)InfrastructureHandbook of Digital 3D Reconstruction of Historical Architecture10.1007/978-3-031-43363-4_9(189-198)Online publication date: 19-Apr-2024
    • (2023)H-BIM and Artificial Intelligence: Classification of Architectural Heritage for Semi-Automatic Scan-to-BIM ReconstructionSensors10.3390/s2305249723:5(2497)Online publication date: 23-Feb-2023
    • (2023)Three-Dimensional Point Cloud Semantic Segmentation for Cultural Heritage: A Comprehensive ReviewRemote Sensing10.3390/rs1503054815:3(548)Online publication date: 17-Jan-2023
    • (2023)Advancements in 3D Heritage Data Aggregation and Enrichment in Europe: Implications for Designing the Jena Experimental Repository for the DFG 3D ViewerApplied Sciences10.3390/app1317978113:17(9781)Online publication date: 29-Aug-2023
    • (2023)Knowledge graph representation method for semantic 3D modeling of Chinese grottoesHeritage Science10.1186/s40494-023-01084-211:1Online publication date: 18-Dec-2023
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media