Abstract
Building footprints are the most visible features in urban areas. Detecting building footprint has a substantial position in decision-making problems such as city planning and development, urban mapping and management, population estimation, etc. In this paper, we aim to automatically detect and count building footprints by leveraging deep learning techniques and the potential availability of remote sensing datasets at high spatial resolutions. We build object detection deep learning models by integrating the partitioning segmentation and convolutional neural networks (CNN) architecture using the WorldView-3 very high-resolution optical satellite imagery data. Our detection model is implemented on data from the Paris, France, and Khartoum, Sudan regions. These regions were chosen because it has very different characteristics and large data variation, making it challenging to extract building features from satellite imagery. This challenge is even greater because one area is taken perpendicularly and the other area is taken at an angle of several degrees which causes more noise, one of which is the shadow of the building. The detection models can detect and count the building footprints and have promising performance on large variable data with an Average Precision (AP) of 64.19%. Paris was detected better by 74.66% compared to Khartoum by 56.19%. The partitioning segmentation technique is used to tune the anchor boxes of CNN input images. Our result offers a potential practical implementation for rapid yet accurate estimation of urban monitoring and city planning, particularly in metropolitan regions.
Similar content being viewed by others
Data availability
Data derived from public domain resources, i.e. https://spacenet.ai/datasets/.
References
Afira N, Wijayanto AW (2022) Mono-temporal and multi-temporal approaches for burnt area detection using Sentinel-2 satellite imagery (a case study of Rokan Hilir Regency, Indonesia). Ecol Inform 69:101677
Agarap AF (2018) Deep learning using rectified linear units (ReLU). arXiv: 1803.08375. https://doi.org/10.48550/arXiv.1803.08375
Agarwal S, Terrail JO du, Jurie F (2018) Recent advances in object detection in the age of deep convolutional neural networks. arXiv: 1809.03193. https://doi.org/10.48550/arXiv.1809.03193
Alganci U, Soydas M, Sertel E (2020) Comparative Research on Deep Learning Approaches for Airplane Detection from Very High-Resolution Satellite Images. Remote Sens (Basel) 12, 458
Alidoost F, Arefi H (2018) A CNN-Based Approach for Automatic Building Detection and Recognition of Roof Types Using a Single Aerial Image. PFG – J Photogramm Remote Sens Geoinform Sci 86, 235–248
Awrangjeb M, Fraser C (2014) Automatic Segmentation of Raw LIDAR Data for Extraction of Building Roofs. Remote Sens (basel) 6:3716–3751
Belgiu M, Drǎguţ L (2014) Comparing supervised and unsupervised multiresolution segmentation approaches for extracting buildings from very high resolution imagery. ISPRS J Photogramm Remote Sens 96:67–75
Benali A, Dermeche H, Belhadj S, Adnane A, Hanifi Elhachemi Amar R (2014) Buildings extraction of very high spatial resolution satellite images. in 2014 International Conference on Multimedia Computing and Systems (ICMCS) 277–282 (IEEE). https://doi.org/10.1109/ICMCS.2014.6911229.
Bochkovskiy A (2016) darknet. https://github.com/AlexeyAB
Bonczak B, Kontokosta CE (2019) Large-scale parameterization of 3D building morphology in complex urban landscapes using aerial LiDAR and city administrative data. Comput Environ Urban Syst 73:126–142
Chartock E, LaRow W, Singh V (2017) Extraction of building footprints from satellite imagery. Stanford University Report. http://vision.stanford.edu/teaching/cs231n/reports/2017/pdfs/550.pdf. Accessed 1 Mar 2022
Cheng L, Gong J, Li M, Liu Y (2011) 3D Building Model Reconstruction from Multi-view Aerial Imagery and Lidar Data. Photogramm Eng Remote Sens 77:125–139
Das SK, PS P, Aithal B (2018) Automated Building Extraction using High Resolution Satellite Imagery though Ensemble Modelling and Machine Learning. Remote Sensing of Land 2, 31–46
Dorninger P, Pfeifer N (2008) A Comprehensive Automated 3D Approach for Building Extraction, Reconstruction, and Regularization from Airborne Laser Scanning Point Clouds. Sensors 8:7323–7343
Gao J, Chen Y, Wei Y, Li J (2021) Detection of Specific Building in Remote Sensing Images Using a Novel YOLO-S-CIOU Model. Case: Gas Station Identification. Sensors 21, 1375
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. in 2014 IEEE Conference on Computer Vision and Pattern Recognition 580–587 (IEEE). https://doi.org/10.1109/CVPR.2014.81
Guo M, Liu H, Xu Y, Huang Y (2020) Building Extraction Based on U-Net with an Attention Block and Multiple Losses. Remote Sens (Basel) 12:1400
Haala N, Brenner C (1999) Extraction of buildings and trees in urban environments. ISPRS J Photogramm Remote Sens 54:130–137
Han Q, Yin Q, Zheng X, Chen Z (2021) Remote sensing image building detection method based on Mask R-CNN. Complex IntellSyst https://doi.org/10.1007/s40747-021-00322-z
Huang X, Zhang L (2012) Morphological Building/Shadow Index for Building Extraction From High-Resolution Imagery Over Urban Areas. IEEE J Sel Top Appl Earth Obs Remote Sens 5:161–172
Ide H, Kurita T (2017) Improvement of learning for CNN with ReLU activation by sparse regularization. in 2017 International Joint Conference on Neural Networks (IJCNN) 2684–2691 (IEEE). https://doi.org/10.1109/IJCNN.2017.7966185
Ioffe S, Szegedy, C (2015) Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. CoRR abs/1502.03167
Johnson M. et al. (2016) Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation. CoRR abs/1611.04558
Joshi KK, Kono T (2009) Optimization of floor area ratio regulation in a growing city. Reg Sci Urban Econ 39:502–511
Khoshboresh-Masouleh M, Shah-Hosseini R (2021) A Deep Multi-Modal Learning Method and a New RGB-Depth Data Set for Building Roof Extraction. Photogramm Eng Remote Sensing 87:759–766
Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet Classification with Deep Convolutional Neural Networks. in Advances in Neural Information Processing Systems (eds. Pereira, F., Burges, C. J. C., Bottou, L. & Weinberger, K. Q.) vol. 25 (Curran Associates, Inc.)
Liu L et al (2020) Deep Learning for Generic Object Detection: A Survey. Int J Comput vis 128:261–318
Liu L et al (2021) Learning region-guided scale-aware feature selection for object detection. Neural Comput Appl 33:6389–6403
Liu W et al. (2016) SSD: Single Shot MultiBox Detector. in 21–37. https://doi.org/10.1007/978-3-319-46448-0_2
Maas HG, Vosselman G (1999) Two algorithms for extracting building models from raw laser altimetry data. ISPRS J Photogramm Remote Sens 54:153–163
Monna F. et al. (2021) Deep learning to detect built cultural heritage from satellite imagery. Spatial distribution and size of vernacular houses in Sumba, Indonesia. J Cult Herit 52, 171–183
Nurmasari Y, Wijayanto AW (2021) Oil Palm Plantation Detection in Indonesia Using Sentinel-2 and Landsat-8 Optical Satellite Imagery (Case Study: Rokan Hulu Regency, Riau Province). Intl J Remote Sens Earth Sci (IJReSES) 18:1–18. https://doi.org/10.30536/j.ijreses.2021.v18.a3537
Park Y, Guldmann JM (2019) Creating 3D city models with building footprints and LIDAR point cloud classification: A machine learning approach. Comput Environ Urban Syst 75:76–89
Park Y, Guldmann JM, Liu D (2021) Impacts of tree and building shades on the urban heat island: Combining remote sensing, 3D digital city and spatial regression approaches. Comput Environ Urban Syst 88:101655
Pham M-T, Courtrai L, Friguet C, Lefèvre S, Baussard A (2020) YOLO-Fine: One-Stage Detector of Small Objects Under Various Backgrounds in Remote Sensing Images. Remote Sens (Basel) 12, 2501
Putri SR, Wijayanto AW, Sakti AD (2022) Developing Relative Spatial Poverty Index Using Integrated Remote Sensing and Geospatial Big Data Approach: A Case Study of East Java Indonesia. ISPRS Int J Geoinf 11:275
Putri SR, Wijayanto AW (2022) Learning Bayesian Network for Rainfall Prediction Modeling in Urban Area using Remote Sensing Satellite Data (Case Study: Jakarta, Indonesia). in International Conference on Data Science and Official Statistics 77–90
Redmon J, Farhadi A (2018) YOLOv3: an incremental improvement. arXiv:1804.02767. https://doi.org/10.48550/arXiv.1804.02767
Redmon J, Divvala S, Girshick R, Farhadi A (2015) You only look once: unified, real-time object detection. arXiv:1506.02640. https://doi.org/10.48550/arXiv.1506.02640
Ren Y, Zhu C, Xiao S (2018) Small Object Detection in Optical Remote Sensing Images via Modified Faster R-CNN. Appl Sci 8:813
Rongshuang F, Yang C, Qiheng X, Jingxue W (2019) A high-resolution remote sensing image building extraction method based on deep learning. Acta Geodaetica Et Cartographica Sinica 48:34
Rylatt M, Gadsden S, Lomas K (2001) GIS-based decision support for solar energy planning in urban environments. Comput Environ Urban Syst 25:579–603
Saadi TDT, Wijayanto AW (2021) Machine learning applied to sentinel-2 and landsat-8 multispectral and medium-resolution satellite imagery for the detection of rice production areas in Nganjuk, East Java, Indonesia. Intl J Remote Sens Earth Sci (IJReSES) 18:19. https://doi.org/10.30536/j.ijreses.2021.v18.a3538
Singh G, Jouppi M, Zhang Z, Zakhor A (2015) Shadow based building extraction from single satellite image. in (eds. Bouman, C. A. & Sauer, K. D.) 94010F. doi:https://doi.org/10.1117/12.2083500
SpaceNet on Amazon Web Services (AWS) (2018) ‘Datasets.’ The SpaceNet Catalog. https://spacenet.ai/datasets/. Accessed 12 Feb 2022
Tian D, Han Y, Wang B, Guan T, Gu H, Wei W (2021) Review of object instance segmentation based on deep learning. J Electron Imaging 31. https://doi.org/10.1117/1.JEI.31.4.041205
Triscowati DW, Sartono B, Kurnia A, Domiri DD, Wijayanto AW (2019) Multitemporal remote sensing data for classification of food crops plant phase using supervised random forest. in Sixth Geoinformation Science Symposium (eds. Wibowo, S. B. et al.) 10 (SPIE). https://doi.org/10.1117/12.2547216
Triscowati DW, Sartono B, Kurnia A, Dirgahayu D, Wijayanto AW (2020) Classification of rice-plant growth phase using supervised random forest method based on landsat-8 multitemporal data. Intl J Remote Sens Earth Sci (IJReSES) 16:187. https://doi.org/10.30536/j.ijreses.2019.v16.a3217
van Etten A, Lindenbaum D, Bacastow TM (2018) SpaceNet: a remote sensing dataset and challenge series. arXiv:1807.01232. https://doi.org/10.48550/arXiv.1807.01232
Wen Q et al (2019) Automatic Building Extraction from Google Earth Images under Complex Backgrounds Based on Deep Instance Segmentation Network. Sensors 19:333
Wijayanto AW, Wahyu Triscowati D, Marsuhandi AH (2020) Maize field area detection in East Java, Indonesia: An integrated multispectral remote sensing and machine learning approach. in 2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE) 168–173 (IEEE). https://doi.org/10.1109/ICITEE49829.2020.9271683
Wijayanto AW, Afira N, Nurkarim W (2022) Machine Learning Approaches using Satellite Data for Oil Palm Area Detection in Pekanbaru City, Riau. in 2022 IEEE International Conference on Cybernetics and Computational Intelligence (CyberneticsCom) 84–89 (IEEE). https://doi.org/10.1109/CyberneticsCom55287.2022.9865301
Xie Y, Cai J, Bhojwani R, Shekhar S, Knight J (2020) A locally-constrained YOLO framework for detecting small and densely-distributed building footprints. Int J Geogr Inf Sci 34:777–801
Xiong W. et al. (2018) The Microsoft 2017 Conversational Speech Recognition System. in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 5934–5938 (IEEE). https://doi.org/10.1109/ICASSP.2018.8461870.
Acknowledgements
The authors are grateful to acknowledge the support from Statistics Indonesia (BPS) and Politeknik Statistika STIS, and would like to thank for Van Etten and DigitalGlobe for providing the satellite imagery very high-resolution WorldView-3 for the building footprint mapping challenge (SpaceNet at Amazon Web Services (AWS). “Datasets.” The SpaceNet Catalog. Last modified October 1st, 2018. Accessed on [December 18th, 2021]. https://spacenet.ai/datasets/)
Funding
Not applicable.
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception, design and analysis. Data collection, experimental simulations, and writing of original draft preparation were performed by Wahidya Nurkarim. Selecting the methodology, supervising the project, review and editing the manuscript were performed by Arie Wahyu Wijayanto. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no known competing financial or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Communicated by: H. Babaie
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Nurkarim, W., Wijayanto, A.W. Building footprint extraction and counting on very high-resolution satellite imagery using object detection deep learning framework. Earth Sci Inform 16, 515–532 (2023). https://doi.org/10.1007/s12145-022-00895-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12145-022-00895-4