Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Earth Imagery Segmentation on Terrain Surface with Limited Training Labels: A Semi-supervised Approach based on Physics-Guided Graph Co-Training

Published: 05 January 2022 Publication History

Abstract

Given earth imagery with spectral features on a terrain surface, this paper studies surface segmentation based on both explanatory features and surface topology. The problem is important in many spatial and spatiotemporal applications such as flood extent mapping in hydrology. The problem is uniquely challenging for several reasons: first, the size of earth imagery on a terrain surface is often much larger than the input of popular deep convolutional neural networks; second, there exists topological structure dependency between pixel classes on the surface, and such dependency can follow an unknown and non-linear distribution; third, there are often limited training labels. Existing methods for earth imagery segmentation often divide the imagery into patches and consider the elevation as an additional feature channel. These methods do not fully incorporate the spatial topological structural constraint within and across surface patches and thus often show poor results, especially when training labels are limited. Existing methods on semi-supervised and unsupervised learning for earth imagery often focus on learning representation without explicitly incorporating surface topology. In contrast, we propose a novel framework that explicitly models the topological skeleton of a terrain surface with a contour tree from computational topology, which is guided by the physical constraint (e.g., water flow direction on terrains). Our framework consists of two neural networks: a convolutional neural network (CNN) to learn spatial contextual features on a 2D image grid, and a graph neural network (GNN) to learn the statistical distribution of physics-guided spatial topological dependency on the contour tree. The two models are co-trained via variational EM. Evaluations on the real-world flood mapping datasets show that the proposed models outperform baseline methods in classification accuracy, especially when training labels are limited.

References

[1]
Bjoern Andres, Jörg H. Kappes, Thorsten Beier, Ullrich Köthe, and Fred A. Hamprecht. 2011. Probabilistic image segmentation with closedness constraints. In 2011 International Conference on Computer Vision. IEEE, 2611–2618.
[2]
Vijay Badrinarayanan, Alex Kendall, and Roberto Cipolla. 2017. SegNet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 12 (2017), 2481–2495.
[3]
Jonathan T. Barron and Ben Poole. 2016. The fast bilateral solver. In European Conference on Computer Vision. Springer, 617–632.
[4]
Christopher M. Bishop. 2006. Pattern Recognition and Machine Learning. Springer.
[5]
Hamish Carr, Jack Snoeyink, and Ulrike Axen. 2003. Computing contour trees in all dimensions. Computational Geometry 24, 2 (2003), 75–94.
[6]
Chao Chen, Daniel Freedman, and Christoph H. Lampert. 2011. Enforcing topological constraints in random field image segmentation. In CVPR 2011. IEEE, 2089–2096.
[7]
Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille. 2017. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Transactions on Pattern Analysis and Machine Intelligence 40, 4 (2017), 834–848.
[8]
Liang-Chieh Chen, George Papandreou, Florian Schroff, and Hartwig Adam. 2017. Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587 (2017).
[9]
Liang-Chieh Chen, Yukun Zhu, George Papandreou, Florian Schroff, and Hartwig Adam. 2018. Encoder-decoder with atrous separable convolution for semantic image segmentation. In Proceedings of the European Conference on Computer Vision (ECCV). 801–818.
[10]
Wuyang Chen, Ziyu Jiang, Zhangyang Wang, Kexin Cui, and Xiaoning Qian. 2019. Collaborative global-local networks for memory-efficient segmentation of ultra-high resolution images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 8924–8933.
[11]
Michaël Defferrard, Xavier Bresson, and Pierre Vandergheynst. 2016. Convolutional neural networks on graphs with fast localized spectral filtering. In Advances in Neural Information Processing Systems. 3844–3852.
[12]
Emre Eftelioglu, Zhe Jiang, Reem Ali, and Shashi Shekhar. 2016. Spatial computing perspective on food energy and water nexus. Journal of Environmental Studies and Sciences 6, 1 (2016), 62–76.
[13]
Emre Eftelioglu, Zhe Jiang, Xun Tang, and Shashi Shekhar. 2017. The nexus of food, energy, and water resources: Visions and challenges in spatial computing. In Advances in Geocomputation. Springer, 5–20.
[14]
Caner Hazirbas, Lingni Ma, Csaba Domokos, and Daniel Cremers. 2016. Fusenet: Incorporating depth into semantic segmentation via fusion-based CNN architecture. In Asian Conference on Computer Vision. Springer, 213–228.
[15]
Wenchong He and Zhe Jiang. 2020. Semi-supervised learning with the EM algorithm: A comparative study between unstructured and structured prediction. IEEE Transactions on Knowledge and Data Engineering.
[16]
Wenchong He, Arpan Man Sainju, Zhe Jiang, and Da Yan. 2021. Deep neural network for 3D surface segmentation based on contour tree hierarchy. In Proceedings of the 2021 SIAM International Conference on Data Mining (SDM). SIAM, 253–261.
[17]
Yufan He, Aaron Carass, Yihao Liu, Bruno M. Jedynak, Sharon D. Solomon, Shiv Saidha, Peter A. Calabresi, and Jerry L. Prince. 2019. Deep learning based topology guaranteed surface and MME segmentation of multiple sclerosis subjects from retinal OCT. Biomedical Optics Express 10, 10 (2019), 5042–5058.
[18]
Xiaoling Hu, Fuxin Li, Dimitris Samaras, and Chao Chen. 2019. Topology-preserving deep image segmentation. In Advances in Neural Information Processing Systems. 5658–5669.
[19]
Zhe Jiang. 2018. A survey on spatial prediction methods. IEEE Transactions on Knowledge and Data Engineering 31, 9 (2018), 1645–1664.
[20]
Zhe Jiang. 2020. Spatial structured prediction models: Applications, challenges, and techniques. IEEE Access 8 (2020), 38714–38727.
[21]
Zhe Jiang and Arpan Man Sainju. 2019. Hidden Markov contour tree: A spatial structured model for hydrological applications. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 804–813.
[22]
Zhe Jiang and Arpan Man Sainju. 2021. A hidden Markov tree model for flood extent mapping in heavily vegetated areas based on high resolution aerial imagery and DEM: A case study on Hurricane Matthew floods. International Journal of Remote Sensing 42, 3 (2021), 1160–1179.
[23]
Zhe Jiang and Shashi Shekhar. 2017. Spatial Big Data Science. Schweiz: Springer International Publishing AG.
[24]
Zhe Jiang, Miao Xie, and Arpan Man Sainju. 2021. Geographical hidden Markov tree. IEEE Transactions on Knowledge and Data Engineering 33, 2 (2021), 506–520.
[25]
Tarun Kalluri, Girish Varma, Manmohan Chandraker, and C. V. Jawahar. 2019. Universal semi-supervised semantic segmentation. In Proceedings of the IEEE International Conference on Computer Vision. 5259–5270.
[26]
Anuj Karpatne, Zhe Jiang, Ranga Raju Vatsavai, Shashi Shekhar, and Vipin Kumar. 2016. Monitoring land-cover changes: A machine-learning perspective. IEEE Geoscience and Remote Sensing Magazine 4, 2 (2016), 8–21.
[27]
Thomas N. Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).
[28]
Andrew Lang, Aaron Carass, Ava K. Bittner, Howard S. Ying, and Jerry L. Prince. 2017. Improving graph-based OCT segmentation for severe pathology in Retinitis Pigmentosa patients. In Medical Imaging 2017: Biomedical Applications in Molecular, Structural, and Functional Imaging, Vol. 10137. International Society for Optics and Photonics, 101371M.
[29]
Yaguang Li, Rose Yu, Cyrus Shahabi, and Yan Liu. 2018. Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings. OpenReview.net.
[30]
Jonathan Long, Evan Shelhamer, and Trevor Darrell. 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3431–3440.
[31]
Lingni Ma, Jörg Stückler, Christian Kerl, and Daniel Cremers. 2017. Multi-view deep learning for consistent semantic mapping with rgb-d cameras. In 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 598–605.
[32]
Shervin Minaee, Yuri Boykov, Fatih Porikli, Antonio Plaza, Nasser Kehtarnavaz, and Demetri Terzopoulos. 2020. Image segmentation using deep learning: a survey. arXiv preprint arXiv:2001.05566 (2020).
[33]
Agata Mosinska, Pablo Marquez-Neila, Mateusz Koziński, and Pascal Fua. 2018. Beyond the pixel-wise loss for topology-aware delineation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3136–3145.
[34]
National Oceanic and Atmospheric Administration. [n.d.]. Data and Imagery from NOAA’s National Geodetic Survey. https://www.ngs.noaa.gov.
[35]
National Oceanic and Atmospheric Administration. 2018. National Water Model: Improving NOAA’s Water Prediction Services. http://water.noaa.gov/documents/wrn-national-water-model.pdf.
[36]
NCSU Libraries. 2018. LIDAR Based Elevation Data for North Carolina. https://www.lib.ncsu.edu/gis/elevation.
[37]
Radford M. Neal and Geoffrey E. Hinton. 1998. A view of the EM algorithm that justifies incremental, sparse, and other variants. In Learning in Graphical Models. Springer, 355–368.
[38]
Hyeonwoo Noh, Seunghoon Hong, and Bohyung Han. 2015. Learning deconvolution network for semantic segmentation. In Proceedings of the IEEE International Conference on Computer Vision. 1520–1528.
[39]
Sebastian Nowozin and Christoph H. Lampert. 2009. Global connectivity potentials for random field models. In 2009 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 818–825.
[40]
Manfred Opper and David Saad. 2001. Advanced Mean Field Methods: Theory and Practice. MIT press.
[41]
Yassine Ouali, Céline Hudelot, and Myriam Tami. 2020. Semi-supervised semantic segmentation with cross-consistency training. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 12674–12684.
[42]
Valerio Pascucci, Giorgio Scorzelli, Peer-Timo Bremer, and Ajith Mascarenhas. 2007. Robust on-line computation of Reeb graphs: Simplicity and speed. In ACM SIGGRAPH 2007 papers. 58–es.
[43]
Meng Qu, Yoshua Bengio, and Jian Tang. 2019. GMNN: Graph Markov neural networks. arXiv preprint arXiv:1905.06214 (2019).
[44]
Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-assisted Intervention. Springer, 234–241.
[45]
Arpan Man Sainju, Wenchong He, and Zhe Jiang. 2020. A hidden Markov contour tree model for spatial structured prediction. IEEE Transactions on Knowledge and Data Engineering.
[46]
Arpan Man Sainju, Wenchong He, Zhe Jiang, and Da Yan. 2020. Spatial classification with limited observations based on physics-aware structural constraint. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 898–905.
[47]
Arpan Man Sainju, Wenchong He, Zhe Jiang, Da Yan, and Haiquan Chen. 2021. Flood inundation mapping with limited observations based on physics-aware topography constraint. Frontiers in Big Data 4 (2021), 47. https://doi.org/10.3389/fdata.2021.707951
[48]
Shashi Shekhar, Zhe Jiang, Reem Y. Ali, Emre Eftelioglu, Xun Tang, Venkata Gunturi, and Xun Zhou. 2015. Spatiotemporal data mining: A computational perspective. ISPRS International Journal of Geo-Information 4, 4 (2015), 2306–2338.
[49]
Sara Vicente, Vladimir Kolmogorov, and Carsten Rother. 2008. Graph cut based image segmentation with connectivity priors. In 2008 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 1–8.
[50]
Jinghua Wang, Zhenhua Wang, Dacheng Tao, Simon See, and Gang Wang. 2016. Learning common and specific features for RGB-D semantic segmentation with deconvolutional networks. In European Conference on Computer Vision. Springer, 664–679.
[51]
Senzhang Wang, Jiannong Cao, and Philip Yu. 2020. Deep learning for spatio-temporal data mining: A survey. IEEE Transactions on Knowledge and Data Engineering.
[52]
Weiyue Wang and Ulrich Neumann. 2018. Depth-aware CNN for RGB-D segmentation. In Proceedings of the European Conference on Computer Vision (ECCV). 135–150.
[53]
Miao Xie, Zhe Jiang, and Arpan Man Sainju. 2018. Geographical hidden Markov tree for flood extent mapping. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (London, United Kingdom) (KDD’18). ACM, New York, NY, USA, 2545–2554.
[54]
Hengshuang Zhao, Xiaojuan Qi, Xiaoyong Shen, Jianping Shi, and Jiaya Jia. 2018. ICNet for real-time semantic segmentation on high-resolution images. In Proceedings of the European Conference on Computer Vision (ECCV). 405–420.
[55]
Leixin Zhou, Zisha Zhong, Abhay Shah, Bensheng Qiu, John Buatti, and Xiaodong Wu. 2019. Deep neural networks for surface segmentation meet conditional random fields. arXiv preprint arXiv:1906.04714 (2019).

Cited By

View all
  • (2024)Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented GenerationProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657673(2749-2754)Online publication date: 10-Jul-2024
  • (2024)Confidence Trigger Detection: Accelerating Real-Time Tracking-by-Detection Systems2024 5th International Conference on Electronic Communication and Artificial Intelligence (ICECAI)10.1109/ICECAI62591.2024.10674884(587-592)Online publication date: 31-May-2024
  • (2024)Generative deep learning for data generation in natural hazard analysis: motivations, advances, challenges, and opportunitiesArtificial Intelligence Review10.1007/s10462-024-10764-957:6Online publication date: 30-May-2024

Index Terms

  1. Earth Imagery Segmentation on Terrain Surface with Limited Training Labels: A Semi-supervised Approach based on Physics-Guided Graph Co-Training

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Transactions on Intelligent Systems and Technology
          ACM Transactions on Intelligent Systems and Technology  Volume 13, Issue 2
          April 2022
          392 pages
          ISSN:2157-6904
          EISSN:2157-6912
          DOI:10.1145/3508464
          • Editor:
          • Huan Liu
          Issue’s Table of Contents

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 05 January 2022
          Accepted: 01 August 2021
          Revised: 01 March 2021
          Received: 01 December 2020
          Published in TIST Volume 13, Issue 2

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. Earth image segmentation
          2. terrain surface
          3. contour tree
          4. physic-guided
          5. semi-supervised

          Qualifiers

          • Research-article
          • Refereed

          Funding Sources

          • National Science Foundation (NSF)
          • National Oceanic and Atmospheric Administration (NOAA)
          • Microsoft AI for Earth Grant
          • Extreme Science and Engineering Discovery Environment (XSEDE)

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)77
          • Downloads (Last 6 weeks)13
          Reflects downloads up to 04 Oct 2024

          Other Metrics

          Citations

          Cited By

          View all
          • (2024)Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented GenerationProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657673(2749-2754)Online publication date: 10-Jul-2024
          • (2024)Confidence Trigger Detection: Accelerating Real-Time Tracking-by-Detection Systems2024 5th International Conference on Electronic Communication and Artificial Intelligence (ICECAI)10.1109/ICECAI62591.2024.10674884(587-592)Online publication date: 31-May-2024
          • (2024)Generative deep learning for data generation in natural hazard analysis: motivations, advances, challenges, and opportunitiesArtificial Intelligence Review10.1007/s10462-024-10764-957:6Online publication date: 30-May-2024
          • (2023)Spatial Knowledge-Infused Hierarchical Learning: An Application in Flood Mapping on Earth ImageryProceedings of the 31st ACM International Conference on Advances in Geographic Information Systems10.1145/3589132.3625591(1-10)Online publication date: 13-Nov-2023

          View Options

          Get Access

          Login options

          Full Access

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Full Text

          View this article in Full Text.

          Full Text

          HTML Format

          View this article in HTML Format.

          HTML Format

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media