Towards a foundation model for geospatial artificial intelligence (vision paper)

G Mai, C Cundy, K Choi, Y Hu, N Lao… - Proceedings of the 30th …, 2022 - dl.acm.org
Proceedings of the 30th International Conference on Advances in Geographic …, 2022dl.acm.org
Large pre-trained models, also known as foundation models (FMs), are trained in a task-
agnostic manner on large-scale data and can be adapted to a wide range of downstream
tasks by fine tuning, few-shot, or even zero-shot learning. Despite their successes in
language and vision tasks, we have yet to see an attempt to develop foundation models for
geospatial artificial intelligence (GeoAI). In this work, we explore the promises and
challenges for developing multimodal foundation models for GeoAI. We first show the …
Large pre-trained models, also known as foundation models (FMs), are trained in a task-agnostic manner on large-scale data and can be adapted to a wide range of downstream tasks by fine tuning, few-shot, or even zero-shot learning. Despite their successes in language and vision tasks, we have yet to see an attempt to develop foundation models for geospatial artificial intelligence (GeoAI). In this work, we explore the promises and challenges for developing multimodal foundation models for GeoAI. We first show the advantages of this idea by testing the performance of existing Large pre-trained Language Models (LLMs) (e.g. GPT-2 and GPT-3) on two geospatial semantics tasks. Results indicate that these task-agnostic LLMs can outperform task-specific fully-supervised models on both tasks with 2--9% improvement in a few-shot learning setting. However, we also show the limitations of these existing foundation models given the multimodality nature of GeoAI, especially when dealing with geometries in conjunction with other modalities. So we discuss the possibility of a multimodal foundation model which can reason over various types of geospatial data through geospatial alignments. We conclude this paper by discussing the unique risks and challenges to develop such model for GeoAI.
ACM Digital Library