research-article

Public Access

Unsupervised Representation Learning of Spatial Data via Multimodal Embedding

Authors:

Porter Jenkins,

Zhenhui LiAuthors Info & Claims

CIKM '19: Proceedings of the 28th ACM International Conference on Information and Knowledge Management

Pages 1993 - 2002

https://doi.org/10.1145/3357384.3358001

Published: 03 November 2019 Publication History

Abstract

Increasing urbanization across the globe has coincided with greater access to urban data; this enables researchers and city administrators with better tools to understand urban dynamics, such as crime, traffic, and living standards. In this paper, we study the Learning an Embedding Space for Regions (LESR) problem, wherein we aim to produce vector representations of discrete regions. Recent studies have shown that embedding geospatial regions in a latent vector space can be useful in a variety of urban computing tasks. However, previous studies do not consider regions across multiple modalities in an end-to-end framework. We argue that doing so facilitates the learning of greater semantic relationships among regions. We propose a novel method, RegionEncoder, that jointly learns region representations from satellite image, point-of-interest, human mobility, and spatial graph data. We demonstrate that these region embeddings are useful as features in two regression tasks and across two distinct urban environments. Additionally, we perform an ablation study that evaluates each major architectural component. Finally, we qualitatively explore the learned embedding space, and show that semantic relationships are discovered across modalities

References

[1]

Guillaume Alain and Yoshua Bengio. 2013. What Regularized Auto-Encoders Learn from the Data Generating Distribution. In ICLR'13. https://arxiv.org/pdf/ 1211.4246.pdf

[2]

Adrian Albert, Jasleen Kaur, and Marta C. Gonzalez. 2017. Using Convolutional Networks and Satellite Imagery to Identify Patterns in Urban Environments at a Large Scale. In Proceedings of the 23nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM.

[3]

Yoshua Bengio, Pascal Lamblin, Dan Popovici, and Hugo Larochelle. 2007. Greedy layer-wise training of deep networks. In Advances in Neural Information Processing Systems (NIPS).

[4]

Michael Defferrard, Xavier Bressson, and Pierre Vandergheynst. 2016. Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering. In Advances in neural information processing systems (NIPS).

[5]

Foursquare. 2015. Foursquare Venues Service. (2015). https://developer. foursquare.com/overview/venues.html

[6]

Andrea Frome, Greg Corrado, Jonathon Shlens, Samy Bengio, Jeffrey Dean, Marc'Aurelio Ranzato, and Tomas Mikolov. 2013. DeViSE: A Deep Visual- Semantic Embedding Model. In Advances in Neural Information Processing Systems (NIPS).

[7]

Yanjie Fu, Pengyang Wang, Le Wu, and Xiaolin Li. 2019. Efficient Region Embedding with Multi-view Spatial Networks: A Perspective of Locality-Constrained Spatial Autocorrelations. In Proceedings of the Association for the Advancement of Artificial Intelligence (AAAI).

[8]

Ian Goodfellow, Yoshua Bengio, and Aaron Courville. 2016. Deep Learning. MIT Press. http://www.deeplearningbook.org.

Digital Library

[9]

Google. 2018. Google Static Maps. (2018). https://developers.google.com/maps/ documentation/maps-static/intro

[10]

Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable Feature Learning for Networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.

Digital Library

[11]

Mikael Henaff, Joan Bruna, and Yann LeCunn. 2015. Deep Convolutional Networks of Graph-Structured Data. In Advances in neural information processing systems (NIPS).

[12]

Neal Jean, Sherrie Wang, Anshul Samara, George Azzari, David Lobell, and Stefano Ermon. 2019. Tile2Vec: Unsupervised Representation Learning for Spatially Distributed Data. In Proceedings of the Association for the Advancement of Artificial Intelligence (AAAI).

[13]

Thomas Kipf and Max Welling. 2017. Semi-supervised Classification With Graph Convolutional Networks. In ICLR'17.

[14]

Angeliki Lazaridou, Nghia The Pham, and Marco Baroni. 2015. Combining Language and Vision with a Multimodal Skip-gram Model. In HLT-NAACL.

[15]

Junhua Mao, Jiajing Xu, Yushi Jing, and Alan Yuille. 2016. Training and Evaluation Multimodal Word Embeddings with Large-scale Web Annotated Images. In Advances in Neural Information Processing Systems (NIPS).

[16]

Tomas Mikilov, Ilya Sutskever, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Distributed Representations of Words and Phrases and Their Compositionality. In Advances in Neural Information Processing Systems (NIPS).

[17]

Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. DeepWalk: Online Learning of Social Representations. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '14). ACM, New York, NY, USA, 701--710. https://doi.org/10.1145/2623330.2623732

Digital Library

[18]

Leo Breiman Statistics and Leo Breiman. 2001. Random Forests. In Machine Learning. 5--32.

[19]

Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. 2015. LINE: Large-scale Information Network Embedding. In WWW. ACM.

Digital Library

[20]

Open Data Team. 2018. New York City Data Dashboard. (2018). https://opendata. cityofnewyork.us/dashboard/

[21]

Open Data Team. 2019. Chicago Data Portal. (2019). https://data.cityofchicago. org/

[22]

Nicholas Thompson and Ian Bremmer. 2018. Divided we Fail. Wired (November 2018).

[23]

Waldo Tobler. 1970. A computer movie simulating urban growth in the Detroit region. (1970), 234--240.

[24]

Robert Tibshirani Trevor Hastie and Jerome Friedman. 2009. The Elements of Statistical Learning. Springer, New York, NY.

[25]

Hongjian Wang, Daniel Kifer, Corina Graif, and Zhenhui Li. 2016. Crime rate inference with big data. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, 635--644.

Digital Library

[26]

Hongjian Wang and Zhenhui Li. 2017. Region Representation Learning Via Mobility Flow. In Proceedings of CIKM'17. 10.

Digital Library

[27]

Pengyang Wang, Yanjie Fu, Jiawei Zhang, Pengfei Wang, Yu Zheng, and Charu Aggarwal. 2018. You Are How You Drive: Peer and Temporal-Aware Representation Learning for Driving Behavior Analysis. In Proceedings of the 24nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, 2457--2466.

Digital Library

[28]

Hua Wei, Guanjie Zheng, Huaxiu Yao, and Zhenhui Li. 2018. IntelliLight: A Reinforcement Learning Approach for Intelligent Traffic Light Control. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 2496--2505.

Digital Library

[29]

Xiuwen Yi, Junbo Zhang, ZhaoyuanWang, Tianrui Li, and Yu Zheng. 2018. Deep Distributed Fusion Network for Air Quality Prediction. In Proceedings of the 24nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, 965--973.

Digital Library

[30]

Zillow. 2019. Real Estate Valuations in Chicago and New York. (2019). https: //www.zillow.com/

Cited By

Zou XYan YHao XHu YWen HLiu EZhang JLi YLi TZheng YLiang Y(2025)Deep learning for cross-domain data fusion in urban computingInformation Fusion10.1016/j.inffus.2024.102606113:COnline publication date: 1-Jan-2025
https://dl.acm.org/doi/10.1016/j.inffus.2024.102606
Zarbakhsh NMcArdle G(2024)Dwell Time Analytics for Understanding Place SimilarityProceedings of the 2024 7th International Conference on Geoinformatics and Data Analysis10.1145/3678599.3678603(7-13)Online publication date: 19-Apr-2024
https://dl.acm.org/doi/10.1145/3678599.3678603
Lee WLauw H(2024)Latent Representation Learning for Geospatial EntitiesACM Transactions on Spatial Algorithms and Systems10.1145/366347410:4(1-31)Online publication date: 2-May-2024
https://dl.acm.org/doi/10.1145/3663474
Show More Cited By

Index Terms

Unsupervised Representation Learning of Spatial Data via Multimodal Embedding
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Unsupervised learning
        Dimensionality reduction and manifold learning
    2. Machine learning approaches
      1. Learning latent representations

Recommendations

Beyond the First Law of Geography: Learning Representations of Satellite Imagery by Leveraging Point-of-Interests
WWW '22: Proceedings of the ACM Web Conference 2022

Satellite imagery depicts the earth’s surface remotely and provides comprehensive information for many applications, such as land use monitoring and urban planning. Existing studies on unsupervised representation learning for satellite images only take ...
Unifying Inter-region Autocorrelation and Intra-region Structures for Spatial Embedding via Collective Adversarial Learning
KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Unsupervised spatial representation learning aims to automatically identify effective features of geographic entities (i.e., regions) from unlabeled yet structural geographical data. Existing network embedding methods can partially address the problem ...
Unimodal and Multimodal Integrated Representation Learning via Improved Information Bottleneck for Multimodal Sentiment Analysis
Natural Language Processing and Chinese Computing
Abstract
Representation learning is a significant and challenging task in multimodal sentiment analysis (MSA). It aims to improve the performance of model by learning effective unimodal or multimodal representation. To obtain desired characteristics of ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

CIKM '19: Proceedings of the 28th ACM International Conference on Information and Knowledge Management

November 2019

3373 pages

ISBN:9781450369763

DOI:10.1145/3357384

General Chairs:
Wenwu Zhu
Tsinghua University, China
,
Dacheng Tao
University of Massachusetts, USA
,
Xueqi Cheng
Institute of Computing Technology, CAS, China
,
Program Chairs:
Peng Cui
Tsinghua University, China
,
Elke Rundensteiner
Worcester Polytechnic Institute, USA
,
David Carmel
Amazon Research, USA
,
Qi He
LinkedIn, USA
,
Jeffrey Xu Yu
Chinese University of Hong Kong, China

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 November 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Science Foundation

Conference

CIKM '19

Sponsor:

CIKM '19: The 28th ACM International Conference on Information and Knowledge Management

November 3 - 7, 2019

Beijing, China

Acceptance Rates

CIKM '19 Paper Acceptance Rate 202 of 1,031 submissions, 20%;

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Sponsor:
sigir
sigir

The 34th ACM International Conference on Information and Knowledge Management

November 10 - 14, 2025

Seoul , Republic of Korea

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

35
Total Citations
View Citations
2,370
Total Downloads

Downloads (Last 12 months)732
Downloads (Last 6 weeks)71

Reflects downloads up to 27 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Zou XYan YHao XHu YWen HLiu EZhang JLi YLi TZheng YLiang Y(2025)Deep learning for cross-domain data fusion in urban computingInformation Fusion10.1016/j.inffus.2024.102606113:COnline publication date: 1-Jan-2025
https://dl.acm.org/doi/10.1016/j.inffus.2024.102606
Zarbakhsh NMcArdle G(2024)Dwell Time Analytics for Understanding Place SimilarityProceedings of the 2024 7th International Conference on Geoinformatics and Data Analysis10.1145/3678599.3678603(7-13)Online publication date: 19-Apr-2024
https://dl.acm.org/doi/10.1145/3678599.3678603
Lee WLauw H(2024)Latent Representation Learning for Geospatial EntitiesACM Transactions on Spatial Algorithms and Systems10.1145/366347410:4(1-31)Online publication date: 2-May-2024
https://dl.acm.org/doi/10.1145/3663474
Belussi AMigliorini SEldawy A(2024)A Generic Machine Learning Model for Spatial Query Optimization based on Spatial EmbeddingsACM Transactions on Spatial Algorithms and Systems10.1145/365763310:4(1-33)Online publication date: 13-Apr-2024
https://dl.acm.org/doi/10.1145/3657633
Deng MZhang WZhao JWang ZZhou MLuo JChen C(2024)A Novel Framework for Joint Learning of City Region Partition and RepresentationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365285720:7(1-23)Online publication date: 17-Mar-2024
https://dl.acm.org/doi/10.1145/3652857
Yan YWen HZhong SChen WChen HWen QZimmermann RLiang YChua TNgo CKa-Wei Lee RKumar RLauw H(2024)UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the WebProceedings of the ACM Web Conference 202410.1145/3589334.3645378(4006-4017)Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3589334.3645378
Loddi G(2024)Mobile Traffic Time Series: Urban Region Representations and Synthetic Generation2024 25th IEEE International Conference on Mobile Data Management (MDM)10.1109/MDM61037.2024.00055(262-264)Online publication date: 24-Jun-2024
https://doi.org/10.1109/MDM61037.2024.00055
Sun FQi JChang YFan XKarunasekera STanin E(2024)Urban Region Representation Learning with Attentive Fusion2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00336(4409-4421)Online publication date: 13-May-2024
https://doi.org/10.1109/ICDE60146.2024.00336
Bevara RWagenvoord IHosseini FSharma HNunna VXiao T(2024)Census2Vec: Enhancing Socioeconomic Predictive Models with Geo-Embedded DataIntelligent Systems and Applications10.1007/978-3-031-66431-1_44(626-640)Online publication date: 31-Jul-2024
https://doi.org/10.1007/978-3-031-66431-1_44
Zarbakhsh NMcArdle G(2024)A Novel Framework for Spatiotemporal POI AnalysisWeb and Wireless Geographical Information Systems10.1007/978-3-031-60796-7_2(23-40)Online publication date: 18-Jun-2024
https://dl.acm.org/doi/10.1007/978-3-031-60796-7_2
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents