Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3357384.3358001acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article
Public Access

Unsupervised Representation Learning of Spatial Data via Multimodal Embedding

Published: 03 November 2019 Publication History

Abstract

Increasing urbanization across the globe has coincided with greater access to urban data; this enables researchers and city administrators with better tools to understand urban dynamics, such as crime, traffic, and living standards. In this paper, we study the Learning an Embedding Space for Regions (LESR) problem, wherein we aim to produce vector representations of discrete regions. Recent studies have shown that embedding geospatial regions in a latent vector space can be useful in a variety of urban computing tasks. However, previous studies do not consider regions across multiple modalities in an end-to-end framework. We argue that doing so facilitates the learning of greater semantic relationships among regions. We propose a novel method, RegionEncoder, that jointly learns region representations from satellite image, point-of-interest, human mobility, and spatial graph data. We demonstrate that these region embeddings are useful as features in two regression tasks and across two distinct urban environments. Additionally, we perform an ablation study that evaluates each major architectural component. Finally, we qualitatively explore the learned embedding space, and show that semantic relationships are discovered across modalities

References

[1]
Guillaume Alain and Yoshua Bengio. 2013. What Regularized Auto-Encoders Learn from the Data Generating Distribution. In ICLR'13. https://arxiv.org/pdf/ 1211.4246.pdf
[2]
Adrian Albert, Jasleen Kaur, and Marta C. Gonzalez. 2017. Using Convolutional Networks and Satellite Imagery to Identify Patterns in Urban Environments at a Large Scale. In Proceedings of the 23nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM.
[3]
Yoshua Bengio, Pascal Lamblin, Dan Popovici, and Hugo Larochelle. 2007. Greedy layer-wise training of deep networks. In Advances in Neural Information Processing Systems (NIPS).
[4]
Michael Defferrard, Xavier Bressson, and Pierre Vandergheynst. 2016. Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering. In Advances in neural information processing systems (NIPS).
[5]
Foursquare. 2015. Foursquare Venues Service. (2015). https://developer. foursquare.com/overview/venues.html
[6]
Andrea Frome, Greg Corrado, Jonathon Shlens, Samy Bengio, Jeffrey Dean, Marc'Aurelio Ranzato, and Tomas Mikolov. 2013. DeViSE: A Deep Visual- Semantic Embedding Model. In Advances in Neural Information Processing Systems (NIPS).
[7]
Yanjie Fu, Pengyang Wang, Le Wu, and Xiaolin Li. 2019. Efficient Region Embedding with Multi-view Spatial Networks: A Perspective of Locality-Constrained Spatial Autocorrelations. In Proceedings of the Association for the Advancement of Artificial Intelligence (AAAI).
[8]
Ian Goodfellow, Yoshua Bengio, and Aaron Courville. 2016. Deep Learning. MIT Press. http://www.deeplearningbook.org.
[9]
Google. 2018. Google Static Maps. (2018). https://developers.google.com/maps/ documentation/maps-static/intro
[10]
Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable Feature Learning for Networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
[11]
Mikael Henaff, Joan Bruna, and Yann LeCunn. 2015. Deep Convolutional Networks of Graph-Structured Data. In Advances in neural information processing systems (NIPS).
[12]
Neal Jean, Sherrie Wang, Anshul Samara, George Azzari, David Lobell, and Stefano Ermon. 2019. Tile2Vec: Unsupervised Representation Learning for Spatially Distributed Data. In Proceedings of the Association for the Advancement of Artificial Intelligence (AAAI).
[13]
Thomas Kipf and Max Welling. 2017. Semi-supervised Classification With Graph Convolutional Networks. In ICLR'17.
[14]
Angeliki Lazaridou, Nghia The Pham, and Marco Baroni. 2015. Combining Language and Vision with a Multimodal Skip-gram Model. In HLT-NAACL.
[15]
Junhua Mao, Jiajing Xu, Yushi Jing, and Alan Yuille. 2016. Training and Evaluation Multimodal Word Embeddings with Large-scale Web Annotated Images. In Advances in Neural Information Processing Systems (NIPS).
[16]
Tomas Mikilov, Ilya Sutskever, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Distributed Representations of Words and Phrases and Their Compositionality. In Advances in Neural Information Processing Systems (NIPS).
[17]
Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. DeepWalk: Online Learning of Social Representations. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '14). ACM, New York, NY, USA, 701--710. https://doi.org/10.1145/2623330.2623732
[18]
Leo Breiman Statistics and Leo Breiman. 2001. Random Forests. In Machine Learning. 5--32.
[19]
Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. 2015. LINE: Large-scale Information Network Embedding. In WWW. ACM.
[20]
Open Data Team. 2018. New York City Data Dashboard. (2018). https://opendata. cityofnewyork.us/dashboard/
[21]
Open Data Team. 2019. Chicago Data Portal. (2019). https://data.cityofchicago. org/
[22]
Nicholas Thompson and Ian Bremmer. 2018. Divided we Fail. Wired (November 2018).
[23]
Waldo Tobler. 1970. A computer movie simulating urban growth in the Detroit region. (1970), 234--240.
[24]
Robert Tibshirani Trevor Hastie and Jerome Friedman. 2009. The Elements of Statistical Learning. Springer, New York, NY.
[25]
Hongjian Wang, Daniel Kifer, Corina Graif, and Zhenhui Li. 2016. Crime rate inference with big data. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, 635--644.
[26]
Hongjian Wang and Zhenhui Li. 2017. Region Representation Learning Via Mobility Flow. In Proceedings of CIKM'17. 10.
[27]
Pengyang Wang, Yanjie Fu, Jiawei Zhang, Pengfei Wang, Yu Zheng, and Charu Aggarwal. 2018. You Are How You Drive: Peer and Temporal-Aware Representation Learning for Driving Behavior Analysis. In Proceedings of the 24nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, 2457--2466.
[28]
Hua Wei, Guanjie Zheng, Huaxiu Yao, and Zhenhui Li. 2018. IntelliLight: A Reinforcement Learning Approach for Intelligent Traffic Light Control. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 2496--2505.
[29]
Xiuwen Yi, Junbo Zhang, ZhaoyuanWang, Tianrui Li, and Yu Zheng. 2018. Deep Distributed Fusion Network for Air Quality Prediction. In Proceedings of the 24nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, 965--973.
[30]
Zillow. 2019. Real Estate Valuations in Chicago and New York. (2019). https: //www.zillow.com/

Cited By

View all
  • (2025)Deep learning for cross-domain data fusion in urban computingInformation Fusion10.1016/j.inffus.2024.102606113:COnline publication date: 1-Jan-2025
  • (2024)Dwell Time Analytics for Understanding Place SimilarityProceedings of the 2024 7th International Conference on Geoinformatics and Data Analysis10.1145/3678599.3678603(7-13)Online publication date: 19-Apr-2024
  • (2024)Latent Representation Learning for Geospatial EntitiesACM Transactions on Spatial Algorithms and Systems10.1145/366347410:4(1-31)Online publication date: 2-May-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '19: Proceedings of the 28th ACM International Conference on Information and Knowledge Management
November 2019
3373 pages
ISBN:9781450369763
DOI:10.1145/3357384
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 November 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. multimodal embeding
  2. representation learning
  3. satellite imagery
  4. spatial data

Qualifiers

  • Research-article

Funding Sources

Conference

CIKM '19
Sponsor:

Acceptance Rates

CIKM '19 Paper Acceptance Rate 202 of 1,031 submissions, 20%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)732
  • Downloads (Last 6 weeks)71
Reflects downloads up to 27 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2025)Deep learning for cross-domain data fusion in urban computingInformation Fusion10.1016/j.inffus.2024.102606113:COnline publication date: 1-Jan-2025
  • (2024)Dwell Time Analytics for Understanding Place SimilarityProceedings of the 2024 7th International Conference on Geoinformatics and Data Analysis10.1145/3678599.3678603(7-13)Online publication date: 19-Apr-2024
  • (2024)Latent Representation Learning for Geospatial EntitiesACM Transactions on Spatial Algorithms and Systems10.1145/366347410:4(1-31)Online publication date: 2-May-2024
  • (2024)A Generic Machine Learning Model for Spatial Query Optimization based on Spatial EmbeddingsACM Transactions on Spatial Algorithms and Systems10.1145/365763310:4(1-33)Online publication date: 13-Apr-2024
  • (2024)A Novel Framework for Joint Learning of City Region Partition and RepresentationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365285720:7(1-23)Online publication date: 17-Mar-2024
  • (2024)UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the WebProceedings of the ACM Web Conference 202410.1145/3589334.3645378(4006-4017)Online publication date: 13-May-2024
  • (2024)Mobile Traffic Time Series: Urban Region Representations and Synthetic Generation2024 25th IEEE International Conference on Mobile Data Management (MDM)10.1109/MDM61037.2024.00055(262-264)Online publication date: 24-Jun-2024
  • (2024)Urban Region Representation Learning with Attentive Fusion2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00336(4409-4421)Online publication date: 13-May-2024
  • (2024)Census2Vec: Enhancing Socioeconomic Predictive Models with Geo-Embedded DataIntelligent Systems and Applications10.1007/978-3-031-66431-1_44(626-640)Online publication date: 31-Jul-2024
  • (2024)A Novel Framework for Spatiotemporal POI AnalysisWeb and Wireless Geographical Information Systems10.1007/978-3-031-60796-7_2(23-40)Online publication date: 18-Jun-2024
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media