Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

A Novel Framework for Joint Learning of City Region Partition and Representation

Published: 16 May 2024 Publication History

Abstract

The proliferation of multimodal big data in cities provides unprecedented opportunities for modeling and forecasting urban problems, such as crime prediction and house price prediction, through data-driven approaches. A fundamental and critical issue in modeling and forecasting urban problems lies in identifying suitable spatial analysis units, also known as city region partition. Existing works rely on subjective domain knowledge for static partitions, which is general and universal for all tasks. In fact, different tasks may need different city region partitions. To address this issue, we propose JLPR, a task-oriented framework for Joint Learning of region Partition and Representation. To make partitions fit tasks, JLPR integrates the region partition into the representation model training and learns region partitions using the supervision signal from the downstream task. We evaluate the framework on two prediction tasks (i.e., crime prediction and housing price prediction) in Chicago. Experiments show that JLPR consistently outperforms state-of-the-art partitioning methods in both tasks, which achieves above 25% and 70% performance improvements in terms of mean absolute error for crime prediction and house price prediction tasks, respectively. Additionally, we meticulously undertake three visualization case studies, which yield profound and illuminating findings from diverse perspectives, demonstrating the remarkable effectiveness and superiority of our approach.

References

[1]
Sercan Ö Arık, Joel Shor, Rajarishi Sinha, Jinsung Yoon, Joseph R. Ledsam, Long T. Le, Michael W. Dusenberry, Nathanael C. Yoder, Kris Popendorf, Arkady Epshteyn, Johan Euphrosine, Elli Kanal, Isaac Jones, Chun-Liang Li, Beth Luan, Joe Mckenna, Vikas Menon, Shashank Singh, Mimi Sun, Ashwin Sura Ravi, Leyou Zhang, Dario Sava, Kane Cunningham, Hiroki Kayama, Thomas Tsai, Daisuke Yoneoka, Shuhei Nomura, Hiroaki Miyata, and Tomas Pfister. 2021. A prospective evaluation of AI-augmented epidemiology to forecast COVID-19 in the USA and Japan. NPJ Digital Medicine 4, 1 (2021), 146.
[2]
Filippo Maria Bianchi, Daniele Grattarola, and Cesare Alippi. 2020. Spectral clustering with graph neural networks for graph pooling. In Proceedings of the International Conference on Machine Learning. 874–883.
[3]
Umair Muneer Butt, Sukumar Letchmunan, Fadratul Hafinaz Hassan, Mubashir Ali, Anees Baqir, and Hafiz Husnain Raza Sherazi. 2020. Spatio-temporal crime hotspot detection and prediction: A systematic literature review. IEEE Access 8 (2020), 166553–166574.
[4]
Pablo Samuel Castro, Daqing Zhang, Chao Chen, Shijian Li, and Gang Pan. 2013. From taxi GPS traces to social and community dynamics: A survey. ACM Computing Surveys 46, 2 (2013), 1–34.
[5]
Chao Chen, Qiang Liu, Xingchen Wang, Chengwu Liao, and Daqing Zhang. 2021. semi-Traj2Graph: Identifying fine-grained driving style with GPS trajectory data via multi-task learning. IEEE Transactions on Big Data 8, 6 (2021), 1550–1565.
[6]
Chao Chen, Zhu Wang, and Bin Guo. 2016. The road to the Chinese smart city: Progress, challenges, and future directions. IT Professional 18, 1 (2016), 14–17.
[7]
Mingyu Deng, Wei Yang, Chao Chen, and Chenxi Liu. 2022. Exploring associations between streetscape factors and crime behaviors using Google Street View images. Frontiers of Computer Science 16, 4 (2022), 164316.
[8]
Alexandre Duval and Fragkiskos Malliaros. 2022. Higher-order clustering and pooling for graph neural networks. In Proceedings of the 31st ACM International Conference on Information and Knowledge Management. 426–435.
[9]
Yanjie Fu, Yong Ge, Yu Zheng, Zijun Yao, Yanchi Liu, Hui Xiong, and Jing Yuan. 2014. Sparse real estate ranking with online user reviews and offline moving behaviors. In Proceedings of the 2014 IEEE International Conference on Data Mining. IEEE, 120–129.
[10]
Yanjie Fu, Pengyang Wang, Jiadi Du, Le Wu, and Xiaolin Li. 2019. Efficient region embedding with multi-view spatial networks: A perspective of locality-constrained spatial autocorrelations. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 906–913.
[11]
Timnit Gebru, Jonathan Krause, Yilun Wang, Duyun Chen, Jia Deng, Erez Lieberman Aiden, and Li Fei-Fei. 2017. Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States. Proceedings of the National Academy of Sciences 114, 50 (2017), 13108–13113.
[12]
Arthur Getis and Jared Aldstadt. 2004. Constructing the spatial weights matrix using a local statistic. Geographical Analysis 36, 2 (2004), 90–104.
[13]
John A. Hartigan and Manchek A. Wong. 1979. Algorithm AS 136: A k-means clustering algorithm. Journal of the Royal Statistical Society: Series C (Applied Statistics) 28, 1 (1979), 100–108.
[14]
Tianyuan Huang, Zhecheng Wang, Hao Sheng, Andrew Y. Ng, and Ram Rajagopal. 2021. Learning neighborhood representation from multi-modal multi-graph: Image, text, mobility graph and beyond. arXiv preprint arXiv:2105.02489 (2021).
[15]
Porter Jenkins, Ahmad Farag, Suhang Wang, and Zhenhui Li. 2019. Unsupervised representation learning of spatial data via multimodal embedding. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 1993–2002.
[16]
Zhihan Jiang, Yan Liu, Xiaoliang Fan, Cheng Wang, Jonathan Li, and Longbiao Chen. 2020. Understanding urban structures and crowd dynamics leveraging large-scale vehicle mobility data. Frontiers of Computer Science 14 (2020), 1–12.
[17]
Gaganjot Kaur Kang, Jerry Zeyu Gao, Sen Chiao, Shengqiang Lu, and Gang Xie. 2018. Air quality prediction: Big data and machine learning approaches. International Journal of Environmental Science and Development 9, 1 (2018), 8–16.
[18]
Yuhao Kang, Fan Zhang, Wenzhe Peng, Song Gao, Jinmeng Rao, Fabio Duarte, and Carlo Ratti. 2021. Understanding house price appreciation using multi-source big geo-data and machine learning. Land Use Policy 111 (2021), 104919.
[19]
Kyoungok Kim. 2019. Identifying the structure of cities by clustering using a new similarity measure based on smart card data. IEEE Transactions on Intelligent Transportation Systems 21, 5 (2019), 2002–2011.
[20]
Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[21]
Thomas N. Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).
[22]
Fuxian Li, Huan Yan, Guangyin Jin, Yue Liu, Yong Li, and Depeng Jin. 2022. Automated spatio-temporal synchronous modeling with multiple graphs for traffic prediction. In Proceedings of the 31st ACM International Conference on Information and Knowledge Management. 1084–1093.
[23]
Tong Li, Yanxin Xi, Huandong Wang, Yong Li, Sasu Tarkoma, and Pan Hui. 2023. Learning representations of satellite imagery by leveraging point-of-interests. ACM Transactions on Intelligent Systems and Technology 14, 4 (2023), 1–32.
[24]
Yinfeng Li, Chen Gao, Xiaoyi Du, Huazhou Wei, Hengliang Luo, Depeng Jin, and Yong Li. 2022. Spatiotemporal-aware session-based recommendation with graph neural networks. In Proceedings of the 31st ACM International Conference on Information and Knowledge Management. 1209–1218.
[25]
Yexin Li, Yu Zheng, Huichu Zhang, and Lei Chen. 2015. Traffic prediction in a bike-sharing system. In Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems. 1–10.
[26]
Yuxuan Liang, Kun Ouyang, Junkai Sun, Yiwei Wang, Junbo Zhang, Yu Zheng, David Rosenblum, and Roger Zimmermann. 2021. Fine-grained urban flow prediction. In Proceedings of the Web Conference 2021. 1833–1845.
[27]
Harvey J. Miller. 2004. Tobler’s first law and spatial analysis. Annals of the Association of American Geographers 94, 2 (2004), 284–289.
[28]
Fionn Murtagh and Pierre Legendre. 2014. Ward’s hierarchical agglomerative clustering method: Which algorithms implement Ward’s criterion? Journal of Classification 31 (2014), 274–295.
[29]
Andrew Ng, Michael Jordan, and Yair Weiss. 2001. On spectral clustering: Analysis and an algorithm. In Proceedings of the 14th International Conference on Neural Information Processing Systems: Natural and Synthetic (NIPS ’01). 849–856.
[30]
Richard Peng, He Sun, and Luca Zanetti. 2015. Partitioning well-clustered graphs: Spectral clustering works! In Proceedings of the Conference on Learning Theory. 1423–1455.
[31]
Thilina Piyadasun, Buwaneka Kalansuriya, Malaka Gangananda, Minudika Malshan, H. M. N. Dilum Bandara, and Suresh Marru. 2017. Rationalizing police patrol beats using heuristic-based clustering. In Proceedings of the 2017 Moratuwa Engineering Research Conference (MERCon ’17). IEEE, 431–436.
[32]
Yingrong Qin, Chen Gao, Shuangqing Wei, Yue Wang, Depeng Jin, Jian Yuan, Lin Zhang, Dong Li, Jianye Hao, and Yong Li. 2023. Learning from hierarchical structure of knowledge graph for recommendation. ACM Transactions on Information Systems 42, 1 (2023), 1–24.
[33]
Hao Qu, Yongshun Gong, Meng Chen, Junbo Zhang, Yu Zheng, and Yilong Yin. 2022. Forecasting fine-grained urban flows via spatio-temporal contrastive self-supervision. IEEE Transactions on Knowledge and Data Engineering. Published Online, August 22, 2022.
[34]
Erich Schubert, Jörg Sander, Martin Ester, Hans Peter Kriegel, and Xiaowei Xu. 2017. DBSCAN revisited, revisited: Why and how you should (still) use DBSCAN. ACM Transactions on Database Systems 42, 3 (2017), 1–21.
[35]
Junkai Sun, Junbo Zhang, Qiaofei Li, Xiuwen Yi, Yuxuan Liang, and Yu Zheng. 2020. Predicting citywide crowd flows in irregular regions using multi-view graph convolutional networks. IEEE Transactions on Knowledge and Data Engineering 34, 5 (2020), 2348–2359.
[36]
Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jon Shlens, and Zbigniew Wojna. 2016. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2818–2826.
[37]
Anton Tsitsulin, John Palowitch, Bryan Perozzi, and Emmanuel Müller. 2020. Graph clustering with graph neural networks. arXiv preprint arXiv:2006.16904 (2020).
[38]
Hongjian Wang, Porter Jenkins, Hua Wei, Fei Wu, and Zhenhui Li. 2019. Learning task-specific city region partition. In Proceedings of the World Wide Web Conference. 3300–3306.
[39]
Hongjian Wang, Daniel Kifer, Corina Graif, and Zhenhui Li. 2016. Crime rate inference with big data. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 635–644.
[40]
Hongjian Wang and Zhenhui Li. 2017. Region representation learning via mobility flow. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. 237–246.
[41]
Xu Wang, Zimu Zhou, Yi Zhao, Xinglin Zhang, Kai Xing, Fu Xiao, Zheng Yang, and Yunhao Liu. 2019. Improving urban crowd flow prediction on flexible region partition. IEEE Transactions on Mobile Computing 19, 12 (2019), 2804–2817.
[42]
Zhecheng Wang, Haoyuan Li, and Ram Rajagopal. 2020. Urban2Vec: Incorporating street view imagery and POIs for multi-modal urban neighborhood embedding. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 1013–1020.
[43]
Congcong Wen, Shufu Liu, Xiaojing Yao, Ling Peng, Xiang Li, Yuan Hu, and Tianhe Chi. 2019. A novel spatiotemporal convolutional long short-term neural network for air pollution prediction. Science of the Total Environment 654 (2019), 1091–1099.
[44]
Shangbin Wu, Xu Yan, Xiaoliang Fan, Shirui Pan, Shichao Zhu, Chuanpan Zheng, Ming Cheng, and Cheng Wang. 2022. Multi-graph fusion networks for urban region embedding. arXiv preprint arXiv:2201.09760 (2022).
[45]
Liming Xu, Xianhua Zeng, Weisheng Li, and Bochuan Zheng. 2023. MFGAN: Multi-modal feature-fusion for CT metal artifact reduction using GANs. ACM Transactions on Multimedia Computing, Communications and Applications 19, 1s (2023), 1–17.
[46]
Ashima Yadav and Dinesh Kumar Vishwakarma. 2023. A deep multi-level attentive network for multimodal sentiment analysis. ACM Transactions on Multimedia Computing, Communications and Applications 19, 1 (2023), 1–19.
[47]
Zijun Yao, Yanjie Fu, Bin Liu, Wangsu Hu, and Hui Xiong. 2018. Representing urban functions through zone embedding with human mobility patterns. In Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI ’18).
[48]
Xueyan Yin, Genze Wu, Jinze Wei, Yanming Shen, Heng Qi, and Baocai Yin. 2021. Deep learning on traffic prediction: Methods, analysis, and future directions. IEEE Transactions on Intelligent Transportation Systems 23, 6 (2021), 4927–4943.
[49]
Zhitao Ying, Jiaxuan You, Christopher Morris, Xiang Ren, Will Hamilton, and Jure Leskovec. 2018. Hierarchical graph representation learning with differentiable pooling. In Proceedings of the 32nd Conference on Neural Information Processing Systems (NeurIPS ’18). 1–11.
[50]
Jing Yuan, Yu Zheng, and Xing Xie. 2012. Discovering regions of different functions in a city using human mobility and POIs. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 186–194.
[51]
Mingyang Zhang, Tong Li, Yong Li, and Pan Hui. 2021. Multi-view joint graph representation learning for urban region embedding. In Proceedings of the 29th International Joint Conference on Artificial Intelligence. 4431–4437.
[52]
Yunchao Zhang, Yanjie Fu, Pengyang Wang, Xiaolin Li, and Yu Zheng. 2019. Unifying inter-region autocorrelation and intra-region structures for spatial embedding via collective adversarial learning. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1700–1708.
[53]
Yunpeng Zheng, Gang Zhao, and Jian Liu. 2015. A novel grid based k-means cluster method for traffic zone division. In Cloud Computing and Big Data. Lecture Notes in Computer Science, Vol. 9106. Springer, 165–178.
[54]
Zhi-Hua Zhou. 2022. Rehearsal: Learning from prediction to decision. Frontiers of Computer Science 16, 4 (2022), 164352.

Index Terms

  1. A Novel Framework for Joint Learning of City Region Partition and Representation

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 20, Issue 7
      July 2024
      973 pages
      EISSN:1551-6865
      DOI:10.1145/3613662
      • Editor:
      • Abdulmotaleb El Saddik
      Issue’s Table of Contents

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 16 May 2024
      Online AM: 17 March 2024
      Accepted: 09 March 2024
      Revised: 18 February 2024
      Received: 20 July 2023
      Published in TOMM Volume 20, Issue 7

      Check for updates

      Author Tags

      1. Region partition
      2. partition learning
      3. representation learning
      4. prediction task
      5. multimodal big data

      Qualifiers

      • Research-article

      Funding Sources

      • NSFC
      • Excellent Youth Foundation of Chongqing
      • Fundamental Research Funds for the Central Universities

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 212
        Total Downloads
      • Downloads (Last 12 months)212
      • Downloads (Last 6 weeks)19
      Reflects downloads up to 10 Nov 2024

      Other Metrics

      Citations

      View Options

      Get Access

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Full Text

      View this article in Full Text.

      Full Text

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media