Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3534678.3539250acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Selective Cross-City Transfer Learning for Traffic Prediction via Source City Region Re-Weighting

Published: 14 August 2022 Publication History

Abstract

Deep learning models have been demonstrated powerful in modeling complex spatio-temporal data for traffic prediction. In practice, effective deep traffic prediction models rely on large-scale traffic data, which is not always available in real-world scenarios. To alleviate the data scarcity issue, a promising way is to use cross-city transfer learning methods to fine-tune well-trained models from source cities with abundant data. However, existing approaches overlook the divergence between source and target cities, and thus, the trained model from source cities may contain noise or even harmful source knowledge. To address the problem, we propose CrossTReS, a selective transfer learning framework for traffic prediction that adaptively re-weights source regions to assist target fine-tuning. As a general framework for fine-tuning-based cross-city transfer learning, CrossTReS consists of a feature network, a weighting network, and a prediction model. We train the feature network with node- and edge-level domain adaptation techniques to learn generalizable spatial features for both source and target cities. We further train the weighting network via source-target joint meta-learning such that source regions helpful to target fine-tuning are assigned high weights. Finally, the prediction model is selectively trained on the source city with the learned weights to initialize target fine-tuning. We evaluate CrossTReS using real-world taxi and bike data, where under the same settings, CrossTReS outperforms state-of-the-art baselines by up to 8%. Moreover, the learned region weights offer interpretable visualization.

Supplemental Material

MP4 File
Deep learning has been shown powerful in modeling spatio-temporal data for traffic prediction. In practice, effective traffic prediction models rely on large-scale data, which is not always available in real-world scenarios. To alleviate the issue, a promising way is to apply cross-city transfer learning methods to fine-tune well-trained models from source cities with abundant data. However, existing methods overlook the divergence between source and target cities. Thus, the trained model from source cities may introduce noise or even harmful knowledge. In this video, we present CrossTReS, a selective transfer learning framework for traffic prediction that adaptively re-weights source regions to assist target fine-tuning. We evaluate CrossTReS using real-world data, where CrossTReS outperforms state-of-the-art baselines by up to 8%. Moreover, the learned region weights offer interpretable visualization.

References

[1]
Karsten M Borgwardt, Arthur Gretton, Malte J Rasch, Hans-Peter Kriegel, Bernhard Schölkopf, and Alex J Smola. 2006. Integrating structured biological data by kernel maximum mean discrepancy. Bioinformatics 22, 14 (2006), e49--e57.
[2]
Zhangjie Cao, Kaichao You, Mingsheng Long, Jianmin Wang, and Qiang Yang. 2019. Learning to transfer examples for partial domain adaptation. In Proceedings of the IEEE/CVF Conference on Computer Vision & Pattern Recognition. 2985--2994.
[3]
Zhao Chen, Vijay Badrinarayanan, Chen-Yu Lee, and Andrew Rabinovich. 2018. Gradnorm: Gradient normalization for adaptive loss balancing in deep multitask networks. In International Conference on Machine Learning. PMLR, 794--803.
[4]
Wenyuan Dai, Qiang Yang, Gui-Rong Xue, and Yong Yu. 2007. Boosting for Transfer Learning. In International Conference on Machine Learning. ACM, 193--200.
[5]
Yunshu Du, Wojciech M Czarnecki, Siddhant M Jayakumar, Mehrdad Farajtabar, Razvan Pascanu, and Balaji Lakshminarayanan. 2018. Adapting auxiliary losses using gradient similarity. arXiv preprint arXiv:1812.02224 (2018).
[6]
Chelsea Finn, Pieter Abbeel, and Sergey Levine. 2017. Model-agnostic metalearning for fast adaptation of deep networks. In International Conference on Machine Learning, Vol. 70. PMLR, 1126--1135.
[7]
Yanjie Fu, Pengyang Wang, Jiadi Du, Le Wu, and Xiaolin Li. 2019. Efficient region embedding with multi-view spatial networks: A perspective of localityconstrained spatial autocorrelations. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 906--913.
[8]
Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand, and Victor Lempitsky. 2016. Domain-adversarial training of neural networks. The Journal of Machine Learning Research 17, 1 (2016), 2096--2030.
[9]
Xu Geng, Yaguang Li, Leye Wang, Lingyu Zhang, Qiang Yang, Jieping Ye, and Yan Liu. 2019. Spatiotemporal multi-graph convolution network for ride-hailing demand forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 3656--3663.
[10]
Bin Guo, Jing Li, Vincent W Zheng, Zhu Wang, and Zhiwen Yu. 2018. Citytransfer: Transferring inter-and intra-city knowledge for chain store site recommendation based on multi-source urban data. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 1, 4 (2018), 1--23.
[11]
Xueting Han, Zhenhuan Huang, Bang An, and Jing Bai. 2021. Adaptive Transfer Learning on Graph Neural Networks. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 565--574.
[12]
Renhe Jiang, Du Yin, Zhaonan Wang, Yizhuo Wang, Jiewen Deng, Hangchen Liu, Zekun Cai, Jinliang Deng, Xuan Song, and Ryosuke Shibasaki. 2021. DL-Traff: Survey and Benchmark of Deep Learning Models for Urban Traffic Prediction. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 4515--4525.
[13]
Yaguang Li, Rose Yu, Cyrus Shahabi, and Yan Liu. 2017. Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. In International Conference on Learning Representations.
[14]
Zheng Li, Ying Wei, Yu Zhang, and Qiang Yang. 2018. Hierarchical attention transfer network for cross-domain sentiment classification. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32. 5852--5859.
[15]
Zheng Li, Yun Zhang, Ying Wei, Yuxiang Wu, and Qiang Yang. 2017. End-to-End Adversarial Memory Network for Cross-domain Sentiment Classification. In Proceedings of the 26th International Joint Conference on Artificial Intelligence. 2237--2243.
[16]
Yan Liu, Bin Guo, Daqing Zhang, Djamal Zeghlache, Jingmin Chen, Ke Hu, Sizhe Zhang, Dan Zhou, and Zhiwen Yu. 2021. Knowledge Transfer with Weighted Adversarial Network for Cold-Start Store Site Recommendation. ACM Transactions on Knowledge Discovery from Data (TKDD) 15, 3 (2021), 1--27.
[17]
Mingsheng Long, Yue Cao, Jianmin Wang, and Michael Jordan. 2015. Learning transferable features with deep adaptation networks. In International Conference on Machine Learning. PMLR, 97--105.
[18]
Sinno Jialin Pan and Qiang Yang. 2009. A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering 22, 10 (2009), 1345--1359.
[19]
Leye Wang, Xu Geng, Xiaojuan Ma, Feng Liu, and Qiang Yang. 2019. Cross-city Transfer Learning for Deep Spatio-temporal Prediction. In Proceedings of the 28th International Joint Conference on Artificial Intelligence. 1893--1899.
[20]
Senzhang Wang, Hao Miao, Jiyue Li, and Jiannong Cao. 2022. Spatio-Temporal Knowledge Transfer for Urban Crowd Flow Prediction via Deep Attentive Adaptation Networks. IEEE Transactions on Intelligent Transportation Systems 23, 5 (2022), 4695--4705.
[21]
Ying Wei, Yu Zheng, and Qiang Yang. 2016. Transfer knowledge between cities. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1905--1914.
[22]
Kaiqiang Xu, Xinchen Wan, Hao Wang, Zhenghang Ren, Xudong Liao, Decang Sun, Chaoliang Zeng, and Kai Chen. 2021. TACC: A Full-stack Cloud Computing Infrastructure for Machine Learning Tasks. arXiv preprint:2110.01556 (2021).
[23]
Shuwen Yang, Guojie Song, Yilun Jin, and Lun Du. 2020. Domain Adaptive Classification on Heterogeneous Information Networks. In Proceedings of the 29th International Joint Conference on Artificial Intelligence. 1410--1416.
[24]
Huaxiu Yao, Yiding Liu, Ying Wei, Xianfeng Tang, and Zhenhui Li. 2019. Learning from multiple cities: A meta-learning approach for spatial-temporal prediction. In The World Wide Web Conference. 2181--2191.
[25]
Huaxiu Yao, Xianfeng Tang, Hua Wei, Guanjie Zheng, and Zhenhui Li. 2019. Revisiting spatial-temporal similarity: A deep learning framework for traffic prediction. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 5668--5675.
[26]
Huaxiu Yao, Fei Wu, Jintao Ke, Xianfeng Tang, Yitian Jia, Siyu Lu, Pinghua Gong, Jieping Ye, and Zhenhui Li. 2018. Deep multi-view spatial-temporal network for taxi demand prediction. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32. 2588--2595.
[27]
Bing Yu, Haoteng Yin, and Zhanxing Zhu. 2018. Spatio-temporal graph convolutional networks: a deep learning framework for traffic forecasting. In Proceedings of the 27th International Joint Conference on Artificial Intelligence. 3634--3640.
[28]
Jing Zhang, Zewei Ding, Wanqing Li, and Philip Ogunbona. 2018. Importance weighted adversarial nets for partial domain adaptation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8156--8164.
[29]
Junbo Zhang, Yu Zheng, and Dekang Qi. 2017. Deep spatio-temporal residual networks for citywide crowd flows prediction. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 31. 1655--1661.
[30]
Mingyang Zhang, Tong Li, Yong Li, and Pan Hui. 2020. Multi-View Joint Graph Representation Learning for Urban Region Embedding. In Proceedings of the 29th International Joint Conference on Artificial Intelligence. 4431--4437.
[31]
Yunchao Zhang, Yanjie Fu, Pengyang Wang, Xiaolin Li, and Yu Zheng. 2019. Unifying inter-region autocorrelation and intra-region structures for spatial embedding via collective adversarial learning. In Proceedings of 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 1700--1708.

Cited By

View all
  • (2025)Adaptive scheduling for Internet of Vehicles using deconfounded graph transfer learningComputer Networks10.1016/j.comnet.2024.110899256(110899)Online publication date: Jan-2025
  • (2024)Multiview Spatial-Temporal Meta-Learning for Multivariate Time Series ForecastingSensors10.3390/s2414447324:14(4473)Online publication date: 10-Jul-2024
  • (2024)Time-series Stay Frequency for Multi-City Next Location Prediction using Multiple BERTsProceedings of the 2nd ACM SIGSPATIAL International Workshop on Human Mobility Prediction Challenge10.1145/3681771.3699909(5-9)Online publication date: 29-Oct-2024
  • Show More Cited By

Index Terms

  1. Selective Cross-City Transfer Learning for Traffic Prediction via Source City Region Re-Weighting

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
        August 2022
        5033 pages
        ISBN:9781450393850
        DOI:10.1145/3534678
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 14 August 2022

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. selective transfer learning
        2. traffic prediction
        3. urban computing

        Qualifiers

        • Research-article

        Funding Sources

        Conference

        KDD '22
        Sponsor:

        Acceptance Rates

        Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

        Upcoming Conference

        KDD '25

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)422
        • Downloads (Last 6 weeks)38
        Reflects downloads up to 23 Dec 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2025)Adaptive scheduling for Internet of Vehicles using deconfounded graph transfer learningComputer Networks10.1016/j.comnet.2024.110899256(110899)Online publication date: Jan-2025
        • (2024)Multiview Spatial-Temporal Meta-Learning for Multivariate Time Series ForecastingSensors10.3390/s2414447324:14(4473)Online publication date: 10-Jul-2024
        • (2024)Time-series Stay Frequency for Multi-City Next Location Prediction using Multiple BERTsProceedings of the 2nd ACM SIGSPATIAL International Workshop on Human Mobility Prediction Challenge10.1145/3681771.3699909(5-9)Online publication date: 29-Oct-2024
        • (2024)SARN: Structurally-Aware Recurrent Network for Spatio-Temporal DisaggregationProceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems10.1145/3678717.3691295(338-349)Online publication date: 29-Oct-2024
        • (2024)Addressing Data Challenges to Drive the Transformation of Smart CitiesACM Transactions on Intelligent Systems and Technology10.1145/366348215:5(1-65)Online publication date: 7-Nov-2024
        • (2024)CrossLight: Offline-to-Online Reinforcement Learning for Cross-City Traffic Signal ControlProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671927(2765-2774)Online publication date: 25-Aug-2024
        • (2024)UniST: A Prompt-Empowered Universal Model for Urban Spatio-Temporal PredictionProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671662(4095-4106)Online publication date: 25-Aug-2024
        • (2024)Urban Foundation Models: A SurveyProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671453(6633-6643)Online publication date: 25-Aug-2024
        • (2024)AdaTM: Fine-grained Urban Flow Inference with Adaptive Knowledge Transfer across Multiple CitiesProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679856(3424-3432)Online publication date: 21-Oct-2024
        • (2024)FGITrans: Cross-City Transformer for Fine-grained Urban Flow InferenceProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679855(3415-3423)Online publication date: 21-Oct-2024
        • Show More Cited By

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media