Abstract
Traffic flow forecasting or prediction plays an important role in the traffic control and management of a city. Existing works mostly train a model using the traffic flow data of a city and then test the trained model using the data of the same city. It may not be truly intelligent as there are many cities around us and there should be some shared knowledge among different cities. The data of a city and its knowledge can be used to help improve the traffic flow forecasting of other cities. To address this motivation, we study building a universal deep learning model for multi-city traffic flow forecasting. In this paper, we exploit spatial-temporal correlations among different cities with multi-task learning to approach the traffic flow forecasting tasks of multiple cities. As a result, we propose a M ulti-city T raffic flow forecasting N etwork (MTN) via multi-task learning to extract the spatial dependency and temporal regularity among multiple cities later used to improve the performance of each individual city traffic flow forecasting collaboratively. In brief, the proposed model is a quartet of methods: (1) It integrates three temporal intervals and formulates a multi-interval component for each city to extract temporal features of each city; (2) A spatial-temporal attention layer with 3D Convolutional kernels is plugged into the neural networks to learn spatial-temporal relationship; (3) As traffic peak distributions of different cities are often similar, it proposes to use a peak zoom network to learn the peak effect of multiple cities and enhance the prediction performance on important time steps in different cities; (4) It uses a fusion layer to merge the outputs from distinct temporal intervals for the final forecasting results. Experimental results using real-world datasets from DIDI show the superior performance of the proposed model.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Abadi A, Rajabioun T, Ioannou P (2015) Traffic flow prediction for road transportation networks with limited traffic data. IEEE Trans Intell Transp Syst 16(2):653–662
Anwar T, Liu C, Vu H, Islam MS, Sellis T (2018) Capturing the spatiotemporal evolution in road traffic networks. IEEE Trans Knowl Data Eng 30(8):1426–1439
Bai L, Yao L, Kanhere S, Wang X, Sheng Q (2019a) Cross-city transfer learning for deep spatio-temporal prediction. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence (IJCAI-19), pp 1893–1899
Bai L, Yao L, Kanhere S, Wang X, Sheng Q (2019b) Stg2seq: Spatial-temporal graph to sequence model for multi-step passenger demand forecasting. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence (IJCAI-19), pp 1981–1987
Bian J, Tian D, Tang Y, Tao D (2019) Trajectory data classification: A review. ACM Trans Intell Syst Technol 10(4)
Cho K, van Merrienboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using rnn encoder−decoder for statistical machine translation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP-14), pp 1724–1734
Chopra S, Hadsell R, LeCun Y (2005) Learning a similarity metric discriminatively with application to face verification. In: Proceedings of the 2005 IEEE conference on computer vision and pattern recognition (CVPR-05)
Eigen D, Fergus R (2015) Predicting depth, surface normals and semantics labels with a common multi-scale convolutional architecture. In: Proceedings of the 2015 IEEE international conference on computer vision (ICCV-15), pp 2650–2658
Fang S, Zhang Q, Meng G, Xiang S, Pan C (2019) Gstnet: Global spatial-temporal network for traffic flow prediction. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence (IJCAI-19), pp 2286–2293
Gao R, Guo X, Sun F, Dai L, Zhu J, Hu C, Li H (1689) Aggressive driving saves more time? multi-task learning for customized travel time estimation. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence (IJCAI-19)
Gao Y, Ma J, Liu W, Alan L, Yuille MZ (2019b) Nddr-cnn: Layer-wise feature fusing in multi-task cnn. In: Proceedings of the 2019 IEEE conference on computer vision and pattern recognition (CVPR-19)
Gong Y, Li Z, Zhang J, Liu W, Zheng Y, Kirsch C (2018) Network-wide crowd flow prediction of sydney trains via customized online non-negative matrix factorization. In: Proceedings of the 27th ACM international conference on information and knowledge management (CIKM-18), pp 1243–1252
Hoang MX, Zheng Y, Singh AK (2016) Fccf: Forecasting citywide crowd flows based on big data. In: Proceedings of the 24th ACM SIGSPATIAL international conference on advances in geographic information systems
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
Ioffe S, Szegedy C (2015) Batch normalization: Accelerating deep network training by reducing internal covariate shift. In: Proceedings of the 32nd international conference on machine learning (ICML-15), pp 448–456
Kingma DP, Ba J (2015) Adam: A method for stochastic optimization. In: Proceedings of the 3rd international conference on learning representations (ICLR-15)
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444
Li Y, Zheng Y, Zhang H, Chen L (2015) Traffic prediction in a bike-sharing system. In: Proceedings of the 23rd ACM SIGSPATIAL international conference on advances in geographic information systems
Li Y, Yu R, Shahabi C, Liu Y (2018) Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. In: Proceedings of the 6th international conference on learning representations (ICLR-18). https://openreview.net/forum?id=SJiHXGWAZ
Liao Q, Ding Y, Jiang ZL, Wang X, Zhang C, Zhang Q (2019) Multi-task deep convolutional neural network for cancer diagnosis. Neurocomputing 348:66–73
Ling H, Wang Z, Li P, Shi Y, Chen J, Zou F (2019) Improving person re-identification by multi-task learning. Neurocomputing 347:109–118
Misra I, Shrivastava A, Gupta A, Hebert M (2016) Cross-stitch networks for multi-task learning. In: Proceedings of the 2016 IEEE conference on computer vision and pattern recognition (CVPR-16), pp 3994–4003
Pan Z, Liang Y, Wang W, Yu Y, Zhang J, Zheng Y (2019) Urban traffic prediction from spatio-temporal data using deep meta learning. In: Proceedings of the 25th SIGKDD conference on knowledge discovery and data mining (KDD-19), pp 1720–1730
Ranjan R, Patel VM, Chellappa R (2019) Hyperface: A deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition. IEEE Trans Pattern Anal Mach Intell 41(1):121–135
Savchuk OP (2010) Large-scale dynamics of hypoxia in the baltic sea. In: Chemical structure of pelagic redox interfaces: observation and modeling, pp 137–160
Shi X, Chen Z, Wang H, Yeung DY, Wong WK, WOO W-C (2015) Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In: Proceedings of the 28th international conference on neural information processing systems (NIPS-15), pp 802–810
Silva R, Kang SM, Airoldi EM (2015) Predicting traffic volumes and estimating the effects of shocks in massive transportation systems. In: Proceedings of the national academy of sciences, pp 5643–5648
Tian Y, Zhang K, Li J, Lin X, Yang B (2018) Multi-task deep convolutional neural network for cancer diagnosis. Neurocomputing 318:297–305
Wang H, Li Z, Kuo YH, Kifer D (2019a) A simple baseline for travel time estimation using large-scale trip data. ACM Trans Intell Syst Technol 10(2)
Wang Y, Zhang Y, Piao X, Liu H, Zhang K (2019b) Traffic data reconstruction via adaptive spatial-temporal correlations. IEEE Trans Intell Transp Syst 20(4):1531–1543
Wu F, Wang H, Li Z (2016) Interpreting traffic dynamics using ubiquitous urban data. In: Proceedings of the 24th ACM SIGSPATIAL international conference on advances in geographic information systems
Wu R, Luo G, Shao J, Tian L, Peng C (2018) Location prediction on trajectory data: A review. Big Data Min Anal 1(2):108–127
Wu Z, Pan S, Long G, Jiang J, Zhang C (2019) Graph wavenet for deep spatial-temporal graph modeling. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence (IJCAI-19), pp 1907–1913
Xiao Q, Dai J, Luo J, Fujita H (2019) Multi-view manifold regularized learning-based method for prioritizing candidate disease mirnas. Knowl.-Based Syst 175:118–129
Xu Y, Kong Q, Klette R, Liu Y (2014) Accurate and interpretable bayesian mars for traffic flow prediction. IEEE Trans Intell Transp Syst 15(6):2457–2469
Yang P, Tan Q, Tong H, He J (2019a) Task-adversarial co-generative nets. In: Proceedings of the 25th SIGKDD conference on knowledge discovery and data mining (KDD-19), pp 1596–1604
Yang P, Tan Q, Ye J, Tong H, He J (2019b) Deep multi-task learning with adversarial-and-cooperativenets. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence (IJCAI-19), pp 4078–4084
Yao H, Wu F, Ke J, Tang X, Jia Y, Lu S, Gong P, Ye J (2018) Deep multi-view spatial-temporal network for taxi demand prediction. In: Proceedings of the 32nd AAAI conference on artificial intelligence (AAAI-18), pp 2588–2595
Yao H, Tang X, Wei H, Zheng G, Li Z (2019) Revisiting spatial-temporal similarity: a deep learning framework for traffic prediction. In: Proceedings of the 33rd AAAI conference on artificial intelligence (AAAI-19)
Yim J, Jung H, Yoo B, Choi C, Park D, Kim J (2015) Rotating your face using multi-task deep neural network. In: Proceedings of the 2015 IEEE conference on computer vision and pattern recognition (CVPR-15), pp 676–684
Zhang H, Goodfellow I, Metaxas D, Odena A (2018a) Self-attention generative adversarial networks. arXiv:180508318
Zhang H, Zheng Y, Yu Y (2018b) Detectingurban anomalies using multiple spatio-temporal data sources. In: Proceedings of the ACM on interactive, mobile, wearable and ubiquitous technologies, vol 2
Zhang J, Zheng Y, Qi D, Li R, Yi X (2016) Dnn-based prediction model for spatio-temporal data. In: Proceedings of the 24th ACM SIGSPATIAL international conference on advances in geographic information systems
Zhang J, Zheng Y, Qi D, Li R, Yi X, Li T (2018c) Predicting citywide crowd flows using deep spatio-temporal residual networks. Artif Intell 259:147–166
Zhang J, Zheng Y, Sun J, Qi D (2019a) Flow prediction in spatio-temporal networks based on multitask deep learning. IEEE Trans Knowl Data Eng
Zhang Y, Yang Q (2017) A survey on multi-task learning. arXiv:170708114
Zhang Y, Yang Y, Li T, Fujita H (2019b) A multitask multiview clustering algorithm in heterogeneous situations based on lle and le. Knowl-Based Syst 163:776–786
Zhao L, Sun Q, Ye J, Chen F, Lu CT, Ramakrishnan N (2017) Feature constrained multi-task learning models for spatiotemporal event forecasting. IEEE Trans Knowl Data Eng 29(5):1059–1072
Zhou Y, Tao X, Yu Z, Fujita H (2019) Train-movement situation recognition for safety justification using moving-horizon tbm-based multisensor data fusion. Knowl-Based Syst 177:117– 126
Acknowledgements
This work was supported by the National Natural Science Foundation of China (Nos. 61976247 and 61572407) and the National Key Technology Research and Development Program (No. 2015BAH19F02). The datasets used in the experiments are from DIDI Chuxing GAIA Initiative. Hao Wang would like to thank a grant from Project Funded by China Postdoctoral Science Foundation (No. 2020M681960).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interests
We declare that all authors certify that they have NO affiliations with or involvement in any organization or entity with any financial interest (such as honoraria; educational grants; participation in speakers’ bureaus; membership, employment, consultancies, stock ownership, or other equity interest; and expert testimony or patent-licensing arrangements), or non-financial interest (such as personal or professional relationships, affiliations, knowledge or beliefs) in the subject matter or materials discussed in this manuscript.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhang, Y., Yang, Y., Zhou, W. et al. Multi-city traffic flow forecasting via multi-task learning. Appl Intell 51, 6895–6913 (2021). https://doi.org/10.1007/s10489-020-02074-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-020-02074-8