Deep architecture for traffic flow prediction: Deep belief networks with multitask learning

W Huang, G Song, H Hong, K Xie - IEEE Transactions on …, 2014 - ieeexplore.ieee.org
IEEE Transactions on Intelligent Transportation Systems, 2014ieeexplore.ieee.org
Traffic flow prediction is a fundamental problem in transportation modeling and
management. Many existing approaches fail to provide favorable results due to being: 1)
shallow in architecture; 2) hand engineered in features; and 3) separate in learning. In this
paper we propose a deep architecture that consists of two parts, ie, a deep belief network
(DBN) at the bottom and a multitask regression layer at the top. A DBN is employed here for
unsupervised feature learning. It can learn effective features for traffic flow prediction in an …
Traffic flow prediction is a fundamental problem in transportation modeling and management. Many existing approaches fail to provide favorable results due to being: 1) shallow in architecture; 2) hand engineered in features; and 3) separate in learning. In this paper we propose a deep architecture that consists of two parts, i.e., a deep belief network (DBN) at the bottom and a multitask regression layer at the top. A DBN is employed here for unsupervised feature learning. It can learn effective features for traffic flow prediction in an unsupervised fashion, which has been examined and found to be effective for many areas such as image and audio classification. To the best of our knowledge, this is the first paper that applies the deep learning approach to transportation research. To incorporate multitask learning (MTL) in our deep architecture, a multitask regression layer is used above the DBN for supervised prediction. We further investigate homogeneous MTL and heterogeneous MTL for traffic flow prediction. To take full advantage of weight sharing in our deep architecture, we propose a grouping method based on the weights in the top layer to make MTL more effective. Experiments on transportation data sets show good performance of our deep architecture. Abundant experiments show that our approach achieved close to 5% improvements over the state of the art. It is also presented that MTL can improve the generalization performance of shared tasks. These positive results demonstrate that deep learning and MTL are promising in transportation research.
ieeexplore.ieee.org