Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3097983.3098018acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

The Simpler The Better: A Unified Approach to Predicting Original Taxi Demands based on Large-Scale Online Platforms

Published: 13 August 2017 Publication History
  • Get Citation Alerts
  • Abstract

    Taxi-calling apps are gaining increasing popularity for their efficiency in dispatching idle taxis to passengers in need. To precisely balance the supply and the demand of taxis, online taxicab platforms need to predict the Unit Original Taxi Demand (UOTD), which refers to the number of taxi-calling requirements submitted per unit time (e.g., every hour) and per unit region (e.g., each POI). Predicting UOTD is non-trivial for large-scale industrial online taxicab platforms because both accuracy and flexibility are essential. Complex non-linear models such as GBRT and deep learning are generally accurate, yet require labor-intensive model redesign after scenario changes (e.g., extra constraints due to new regulations). To accurately predict UOTD while remaining flexible to scenario changes, we propose LinUOTD, a unified linear regression model with more than 200 million dimensions of features. The simple model structure eliminates the need of repeated model redesign, while the high-dimensional features contribute to accurate UOTD prediction. We further design a series of optimization techniques for efficient model training and updating. Evaluations on two large-scale datasets from an industrial online taxicab platform verify that LinUOTD outperforms popular non-linear models in accuracy. We envision our experiences to adopt simple linear models with high-dimensional features in UOTD prediction as a pilot study and can shed insights upon other industrial large-scale spatio-temporal prediction problems.

    Supplementary Material

    MP4 File (ye_online_platforms.mp4)

    References

    [1]
    Didi Chuxing. https://en.wikipedia.org/wiki/Didi_Chuxing.
    [2]
    Grab. https://en.wikipedia.org/wiki/Grab_(application).
    [3]
    Uber. https://en.wikipedia.org/wiki/Uber_(company).
    [4]
    A Anwar, M. Volkov, and D. Rus. 2013. Changinow: A mobile application for efficient taxi allocation at airports. In 16th International IEEE Conference on Intelligent Transportation Systems, The Hague, The Netherlands. 694--701.
    [5]
    T. Cover and J. Thomas. 2006. Elements of information theory (2. ed.).
    [6]
    J. Dean, G. Corrado, R. Monga, K. Chen, M. Devin, M. Mao, A. Senior, P. Tucker, K. Yang, and Q. Le. 2012. Large scale distributed deep networks. In Advances in neural information processing systems. 1232--1240.
    [7]
    J. Friedman. 2001. Greedy function approximation: a gradien boosting machine. Annals of statistics(2001).
    [8]
    X. He, J. Pan, O. Jin, T. Xu, B. Liu, T. Xu, Y. Shi, A. Atallah R. Herbrich, and S. Bowers. 2014. Practical lessons from predicting clicks on ads at facebook. In Proceedings of the Eighth International Workshop on Data Mining for Online Advertising, New York City, New York, USA. 5:1--5:9.
    [9]
    G. Hinton and R. Salakhutdinov. 2006. Reducing the dimensionalit of data with neural networks. Science(2006), 504--507.
    [10]
    Q. Ho, J. Cipar, H. Cui, S. Lee, J. Kim, P. Gibbons, G. Gibson, G. Ganger, and E. Xing. 2013. More effective distributed ml via a stale synchronous parallel parameter server. In Advances in neural information processing systems. 1223--1231.
    [11]
    M. Li, D. Andersen, J. Park, A. Smola, A. Ahmed, V. Josifovski, J. Long, E. Shekita, and B. Su. 2014. Scaling distributed machine learning with the parameter server. In 11th USENIX Symposium on Operating Systems Design and Implementation, Broomfield, CO, USA. 583--598.
    [12]
    Y. Li, Y. Zheng, Y. Zhang, and L. Chen. 2015. Traffic prediction in a bike-sharing system. In Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems, Bellevue, WA, USA. 33:1--33:10.
    [13]
    B. McMahan, G Holt, D Sculley, M. Young, D. Ebner, J. Grady, L. Nie, T. Phillips, E. Davydov, D. Golovin, S. Chikkerur, D. Liu, M. Wattenberg, A. Hrafnkelsson, T. Boulos, and J. Kubica. 2013. Ad click prediction: a view from the trenches. In The 19th ACM International Conference on Knowledge Discovery and Data Mining, Chicago, IL, USA. 1222--1230.
    [14]
    L. Moreira-Matias, J. Gama, M. Ferreira, J. Mendes-Moreira, and L. Damas. 2013. Predicting taxi-passenger demand using streaming data. IEEE Transactions on Intelligent Transportation Systems (2013), 1393--1402.
    [15]
    N. Mukai and N. Yoden. 2012. Taxi demand forecasting based on taxi probe data by neural network. In Intelligent Interactive Multimedia: Systems and Services. 589--597.
    [16]
    J. Sun, D. Papadias, Y. Tao, and B. Liu. 2004. Querying about the past, the present, and the future in spatio-temporal. In Proceedings of the 20th International Conference on Data Engineering, Boston, MA, USA. 202--213.
    [17]
    Y. Tong, Y. Chen, Z. Zhou, L. Chen, J. Wang, Q. Yang, and J. Ye. 2017. The simpler the better: a unified approach to predicting original taxi demands on large-scale online platforms (Techinical report). http://www.cse.ust.hk/~yxtong/tr_prediction.pdf. (2017).
    [18]
    Y. Tong, J. She, B. Ding, L. Chen, T. Wo, and K. Xu. 2016. Online Minimum Matching in Real-Time Spatial Data: Experiments and Analysis. In Proceedings of the 42nd International Conference on Very Large Databases, New Delhi, India. 109--118.
    [19]
    Y. Tong, J. She, B. Ding, L. Wang, and L. Chen. 2016. Online mobile Micro-Task Allocation in spatial crowdsourcing. In 32nd IEEE International Conference on Data Engineering, Helsinki, Finland. 49--60.
    [20]
    V. Vapnik and V. Vapnik. 1998. Statistical learning theory. Wiley New York.
    [21]
    Li X., Pan G., Wu Z., Qi G., S. Li, D Zhang, W. Zhang, and Z. Wang. 2012. Prediction of urban human mobility using large-scale taxi traces and its applications. Frontiers of Computer Science in China (2012), 111--121.
    [22]
    L. Xiao. 2010. Dual averaging methods for regularized stochastic learning and online optimization. Journal of Machine Learning Research (2010), 2543--2596.
    [23]
    J. Yuan, Y. Zheng, X. Xie, and G. Sun. 2011. Driving with knowledge from the physical world. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA. 316--324.
    [24]
    J. Yuan, Y. Zheng, C. Zhang, W. Xie, G. Xie, X.and Sun, and Y. Huang. 2010. T-drive: driving directions based on taxi trajectories. In 18th ACM SIGSPATIAL International Symposium on Advances in Geographic Information Systems, San Jose, CA, USA, Proceedings. 99--108.
    [25]
    J. Yuan, Y. Zheng, L. Zhang, X. Xie, and G. Sun. Where to find my next passenger. In Proceedings of the 13th international conference on Ubiquitous computing, Beijing, China. 109--118.
    [26]
    M. Zeiler. 2012. ADADELTA: an adaptive learning rate method. arXiv preprint (2012).
    [27]
    K. Zhang, Z. Feng, S. Chen, K. Huang, and G. Wang. 2016. A Framework for Passengers Demand Prediction and Recommendation. In IEEE International Conference on Services Computing, San Francisco, CA, USA. 340--347.
    [28]
    K. Zhao, D. Khryashchev, J. Freire, C. Silva, and H. Vo. 2016. Predicting taxi demand at high spatial resolution: approaching the limit of predictability. In 2016 IEEE International Conference on Big Data, Washington DC, USA. 833--842.
    [29]
    Y. Zheng, L. Capra, O. Wolfson, and H. Yang. 2014. Urban computing: concepts, methodologies, and applications. ACM Transactions on Intelligent Systems and Technology (2014), 38:1--38:55.
    [30]
    Y. Zheng, Y. Liu, J. Yuan, and X. Xie. 2011. Urban computing with taxicabs. In Proceedings of the 13th international conference on Ubiquitous computing, Beijing, China. 89--98

    Cited By

    View all
    • (2024)An Enhanced Taxi Demand Perception System Leveraging Fusion and Automated Sensor IntegrationTrends and Applications in Mechanical Engineering, Composite Materials and Smart Manufacturing10.4018/979-8-3693-1966-6.ch002(35-46)Online publication date: 19-Apr-2024
    • (2024)DeepMeshCity: A Deep Learning Model for Urban Grid PredictionACM Transactions on Knowledge Discovery from Data10.1145/365285918:6(1-26)Online publication date: 29-Apr-2024
    • (2024)Multimodal Transport Demand Forecasting via Federated LearningIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2023.332593625:5(4009-4020)Online publication date: May-2024
    • Show More Cited By

    Index Terms

    1. The Simpler The Better: A Unified Approach to Predicting Original Taxi Demands based on Large-Scale Online Platforms

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
        August 2017
        2240 pages
        ISBN:9781450348874
        DOI:10.1145/3097983
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 13 August 2017

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. feature engineering
        2. prediction
        3. unit original taxi demands

        Qualifiers

        • Research-article

        Funding Sources

        • Hong Kong CERG projects
        • National Natural Science Foundation of China
        • Microsoft Research Asia Gift Grant
        • Hong Kong RGC Project
        • HKUST-SSTSP
        • National Grand Fundamental Research 973 Program of China
        • NSFC Guang Dong Grant
        • SKLSDE (BUAA) Open Program
        • Google Faculty Award 2013
        • ITS

        Conference

        KDD '17
        Sponsor:

        Acceptance Rates

        KDD '17 Paper Acceptance Rate 64 of 748 submissions, 9%;
        Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

        Upcoming Conference

        KDD '24

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)113
        • Downloads (Last 6 weeks)10
        Reflects downloads up to 12 Aug 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)An Enhanced Taxi Demand Perception System Leveraging Fusion and Automated Sensor IntegrationTrends and Applications in Mechanical Engineering, Composite Materials and Smart Manufacturing10.4018/979-8-3693-1966-6.ch002(35-46)Online publication date: 19-Apr-2024
        • (2024)DeepMeshCity: A Deep Learning Model for Urban Grid PredictionACM Transactions on Knowledge Discovery from Data10.1145/365285918:6(1-26)Online publication date: 29-Apr-2024
        • (2024)Multimodal Transport Demand Forecasting via Federated LearningIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2023.332593625:5(4009-4020)Online publication date: May-2024
        • (2024)Coupling Makes Better: An Intertwined Neural Network for Taxi and Ridesourcing Demand Co-PredictionIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2023.331222425:2(1691-1705)Online publication date: Feb-2024
        • (2024)The Hierarchical Clustering of Human Mobility BehaviorsIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.328146911:2(1876-1887)Online publication date: Apr-2024
        • (2024)Learning Time-Aware Graph Structures for Spatially Correlated Time Series Forecasting2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00338(4435-4448)Online publication date: 13-May-2024
        • (2024)Online relocating and matching of ride-hailing services: A model-based modular approachTransportation Research Part E: Logistics and Transportation Review10.1016/j.tre.2024.103600188(103600)Online publication date: Aug-2024
        • (2024)Predicting ride-hailing passenger demandFuture Generation Computer Systems10.1016/j.future.2024.02.026156:C(168-178)Online publication date: 18-Jul-2024
        • (2024)Multimodal joint prediction of traffic spatial-temporal data with graph sparse attention mechanism and bidirectional temporal convolutional networkAdvanced Engineering Informatics10.1016/j.aei.2024.10253362(102533)Online publication date: Oct-2024
        • (2024)Explaining Taxi Demand Prediction Models Based on Feature ImportanceArtificial Intelligence. ECAI 2023 International Workshops10.1007/978-3-031-50396-2_15(269-284)Online publication date: 21-Jan-2024
        • Show More Cited By

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media