Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Model-Based Count Series Clustering for Bike Sharing System Usage Mining: A Case Study with the Vélib’ System of Paris

Published: 28 July 2014 Publication History

Abstract

Today, more and more bicycle sharing systems (BSSs) are being introduced in big cities. These transportation systems generate sizable transportation data, the mining of which can reveal the underlying urban phenomenon linked to city dynamics. This article presents a statistical model to automatically analyze the trip data of a bike sharing system. The proposed solution partitions (i.e., clusters) the stations according to their usage profiles. To do so, count series describing the stations’s usage through departure/arrival counts per hour throughout the day are built and analyzed. The model for processing these count series is based on Poisson mixtures and introduces a station scaling factor that handles the differences between the stations’s global usage. Differences between weekday and weekend usage are also taken into account. This model identifies the latent factors that shape the geography of trips, and the results may thus offer insights into the relationships between station neighborhood type (its amenities, its demographics, etc.) and the generated mobility patterns. In other words, the proposed method brings to light the different functions in different areas that induce specific patterns in BSS data. These potentials are demonstrated through an in-depth analysis of the results obtained on the Paris Vélib’ large-scale bike sharing system.

References

[1]
APUR. 2006. Etude de localisation des stations de vélos en libre service. Rapport. Technical Report 349. Atelier Parisien d’Urbanisme.
[2]
M. Benchimol, P. Benchimol, B. Chappert, A. De La Taille, F. Laroche, F. Meunier, and L. Robinet. 2011. Balancing the stations of a self-service bike hire system. RAIRO-Operations Research 45, 1 (Jan. 2011), 37--61.
[3]
D. M. Blei, A. Y. Ng, and M. I. Jordan. 2003. Latent Dirichlet Allocation. Journal of Machine Learning Research 3 (2003), 993--1022.
[4]
P. Borgnat, E. Fleury, C. Robardet, and A. Scherrer. 2009. Spatial analysis of dynamic movements of Vélo’v, Lyon’s shared bicycle program. In European Conference on Complex Systems (ECCS’09), Francois Kepes (Ed.). Complex Systems Society.
[5]
P. Borgnat, C. Robardet, P. Abry, P. Flandrin, J. B. Rouquier, and N. Tremblay. 2013. Dynamics On and Of Complex Networks, Volume 2. Springer Berlin Heidelberg, Chapter A Dynamical Network View of Lyon’s Vélo’v Shared Bicycle System. http://liris.cnrs.fr/publis/?id=5713
[6]
P. Borgnat, C. Robardet, J.-B. Rouquier, Abry Parice, E. Fleury, and P. Flandrin. 2011. Shared Bicycles in a City: A Signal processing and Data Analysis Perspective. Advances in Complex Systems 14, 3 (June 2011), 1--24.
[7]
H. Büttner, J. Mlasowky, T. Birkholz, D. Groper, a.C. Fernandez, G. Emberger, and M. Banfi. 2011. Optmising Bike Sharing in European Cities, A Handbook. Technical Report. Intelligent Energy Europe Program (IEE, OBIS projext).
[8]
F. Calabrese, M. Dia, G. Di Lorenzo, J. Ferreaira, and C. Ratti. 2013. Understanding individual mobility patterns from urban sensing data: A mobile phone trace example. Transportation Research Part C 26 (2013), 301--313.
[9]
D. Chemla, F. Meunier, and R. Wolfler Calvo. 2011. Balancing a bike-sharing system with multiple vehicles. In Congrès annuel de la société Française de recherche opérationelle et d’aide à la décision, ROADEF2011. Société Française de recherche opérationelle, Saint-Etienne, France.
[10]
P. De Maio. 2009. Bike-sharing: History, impacts, models of provision, and future. Journal of Public Transportation 12, 4 (2009), 41--56.
[11]
L. Dell’Olio, A. Ibeas, and J. L. Moura. 2011. Implementing bike-sharing systems. In ICE - Municipal Engineer 164, 2, 89--101.
[12]
A. P. Dempster, N. M. Laird, and D. B. Rubin. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society B 39 (1977), 1--38.
[13]
J. Dill. 2009. Bicycling for transportation and health: The role of infrastructure. Journal of Public Health Policy 30 (2009), 95--110.
[14]
R. O. Duda, P. E. Hart, and D. G. Stork. 2001. Pattern Classification (2nd ed.). Wiley, New York.
[15]
C. Fraley and A. Raftery. 2002. Model based clustering, discriminant analysis and density estimation. Journal of the American Statistical Association 97, 458 (2002), 611--631.
[16]
J. Froehlich, J. Neumann, and N. Oliver. 2008. Measuring the pulse of the city through shared bicycle programs. In UrbanSense 2008. 16--20.
[17]
J. Froehlich, J. Neumann, and N. Oliver. 2009. Sensing and predicting the pulse of the city through shared bicycling. In 21st International Joint Conference on Artificial Intelligence (IJCAI’09). AAAI Press, 1420--1426.
[18]
G. Govaert and M. Nadif. 2010. Latent block model for contingency table. Communications in Statistics-Theory and Methods 39, 3 (Jan. 2010), 416--425.
[19]
A. Hofleitner, R. Herring, P. Abbeel, and A. M. Bayen. 2012. Learning the dynamics of arterial traffic from probe data using a dynamic Bayesian network. IEEE Transactions on Intelligent Transportation Systems 13, 4 (2012), 1679--1693.
[20]
A. Kaltenbrunner, R. Meza, J. Grivolla, J. Codina, and R. Banchs. 2010. Urban cycles and mobility patterns: Exploring and predicting trends in a bicycle-based public transport system. Pervasive and Mobile Computing 6, 4 (2010), 455--466.
[21]
D. Karlis and L. Meligkosidou. 2003. Model based clustering for multivariate count data. In 18th International Workshop on Statistical Modelling. Katholieke Universiteit Leuven, 211--216.
[22]
Neal Lathia, A. Saniul, and L. Capra. 2012. Measuring the impact of opening the London shared bicycle scheme to casual users. Transportation Research Part C: Emerging Technologies 22 (June 2012), 88--102.
[23]
J. R. Lin and T. Yang. 2011. Strategic design of public bicycle sharing systems with service level constraints. Transportation Research Part E: Logistics and Transportation Review 47, 2 (2011), 284--294.
[24]
S. Ma, Y. Zheng, and O. Wolfson. 2013. T-Share: A Large-Scale Dynamic Taxi Ridesharing Service. In 29th IEEE International Conference on Data Engineering (ICDE’13). IEEE, Brisbanne, Australia.
[25]
G. J. Mclachlan and T. Krishnan. 1996. The EM Algorithm and Extension. Wiley.
[26]
G. J. Mclachlan and D. Peel. 2000. Finite Mixture Models. Wiley.
[27]
G. Michau, C. Robardet, L. Merchez, P. Jensen, P. Abry, P. Flandrin, and P. Borgnat. 2011. Peut-on attraper les utilisateurs de Vélo’v au Lasso? In XXIIIe Colloque GRETSI—Traitement du Signal et des Images. GRETSI, 46--50.
[28]
R. Nair, E. Miller-Hooks, R. C. Hampshire, and A. Bušić. 2012. Large-scale vehicle sharing systems: Analysis of Vélib’. International Journal of Sustainable Transportation 7, 1 (April 2012), 85--106.
[29]
M. E. Newman. 2006. Modularity and community structure in networks. Proceedings of the National Academy of Sciences 103, 23 (June 2006), 8577--8582.
[30]
P. Pucher and R. Buehler. 2008. Making cycling irresistible: Lessons from The Netherlands, Denmark and Germany. Transport Reviews 28, 4 (2008), 495--528.
[31]
R Core Team. 2012. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria.
[32]
C. Ratti, R. M. Pulselli, S. Williams, and D. Frenchman. 2006. Mobile landscapes: Using location data from cell phones for urban analysis. Environment and Planning B: Planning and Design 33, 5 (2006), 727--748.
[33]
Andrea Rau, Gilles Celeux, Marie-Laure Martin-Magniette, and Cathy Maugis-Rabusseau. 2011. Clustering High-Throughput Sequencing Data with Poisson Mixture Models. Technical Report RR-7786. INRIA.
[34]
Sarah Julia Thomas. 2010. Model-based Clustering for Multivariate Time Series of Counts. Ph.D. Dissertation. Rice University.
[35]
P. Vogel, T. Greiser, and D. C. Mattfeld. 2011. Understanding bike-sharing systems using data mining: Exploring activity patterns. Procedia—Social and Behavioral Sciences 20, 0 (2011), 514--523.
[36]
P. Vogel and D. C. Mattfeld. 2011. Strategic and operational planning of bike-sharing systems by data mining—A case study. In ICCL. Springer, Berlin, 127--141.
[37]
Z. Wangsheng, L. Shijian, and P. Gang. 2012. Mining the semantics of origin-destination flows using taxi traces. In ACM Conference on Ubiquitous Computing (UbiComp’12). ACM, 943--949.
[38]
J. Yuan, Y. Zheng, W. Xie, X. Xie, and Y. Huang. 2010. T-Drive: Driving directions based on taxi trajectories. In 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems (GIS’10). ACM, 99--108.
[39]
J. Yuan, Y. Zheng, and X. Xie. 2012. Discovering regions of different functions in a city using human mobility and POIs. In 18th ACM SIGKDD International Conference on Knowledge Discovery and Data mining (KDD’12). ACM, 186--194.
[40]
Y. Zheng, Y Liu, J. Yuan, and X. Xie. 2011. Urban omputing with taxicabs. In 13th ACM Conference on Ubiquitous Computing (UbiComp’11). ACM, 89--98.

Cited By

View all
  • (2024)A Systematic Review of the Coopetition Relationship between Bike‐Sharing and Public TransitJournal of Advanced Transportation10.1155/2024/66818952024:1Online publication date: 16-Jan-2024
  • (2024)AdaBoost.RDT: AdaBoost Integrated With Residual-Based Decision Tree for Demand Prediction of Bike Sharing Systems Under Extreme DemandsIEEE Access10.1109/ACCESS.2024.347401712(144316-144336)Online publication date: 2024
  • (2024)Bikesharing: The first- and last-mile service of public transportation? Evidence from an origin–destination perspectiveTransportation Research Part A: Policy and Practice10.1016/j.tra.2024.104162187(104162)Online publication date: Sep-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Intelligent Systems and Technology
ACM Transactions on Intelligent Systems and Technology  Volume 5, Issue 3
Special Section on Urban Computing
September 2014
361 pages
ISSN:2157-6904
EISSN:2157-6912
DOI:10.1145/2648782
  • Editor:
  • Qiang Yang
Issue’s Table of Contents
© 2014 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 July 2014
Accepted: 01 November 2013
Revised: 01 March 2013
Received: 01 October 2012
Published in TIST Volume 5, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Bike sharing systems
  2. clustering
  3. count data
  4. model-based clustering

Qualifiers

  • Research-article
  • Research
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)77
  • Downloads (Last 6 weeks)9
Reflects downloads up to 12 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A Systematic Review of the Coopetition Relationship between Bike‐Sharing and Public TransitJournal of Advanced Transportation10.1155/2024/66818952024:1Online publication date: 16-Jan-2024
  • (2024)AdaBoost.RDT: AdaBoost Integrated With Residual-Based Decision Tree for Demand Prediction of Bike Sharing Systems Under Extreme DemandsIEEE Access10.1109/ACCESS.2024.347401712(144316-144336)Online publication date: 2024
  • (2024)Bikesharing: The first- and last-mile service of public transportation? Evidence from an origin–destination perspectiveTransportation Research Part A: Policy and Practice10.1016/j.tra.2024.104162187(104162)Online publication date: Sep-2024
  • (2024)Assessing the Temporal and Spatial Resilience of Bike-Sharing Demand: A Spatiotemporal Dynamic AnalysisKSCE Journal of Civil Engineering10.1016/j.kscej.2024.100045(100045)Online publication date: Sep-2024
  • (2024)Multivariate count time series segmentation with “sums and shares” and Poisson lognormal mixture models: a comparative study using pedestrian flows within a multimodal transport hubAdvances in Data Analysis and Classification10.1007/s11634-023-00543-918:2(455-491)Online publication date: 1-Jun-2024
  • (2024)Practicable solution approaches for differentiated pricing of vehicle sharing systemsCentral European Journal of Operations Research10.1007/s10100-024-00915-2Online publication date: 24-May-2024
  • (2023)eShare+: A Data-Driven Balancing Mechanism for Bike Sharing Systems Considering Both Quality of Service and MaintenanceIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.325372535:10(10497-10513)Online publication date: 1-Oct-2023
  • (2023)Fleet Rebalancing for Expanding Shared e-Mobility Systems: A Multi-Agent Deep Reinforcement Learning ApproachIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2022.323342224:4(3868-3881)Online publication date: Apr-2023
  • (2023)A Systematic Literature Review on Machine Learning in Shared MobilityIEEE Open Journal of Intelligent Transportation Systems10.1109/OJITS.2023.33343934(870-899)Online publication date: 2023
  • (2023)A Regression Mixture Model to understand the effect of the Covid-19 pandemic on Public Transport Ridership2023 IEEE International Conference on Data Mining Workshops (ICDMW)10.1109/ICDMW60847.2023.00163(1259-1268)Online publication date: 4-Dec-2023
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media