Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Discovering routines from large-scale human locations using probabilistic topic models

Published: 24 January 2011 Publication History

Abstract

In this work, we discover the daily location-driven routines that are contained in a massive real-life human dataset collected by mobile phones. Our goal is the discovery and analysis of human routines that characterize both individual and group behaviors in terms of location patterns. We develop an unsupervised methodology based on two differing probabilistic topic models and apply them to the daily life of 97 mobile phone users over a 16-month period to achieve these goals. Topic models are probabilistic generative models for documents that identify the latent structure that underlies a set of words. Routines dominating the entire group's activities, identified with a methodology based on the Latent Dirichlet Allocation topic model, include “going to work late”, “going home early”, “working nonstop” and “having no reception (phone off)” at different times over varying time-intervals. We also detect routines which are characteristic of users, with a methodology based on the Author-Topic model. With the routines discovered, and the two methods of characterizing days and users, we can then perform various tasks. We use the routines discovered to determine behavioral patterns of users and groups of users. For example, we can find individuals that display specific daily routines, such as “going to work early” or “turning off the mobile (or having no reception) in the evenings”. We are also able to characterize daily patterns by determining the topic structure of days in addition to determining whether certain routines occur dominantly on weekends or weekdays. Furthermore, the routines discovered can be used to rank users or find subgroups of users who display certain routines. We can also characterize users based on their entropy. We compare our method to one based on clustering using K-means. Finally, we analyze an individual's routines over time to determine regions with high variations, which may correspond to specific events.

References

[1]
Akoush, S. and Sameh, A. 2007. Mobile user movement prediction using bayesian learning for neural networks. In Proceedings of the ACM International Wireless Communications and Mobile Computing Conference (IWCMC). 191--196.
[2]
Blei, D. M., Griffiths, T. L., Jordan, M. I., and Tenenbaum, J. B. 2004. Hierarchical topic models and the nested Chinese restaurant process. In Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS).
[3]
Blei, D. M., Ng, A. Y., and Jordan, M. I. 2003. Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993--1022.
[4]
Choudhury, T. and Pentland, A. 2003. Sensing and modeling human networks using the sociometer. In Proceedings of the IEEE International Symposium on Wearable Computers (ISWC). 216.
[5]
Choudhury, T., Philipose, M., Wyatt, D., and Lester, J. 2006. Towards activity databases: Using sensors and statistical models to summarize people's lives. IEEE Data Engin. Bull. 49--58.
[6]
Duda, R., Hart, P., and Stork, D. 2000. Pattern Classification 2nd Ed. Wiley-Interscience.
[7]
Eagle, N. and Pentland, A. 2009. Eigenbehaviors: Identifying structure in routine. Behav. Ecol. Sociobiol. 63, 7, 1057--1066.
[8]
Eagle, N., Pentland, A., and Lazer, D. 2009. Inferring social network structure using mobile phone data. Proc. Nat. Acad. Sci. 106, 36, 15274--15278.
[9]
Farrahi, K. and Gatica-Perez, D. 2008a. Discovering human routines from cell phone data with topic models. In Proceedings of the IEEE International Symposium on Wearable Computers (ISWC).
[10]
Farrahi, K., and Gatica-Perez, D. 2008b. What did you do today? Discovering daily routines from large-scale mobile data. In Proceedings of the ACM International Conference on Multimedia (ACM MM).
[11]
Gonzalez, M. C., Hidalgo, C. A., and Barabasi, A.-L. 2008. Understanding individual human mobility patterns. Nature 453, 7196, 779--782.
[12]
Griffiths, T. L. and Steyvers, M. 2004. Finding scientific topics. Proc. Nat. Acad. Sci. USA 101 Suppl 1, 5228--5235.
[13]
Heinrich, G. 2008. Parameter estimation for text analysis,. Tech. rep., University of Leipzig.
[14]
Hightower, J., Consolvo, S., Lamarca, A., Smith, I., and Hughes, J. 2005. Learning and recognizing the places we go. In Proceedings of the Workshop on Ubiquitous Computing (UbiComp). 159--176.
[15]
Huynh, T., Fritz, M., and Schiele, B. 2008. Discovery of activity patterns using topic models. In Proceedings of the Workshop on Ubiquitous Computing (UbiComp). 10--19.
[16]
Intille, S. S., Larson, K., Tapia, E. M., Beaudin, J. S., Kaushik, P., Nawyn, J., and Rockinson, R. 2006. Using a live-in laboratory for ubiquitous computing research. In Proceedings of the Workshop on Pervasive Computing. 349--365.
[17]
Kiukkonen, N., Blom, J., Dousse, O., Gatica-Perez, D., and Laurila, J. 2010. Towards rich mobile phone datasets: Lausanne data collection campaign. In Proceedings of the ACM International Conference on Pervasive Services (ICPS).
[18]
Krumm, J. and Horvitz, E. 2006. Predestination: Inferring destinations from partial trajectories. In Proceedings of the Workshop on Ubiquitous Computing (UbiComp).
[19]
Larson, K. and Intille, S. Placelab website. http://architecture.mit.edu/house_n/data/placelab/placelab.htm.
[20]
Letchner, J., Fox, D., and LaMarca, A. 2005. Large-scale localization from wireless signal strength. In Proceedings of the Natural Conference on Artificial Intelligence (AAAI). 15--20.
[21]
Liao, L., Fox, D., and Kautz, H. 2006. Location-based activity recognition. In Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS). 787--794.
[22]
Ling, R. and Donner, J. 2009. Mobile Communication: Digital Media and Society Series. Polity Press.
[23]
Logan, B., Healey, J., Philipose, M., Tapia, E. M., and Intille, S. S. 2007. A long-term evaluation of sensing modalities for activity recognition. In Proceedings of the Workshop on Ubiquitous Computing (UbiComp). Lecture Notes in Computer Science, vol. 4717, Springer, 483--500.
[24]
MIT Technology Review. 10 emerging technologies 2008. http://www.technologyreview.com/specialreports/specialreport.aspx?id=25.
[25]
Monay, F. and Gatica-Perez, D. 2007. Modeling semantic aspects for cross-media image retrieval. IEEE Trans. Patt. Anal. Mach. Intell.
[26]
Niebles, J., Wang, H., and Fei-Fei, L. 2006. Unsupervised learning of human action categories using spatial-temporal words. Int. J. Comput. Vision.
[27]
Otsason, V., Varshavsky, A., Lamarca, A., and de Lara, E. 2005. Accurate gsm indoor localization. In Proceedings of the Workshop on Ubiquitous Computing (UbiComp). Springer, Berlin, 141--158.
[28]
Patterson, D., Liao, L., Fox, D., and Kautz, H. 2003. Inferring high-level behavior from low-level sensors. In Proceedings of the Workshop on Ubiquitous Computing (UbiComp). 73--89.
[29]
Pritchard, J., Stephens, M., and Donnelly, P. 2000. Inference of population structure using multilocus genotype data. Genetics. 155, 945--959.
[30]
Quelhas, P., Monay, F., Odobez, J.-M., Gatica-Perez, D., and Tuytelaars, T. 2007. A thousand words in a scene. IEEE Trans. Patt. Anal. Mach. Intell., 1575--89.
[31]
Reddy, S., Burke, J., Estrin, D., Hansen, M., and Strivastava, M. 2008. Using mobile phones to determine transportation mode. In Proceedings of the International Symposium on Wearable Computers (ISWC).
[32]
Rosen-Zvi, M., Griffiths, T., Steyvers, M., and Smyth, P. 2004. The author-topic model for authors and documents. In Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI). 487--494.
[33]
Sohn, T., Varshavsky, A., Lamarca, A., Chen, M., Choudhury, T., Smith, I., Consolvo, S., Hightower, J., Griswold, W., and de Lara, E. 2006. Mobility detection using everyday gsm traces. In Proceedings of the Workshop on Ubiquitous Computing (UbiComp). 212--224.
[34]
Tapia, E., Intille, S., Haskell, W., Larson, K., Wright, J., King, A., and Friedman, R. 2007. Real-time recognition of physical activities and their intensities using wireless accelerometers and a heart rate monitor. In Proceedings of the International Symposium on Wearable Computers (ISWC). 37--40.
[35]
Tapia, E. M., Intille, S. S., and Larson, K. 2004. Activity recognition in the home setting using simple and ubiquitous sensors. In Proceedings of PERVASIVE. 158--175.
[36]
Wang, Y., Bai, H., Stanton, M., Chen, W.-Y., and Chang, E. Y. 2009. PLDA: Parallel latent dirichlet allocation for large-scale applications. In Proceedings of AAIM. 301--314.
[37]
Wren, C., Ivanov, Y., Kaur, I., Leigh, D., Westhues, J., Wren, C. R., Ivanov, Y. A., Kaur, I., Leigh, D., and Westhues, J. 2007. Socialmotion: Measuring the hidden social life of a building. In Proceedings of the International Symposium on Location- and Context-Awareness (LoCA'07). 85--102.

Cited By

View all
  • (2025)Evaluating spatial effect of transportation planning factors on taxi CO2 emissionsScience of The Total Environment10.1016/j.scitotenv.2024.178142959(178142)Online publication date: Jan-2025
  • (2024)Daily Routine Recognition from Longitudinal, Real-Life Wearable Sensor Data for the Elderly2024 International Conference on Activity and Behavior Computing (ABC)10.1109/ABC61795.2024.10651644(1-9)Online publication date: 29-May-2024
  • (2024)Exploring changes in residents' daily activity patterns through sequence visualization analysisIET Intelligent Transport Systems10.1049/itr2.1251118:10(1879-1894)Online publication date: 17-Oct-2024
  • Show More Cited By

Index Terms

  1. Discovering routines from large-scale human locations using probabilistic topic models

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Intelligent Systems and Technology
      ACM Transactions on Intelligent Systems and Technology  Volume 2, Issue 1
      January 2011
      187 pages
      ISSN:2157-6904
      EISSN:2157-6912
      DOI:10.1145/1889681
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 24 January 2011
      Accepted: 01 May 2010
      Received: 01 March 2010
      Published in TIST Volume 2, Issue 1

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Human activity modeling
      2. reality mining
      3. topic models

      Qualifiers

      • Research-article
      • Research
      • Refereed

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)37
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 11 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2025)Evaluating spatial effect of transportation planning factors on taxi CO2 emissionsScience of The Total Environment10.1016/j.scitotenv.2024.178142959(178142)Online publication date: Jan-2025
      • (2024)Daily Routine Recognition from Longitudinal, Real-Life Wearable Sensor Data for the Elderly2024 International Conference on Activity and Behavior Computing (ABC)10.1109/ABC61795.2024.10651644(1-9)Online publication date: 29-May-2024
      • (2024)Exploring changes in residents' daily activity patterns through sequence visualization analysisIET Intelligent Transport Systems10.1049/itr2.1251118:10(1879-1894)Online publication date: 17-Oct-2024
      • (2024)Activity type detection of mobile phone data based on self-training: Application of the teacher–student cycling modelTransportation Research Part C: Emerging Technologies10.1016/j.trc.2024.104550161(104550)Online publication date: Apr-2024
      • (2024)Multi-day activity pattern recognition based on semantic embeddings of activity chainsTravel Behaviour and Society10.1016/j.tbs.2023.10068234(100682)Online publication date: Jan-2024
      • (2024)Exploring the impact of trip patterns on spatially aggregated crashes using floating vehicle trajectory data and graph Convolutional NetworksAccident Analysis & Prevention10.1016/j.aap.2023.107340194(107340)Online publication date: Jan-2024
      • (2023)Neural Topic Modeling via Discrete Variational InferenceACM Transactions on Intelligent Systems and Technology10.1145/357050914:2(1-33)Online publication date: 16-Feb-2023
      • (2023)Identifying latent activity behaviors and lifestyles using mobility data to describe urban dynamicsEPJ Data Science10.1140/epjds/s13688-023-00390-w12:1Online publication date: 18-May-2023
      • (2022)Characteristic Behavior of Human Multi-Joint Spatial Trajectory in Slalom SkiingJournal of Advanced Computational Intelligence and Intelligent Informatics10.20965/jaciii.2022.p080126:5(801-807)Online publication date: 20-Sep-2022
      • (2022)Application of LDA and word2vec to detect English off-topic compositionPLOS ONE10.1371/journal.pone.026455217:2(e0264552)Online publication date: 25-Feb-2022
      • Show More Cited By

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media