Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3274895.3274982acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article
Public Access

Creating full individual-level location timelines from sparse social media data

Published: 06 November 2018 Publication History
  • Get Citation Alerts
  • Abstract

    In many domain applications, a continuous timeline of human locations is critical; for example for understanding possible locations wherea disease may spread, or the flow of traffic. While data sources such as GPS trackers or Call Data Records are temporally-rich, they are expensive, often not publicly available or garnered only in select locations, restricting their wide use. Conversely, geo-located social media data are publicly and freely available, but present challenges especially for full timeline inference due to their sparse nature. We propose a stochastic framework, Intermediate Location Computing (ILC) which uses prior knowledge about human mobility patterns to predict every missing location from an individual's social media timeline. We compare ILC with a state-of-the-art RNN baseline as well as methods that are optimized for next-location prediction only. For three major cities, ILC predicts the top 1 location for all missing locations in a timeline, at 1 and 2-hour resolution, with up to 77.2% accuracy (up to 6% better accuracy than all compared methods). Specifically, ILC also outperforms the RNN in settings of low data; both cases of very small number of users (under 50), as well as settings with more users, but with sparser timelines. In general, the RNN model needs a higher number of users to achieve the same performance as ILC. Overall, this work illustrates the tradeoff between prior knowledge of heuristics and more data, for an important societal problem of filling in entire timelines using freely available, but sparse social media data.

    References

    [1]
    Shunsuke Aoki, Kaoru Sezaki, Nicholas Jing Yuan, and Xing Xie. 2017. An Early Event Detection Technique with Bus GPS Data. In Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. ACM, 49.
    [2]
    Akinori Asahara, Kishiko Maruyama, and Ryosuke Shibasaki. 2012. A mixed autoregressive hidden-markov-chain model applied to people's movements. In Proceedings of the 20th international conference on advances in geographic information systems. ACM, 414--417.
    [3]
    Jie Bao, Yu Zheng, and Mohamed F Mokbel. 2012. Location-based and preference-aware recommendation using sparse geo-social networking data. In Proceedings of the 20th international conference on advances in geographic information systems. ACM, 199--208.
    [4]
    Mitra Baratchi, Nirvana Meratnia, Paul JM Havinga, Andrew K Skidmore, and Bert AKG Toxopeus. 2014. A hierarchical hidden semi-Markov model for modeling mobility data. In Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing. ACM, 401--412.
    [5]
    Francesco Calabrese, Giusy Di Lorenzo, and Carlo Ratti. 2010. Human mobility prediction based on individual and collective geographical preferences. In Intelligent Transportation Systems (ITSC), 2010 13th International IEEE Conference on. IEEE, 312--317.
    [6]
    Luca Canzian and Mirco Musolesi. 2015. Trajectories of depression: unobtrusive monitoring of depressive states by means of smartphone mobility traces analysis. In Proceedings of the 2015 ACM international joint conference on pervasive and ubiquitous computing. ACM, 1293--1304.
    [7]
    Eunjoon Cho, Seth A Myers, and Jure Leskovec. 2011. Friendship and mobility: user movement in location-based social networks. In Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 1082--1090.
    [8]
    Yohan Chon, Hyojeong Shin, Elmurod Talipov, and Hojung Cha. 2012. Evaluating mobility models for temporal prediction with high-granularity mobility data. In Pervasive computing and communications (PerCom), 2012 IEEE international conference on. IEEE, 206--212.
    [9]
    Jacob Cohen. 1968. Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. Psychological bulletin 70, 4 (1968), 213.
    [10]
    Andrea Cuttone, Sune Lehmann, and Marta C González. 2018. Understanding Predictability and Exploration in Human Mobility. EPJ Data Science (2018).
    [11]
    Trinh Minh Tri Do and Daniel Gatica-Perez. 2012. Contextual conditional models for smartphone-based human mobility prediction. In Proceedings of the 2012 ACM conference on ubiquitous computing. ACM, 163--172.
    [12]
    Jie Feng, Yong Li, Chao Zhang, Funing Sun, Fanchao Meng, Ang Guo, and Depeng Jin. 2018. DeepMove: Predicting Human Mobility with Attentional Recurrent Networks. In Proceedings of the 2018 World Wide Web Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 1459--1468.
    [13]
    Sébastien Gambs, Marc-Olivier Killijian, and Miguel Núñez del Prado Cortez. 2010. Show me how you move and I will tell you who you are. In Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Security and Privacy in GIS and LBS. ACM, 34--41.
    [14]
    Sébastien Gambs, Marc-Olivier Killijian, and Miguel Núñez del Prado Cortez. 2012. Next place prediction using mobility markov chains. In Proceedings of the First Workshop on Measurement, Privacy, and Mobility. ACM, 3.
    [15]
    M. C. Gonzalez, C. A. Hidalgo, and A. L. Barabasi. 2008. Understanding individual human mobility patterns. Nature 453, 7196 (2008), 779--82.
    [16]
    Samiul Hasan, Christian M Schneider, Satish V Ukkusuri, and Marta C GonzÃąlez. 2013. Spatiotemporal patterns of urban human mobility. Journal of Statistical Physics 151, 1-2 (2013), 304--318.
    [17]
    Nabil Hossain, Tianran Hu, Roghayeh Feizi, Ann Marie White, Jiebo Luo, and Henry A Kautz. 2016. Precise Localization of Homes and Activities: Detecting Drinking-While-Tweeting Patterns in Communities. In ICWSM. 587--590.
    [18]
    Shenggong Ji, Yu Zheng, and Tianrui Li. 2016. Urban sensing based on human mobility. In Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing. ACM, 1040--1051.
    [19]
    David Jurgens, Tyler Finethy, James McCorriston, Yi Tian Xu, and Derek Ruths. 2015. Geolocation Prediction in Twitter Using Social Networks: A Critical Analysis and Review of Current Practice. ICWSM 15 (2015), 188--197.
    [20]
    Daniel Kahneman, Alan B Krueger, David A Schkade, Norbert Schwarz, and Arthur A Stone. 2004. A survey method for characterizing daily life experience: The day reconstruction method. Science 306, 5702 (2004), 1776--1780.
    [21]
    John Krumm and Eric Horvitz. 2006. Predestination: Inferring destinations from partial trajectories. UbiComp 2006: Ubiquitous Computing (2006), 243--260.
    [22]
    Yang Li, Dimitrios Gunopulos, Cewu Lu, and Leonidas Guibas. 2017. Urban Travel Time Prediction using a Small Number of GPS Floating Cars. In Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. ACM, 3.
    [23]
    L. Liao, D. J. Patterson, D. Fox, and H. Kautz. 2007. Learning and inferring transportation routines. Artificial Intelligence 171, 5-6 (2007), 311--331.
    [24]
    Todd Litman. 2003. Measuring transportation: traffic, mobility and accessibility. Institute of Transportation Engineers. ITE Journal 73, 10 (2003), 28.
    [25]
    Jiajun Liu, Kun Zhao, Saeed Khan, Mark Cameron, and Raja Jurdak. 2015. Multi-scale population and mobility estimation with geo-tagged tweets. In Data Engineering Workshops (ICDEW), 2015 31st IEEE International Conference on. IEEE, 83--86.
    [26]
    Liang Liu, Anyang Hou, Assaf Biderman, Carlo Ratti, and Jun Chen. 2009. Understanding individual and collective mobility patterns from smart card records: A case study in Shenzhen. In Intelligent Transportation Systems, 2009. ITSC'09. 12th International IEEE Conference On. IEEE, 1--6.
    [27]
    Qiang Liu, Shu Wu, Liang Wang, and Tieniu Tan. 2016. Predicting the Next Location: A Recurrent Model with Spatial and Temporal Contexts. In AAAI. 194--200.
    [28]
    James McInerney, Jiangchuan Zheng, Alex Rogers, and Nicholas R Jennings. 2013. Modelling heterogeneous location habits in human populations for location prediction under data sparsity. In Proceedings of the 2013 ACM international joint conference on Pervasive and ubiquitous computing. ACM, 469--478.
    [29]
    Anne Vernez Moudon, Chanam Lee, Allen D Cheadle, Cheza Garvin, Donna Johnson, Thomas L Schmid, Robert D Weathers, and Lin Lin. 2006. Operational definitions of walkable neighborhood: theoretical and empirical insights. Journal of Physical Activity & Health 3 (2006), S99.
    [30]
    Anastasios Noulas, Salvatore Scellato, Neal Lathia, and Cecilia Mascolo. 2012. Mining user mobility features for next place prediction in location-based services. In Data mining (ICDM), 2012 IEEE 12th international conference on. IEEE, 1038--1043.
    [31]
    Andrew Rundle, Kathryn M Neckerman, Lance Freeman, Gina S Lovasi, Marnie Purciel, James Quinn, Catherine Richards, Neelanjan Sircar, and Christopher Weiss. 2009. Neighborhood food environment and walkability predict obesity in New York City. Environmental health perspectives 117, 3 (2009), 442.
    [32]
    Adam Sadilek, Henry Kautz, and Jeffrey P Bigham. 2012. Finding your friends and following them to where you are. In Proceedings of the fifth ACM international conference on Web search and data mining. ACM, 723--732.
    [33]
    Adam Sadilek and John Krumm. 2012. Far Out: Predicting Long-Term Human Mobility. In AAAI.
    [34]
    S. Scellato, M. Musolesi, C. Mascolo, V. Latora, and A. T. Campbell. 2011. NextPlace: A Spatio-temporal Prediction Framework for Pervasive Systems. Pervasive Computing 6696 (2011), 152--169. <GotoISI>://WOS:000305891700010
    [35]
    C. M. Song, Z. H. Qu, N. Blumm, and A. L. Barabasi. 2010. Limits of Predictability in Human Mobility. Science 327, 5968 (2010), 1018--1021.
    [36]
    L. B. Song, D. Kotz, R. Jain, and X. N. He. 2006. Evaluating next-cell predictors with extensive Wi-Fi mobility data. Ieee Transactions on Mobile Computing 5, 12 (2006), 1633--1649. <GotoISI>://WOS:000241195600001
    [37]
    Roberto CSNP Souza, Renato M Assunção, Derick M de Oliveira, Denise EF de Brito, and Wagner Meira Jr. 2016. Infection Hot Spot Mining from Social Media Trajectories. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 739--755.
    [38]
    Dan Tasse, Zichen Liu, Alex Sciuto, and Jason I Hong. 2017. State of the Geotags: Motivations and Recent Changes. In ICWSM. 250--259.
    [39]
    A. Wesolowski, T. Qureshi, M. F. Boni, P. R. Sundsoy, M. A. Johansson, S. B. Rasheed, K. Engo-Monsen, and C. O. Buckee. 2015. Impact of human mobility on the emergence of dengue epidemics in Pakistan. Proc Natl Acad Sci U S A 112, 38 (2015), 11887--92.
    [40]
    Cheng Yang, Maosong Sun, Wayne Xin Zhao, Zhiyuan Liu, and Edward Y Chang. 2017. A Neural Network Approach to Jointly Modeling Social Networks and Mobile Trajectories. ACM Transactions on Information Systems (TOIS) 35, 4 (2017), 36.
    [41]
    A. J. Yu and J. D. Cohen. 2008. Sequential effects: Superstition or rational behavior? Adv Neural Inf Process Syst 21 (2008), 1873--1880. https://www.ncbi.nlm.nih.gov/ /26412953
    [42]
    Quan Yuan, Gao Cong, Zongyang Ma, Aixin Sun, and Nadia Magnenat Thalmann. 2013. Time-aware point-of-interest recommendation. In Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval. ACM, 363--372.
    [43]
    Jia-Dong Zhang, Chi-Yin Chow, and Yanhua Li. 2014. Lore: Exploiting sequential influence for location recommendations. In Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. ACM, 103--112.

    Cited By

    View all
    • (2021)Neighborhood level chronic respiratory disease prevalence estimation using search query dataPLOS ONE10.1371/journal.pone.025238316:6(e0252383)Online publication date: 9-Jun-2021
    • (2020)SCPPACM Transactions on Spatial Algorithms and Systems10.1145/34234057:1(1-30)Online publication date: 29-Oct-2020

    Index Terms

    1. Creating full individual-level location timelines from sparse social media data

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        SIGSPATIAL '18: Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
        November 2018
        655 pages
        ISBN:9781450358897
        DOI:10.1145/3274895
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 06 November 2018

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. social media
        2. sparse data
        3. spatial information and society

        Qualifiers

        • Research-article

        Funding Sources

        Conference

        SIGSPATIAL '18
        Sponsor:

        Acceptance Rates

        SIGSPATIAL '18 Paper Acceptance Rate 30 of 150 submissions, 20%;
        Overall Acceptance Rate 220 of 1,116 submissions, 20%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)26
        • Downloads (Last 6 weeks)2

        Other Metrics

        Citations

        Cited By

        View all
        • (2021)Neighborhood level chronic respiratory disease prevalence estimation using search query dataPLOS ONE10.1371/journal.pone.025238316:6(e0252383)Online publication date: 9-Jun-2021
        • (2020)SCPPACM Transactions on Spatial Algorithms and Systems10.1145/34234057:1(1-30)Online publication date: 29-Oct-2020

        View Options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Get Access

        Login options

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media