Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Using Online Geotagged and Crowdsourced Data to Understand Human Offline Behavior in the City: An Economic Perspective

Published: 11 December 2017 Publication History
  • Get Citation Alerts
  • Abstract

    The pervasiveness of mobile technologies today has facilitated the creation of massive online crowdsourced and geotagged data from individual users at different locations in a city. Such ubiquitous user-generated data allow us to study the social and behavioral trajectories of individuals across both digital and physical environments. This information, combined with traditional economic and behavioral indicators in the city (e.g., store purchases, restaurant visits, parking), can help us better understand human behavior and interactions with cities. In this study, we take an economic perspective and focus on understanding human economic behavior in the city by examining the performance of local businesses based on the values learned from crowsourced and geotagged data. Specifically, we extract multiple traffic and human mobility features from publicly available data source geomapping and geo-social-tagging techniques and examine the effects of both static and dynamic features on booking volume of local restaurants. Our study is instantiated on a unique dataset of restaurant bookings from OpenTable for 3,187 restaurants in New York City from November 2013 to March 2014. Our results suggest that foot traffic can increase local popularity and business performance, while mobility and traffic from automobiles may hurt local businesses, especially the well-established chains and high-end restaurants. We also find that, on average, one or more street closure (caused by events or construction projects) nearby leads to a 4.7% decrease in the probability of a restaurant being fully booked during the dinner peak. Our study demonstrates the potential to best make use of the large volumes and diverse sources of crowdsourced and geotagged user-generated data to create matrices to predict local economic demand in a manner that is fast, cheap, accurate, and meaningful.

    References

    [1]
    Michael Anderson and Jeremy Magruder. 2012. Learning from the crowd: Regression discontinuity estimates of the effects of an online review database. The Economic Journal 122, 563 (2012), 957--989.
    [2]
    Joshua Angrist and Alan B. Krueger. 2001. Instrumental Variables and the Search for Identification: From Supply and Demand to Natural Experiments. Technical Report. National Bureau of Economic Research.
    [3]
    Nikolay Archak, Anindya Ghose, and Panagiotis G. Ipeirotis. 2011. Deriving the pricing power of product features by mining consumer reviews. Management Science 57, 8 (2011), 1485--1509.
    [4]
    Peter C. Austin. 2011. An introduction to propensity score methods for reducing the effects of confounding in observational studies. Multivariate Behavioral Research 46, 3 (2011), 399--424.
    [5]
    Glenn C. Blomquist, Mark C. Berger, and John P. Hoehn. 1988. New estimates of quality of life in urban areas. The American Economic Review (1988), 89--107.
    [6]
    Erik Brynjolfsson, Yu Hu, and Michael D. Smith. 2003. Consumer surplus in the digital economy: Estimating the value of increased product variety at online booksellers. Management Science 49, 11 (2003), 1580--1596.
    [7]
    BusinessWeek. 2013. It costs $333 million to shut down Boston for a day. Retrieved from www.businessweek.com/articles/2013-04-19/it-costs-333-million-to-shut-down-boston-for-a-day.
    [8]
    Peter Calthorpe. 1993. The Next American Metropolis: Ecology, Community, and the American Dream. Princeton Architectural Press.
    [9]
    Zhiyuan Cheng, James Caverlee, Kyumin Lee, and Daniel Z. Sui. 2011. Exploring millions of footprints in location sharing services. ICWSM 2011 (2011), 81--88.
    [10]
    Paul C. Cheshire and Stefano Magrini. 2006. Population growth in European cities: Weather matters--but only nationally. Regional Studies 40, 1 (2006), 23--37.
    [11]
    Judith A. Chevalier and Dina Mayzlin. 2006. The effect of word of mouth on sales: Online book reviews. Journal of Marketing Research 43, 3 (2006), 345--354.
    [12]
    Eunjoon Cho, Seth A. Myers, and Jure Leskovec. 2011. Friendship and mobility: User movement in location-based social networks. In Proceedings of the 17th ACM SIGKDD. ACM, 1082--1090.
    [13]
    Thomas M. Cover and Joy A. Thomas. 2012. Elements of Information Theory. John Wiley 8 Sons.
    [14]
    Brett Danaher and Michael D. Smith. 2014. Gone in 60 seconds: The impact of the Megaupload shutdown on movie sales. International Journal of Industrial Organization 33 (2014), 1--8.
    [15]
    Reid Ewing and Susan Handy. 2009. Measuring the unmeasurable: Urban design qualities related to walkability. Journal of Urban Design 14, 1 (2009), 65--84.
    [16]
    Richard Florida. 2002. The economic geography of talent. Annals of the Association of American Geographers 92, 4 (2002), 743--755.
    [17]
    Chris Forman, Anindya Ghose, and Avi Goldfarb. 2009. Competition between local and electronic markets: How the benefit of buying online depends on where you live. Management Science 55, 1 (2009), 47--57.
    [18]
    Chris Forman, Avi Goldfarb, and Shane Greenstein. 2005. How did location affect adoption of the commercial internet? Global village vs. urban leadership. Journal of Urban Economics 58, 3 (2005), 389--420.
    [19]
    Yanjie Fu, Yong Ge, Yu Zheng, Zijun Yao, Yanchi Liu, Hui Xiong, and Jing Yuan. 2014. Sparse real estate ranking with online user reviews and offline moving behaviors. In Proceedings of the 2014 IEEE International Conference on Data Mining (ICDM), IEEE, 120--129.
    [20]
    Yanjie Fu, Hui Xiong, Yong Ge, Zijun Yao, Yu Zheng, and Zhi-Hua Zhou. 2014. Exploiting geographic dependencies for real estate appraisal: A mutual perspective of ranking and clustering. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1047--1056.
    [21]
    Sarah Jean Fusco, Katina Michael, and M. G. Michael. 2010. Using a social informatics framework to study the effects of location-based social networking on relationships between people: A review of literature. In Proceedings of the 2010 IEEE International Symposium on Technology and Society (ISTAS). IEEE, 157--171.
    [22]
    Anindya Ghose, Panagiotis G. Ipeirotis, and Beibei Li. 2012. Designing ranking systems for hotels on travel search engines by mining user-generated and crowdsourced content. Marketing Science 31, 3 (2012), 493--520.
    [23]
    Jerry A. Hausman. 1978. Specification tests in econometrics. Econometrica: Journal of the Econometric Society (1978), 1251--1271.
    [24]
    Jerry A. Hausman. 1996. Valuation of new goods under perfect and imperfect competition. In The Economics of New Goods. University of Chicago Press, 207--248.
    [25]
    Ting Hua, Feng Chen, Liang Zhao, Chang-Tien Lu, and Naren Ramakrishnan. 2013. STED: Semi-supervised targeted-interest event detectionin in twitter. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1466--1469.
    [26]
    Dmytro Karamshuk, Anastasios Noulas, Salvatore Scellato, Vincenzo Nicosia, and Cecilia Mascolo. 2013. Geo-spotting: Mining online location-based services for optimal retail store placement. In Proceedings of the 19th ACM SIGKDD. ACM, 793--801.
    [27]
    Kevin J. Krizek. 2003. Operationalizing neighborhood accessibility for land use-travel behavior research and regional modeling. Journal of Planning Education and Research 22, 3 (2003), 270--287.
    [28]
    Kun Kuang, Meng Jiang, Peng Cui, and Shiqiang Yang. 2016. Steering social media promotions with effective strategies. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM.
    [29]
    Dionysia Lambiri, Bianca Biagi, and Vicente Royuela. 2007. Quality of life in the economic and urban economic literature. Social Indicators Research 84, 1 (2007), 1--25.
    [30]
    Nishtha Langer, Chris Forman, Sunder Kekre, and Baohong Sun. 2012. Ushering buyers into electronic channels: An empirical analysis. Information Systems Research 23, 4 (2012), 1212--1231.
    [31]
    Zhenhui Li, Fei Wu, and Margaret C. Crofoot. 2013. Mining following relationships in movement data. In Proceedings of the 2013 IEEE 13th International Conference on Data Mining. IEEE, 458--467.
    [32]
    Janne Lindqvist, Justin Cranshaw, Jason Wiese, Jason Hong, and John Zimmerman. 2011. I’m the mayor of my house: Examining why people use foursquare-a social-driven location sharing application. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. ACM, 2409--2418.
    [33]
    Todd Litman. 2003. Economic value of walkability. Transportation Research Record: Journal of the Transportation Research Board 1828 (2003), 3--11.
    [34]
    Xianghua Lu, Sulin Ba, Lihua Huang, and Yue Feng. 2013. Promotional marketing or word-of-mouth? Evidence from online restaurant reviews. Information Systems Research 24, 3 (2013), 596--612.
    [35]
    Eugene J. McCann. 2004. ‘Best places’: Interurban competition, quality of life and popular media discourse. Urban Studies 41, 10 (2004), 1909--1929.
    [36]
    Aviv Nevo. 2001. Measuring market power in the ready-to-eat cereal industry. Econometrica 69, 2 (2001), 307--342.
    [37]
    Anastasios Noulas, Salvatore Scellato, Neal Lathia, and Cecilia Mascolo. 2012. Mining user mobility features for next place prediction in location-based services. In Proceedings of the 2012 IEEE 12th International Conference on Data Mining (ICDM). IEEE, 1038--1043.
    [38]
    Deborah N. Peikes, Lorenzo Moreno, and Sean Michael Orzol. 2008. Propensity score matching. The American Statistician 62, 3 (2008), 222–231.
    [39]
    Olga Perdikaki, Saravanan Kesavan, and Jayashankar M. Swaminathan. 2012. Effect of traffic on sales and conversion rates of retail stores. Manufacturing 8 Service Operations Management 14, 1 (2012), 145--162.
    [40]
    Jennifer Roback. 1982. Wages, rents, and the quality of life. The Journal of Political Economy (1982), 1257--1278.
    [41]
    Daniel M. Romero, Chenhao Tan, and Johan Ugander. 2013. On the interplay between social and topical structure. In Proceedings of ICWSM.
    [42]
    Sherwin Rosen. 1979. Wage-based indexes of urban quality of life. Current Issues in Urban Economics 3 (1979), 324–345.
    [43]
    Andrew Rundle, Kathryn M. Neckerman, Lance Freeman, Gina S. Lovasi, Marnie Purciel, James Quinn, Catherine Richards, Neelanjan Sircar, Christopher Weiss, and others. 2009. Neighborhood food environment and walkability predict obesity in New York City. Environmental Health Perspectives 117, 3 (2009), 442--447.
    [44]
    Brian E. Saelens, James F. Sallis, and Lawrence D. Frank. 2003. Environmental correlates of walking and cycling: Findings from the transportation, urban design, and planning literatures. Annals of Behavioral Medicine 25, 2 (2003), 80--91.
    [45]
    Takeshi Sakaki, Makoto Okazaki, and Yutaka Matsuo. 2010. Earthquake shakes Twitter users: Real-time event detection by social sensors. In Proceedings of the 19th International Conference on World Wide Web. 851--860.
    [46]
    Salvatore Scellato, Anastasios Noulas, Renaud Lambiotte, and Cecilia Mascolo. 2011. Socio-spatial properties of online location-based social networks. ICWSM 11 (2011), 329--336.
    [47]
    Mark Edward Stover and Charles L. Leven. 1992. Methodological issues in the determination of the quality of life in urban areas. Urban Studies 29, 5 (1992), 737--754.
    [48]
    Catherine Tucker and Juanjuan Zhang. 2011. How does popularity information affect choices? A field experiment. Management Science 57, 5 (2011), 828--842.
    [49]
    J. Miguel Villas-Boas and Russell S. Winer. 1999. Endogeneity in brand choice models. Management Science 45, 10 (1999), 1324--1338.
    [50]
    Xun Wang, Feida Zhu, Jing Jiang, and Sujian Li. 2013. Real time event detection in twitter. In International Conference on Web-Age Information Management. Springer, 502--513.
    [51]
    Asha Weinstein Agrawal, Marc Schlossberg, and Katja Irvin. 2008. How far, by which route and why? A spatial analysis of pedestrian preference. Journal of Urban Design 13, 1 (2008), 81--98.
    [52]
    Inge M. Wetzer, Marcel Zeelenberg, and Rik Pieters. 2007. Never eat in that restaurant, I did! Exploring why people engage in negative word-of-mouth communication. Psychology 8 Marketing 24, 8 (2007), 661--680.
    [53]
    Chao Zhang, Guangyu Zhou, Quan Yuan, Honglei Zhuang, Yu Zheng, Lance Kaplan, Shaowen Wang, and Jiawei Han. 2016. GeoBurst: Real-time local event detection in geo-tagged tweet streams. SIGIR 2016.
    [54]
    Yu Zheng, Licia Capra, Ouri Wolfson, and Hai Yang. 2014. Urban computing: Concepts, methodologies, and applications. ACM Transactions on Intelligent Systems and Technology (TIST) 5, 3 (2014), 38.
    [55]
    Yu Zheng, Huichu Zhang, and Yong Yu. 2015. Detecting collective anomalies from multiple spatio-temporal datasets across different domains. In Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems. ACM, 2.

    Cited By

    View all

    Index Terms

    1. Using Online Geotagged and Crowdsourced Data to Understand Human Offline Behavior in the City: An Economic Perspective

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Transactions on Intelligent Systems and Technology
        ACM Transactions on Intelligent Systems and Technology  Volume 9, Issue 3
        Regular Papers and Special Issue: Urban Intelligence
        May 2018
        370 pages
        ISSN:2157-6904
        EISSN:2157-6912
        DOI:10.1145/3167125
        • Editor:
        • Yu Zheng
        Issue’s Table of Contents
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 11 December 2017
        Accepted: 01 March 2017
        Revised: 01 February 2017
        Received: 01 November 2016
        Published in TIST Volume 9, Issue 3

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Geotagged social media
        2. city demand
        3. crowdsourced user behavior
        4. econometric analysis
        5. econometrics
        6. location-based service
        7. mobility analytic

        Qualifiers

        • Research-article
        • Research
        • Refereed

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)14
        • Downloads (Last 6 weeks)3
        Reflects downloads up to 26 Jul 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2022)Usage Mining of the London Santander Bike-Sharing SystemComputer10.1109/MC.2022.319330855:12(98-108)Online publication date: Dec-2022
        • (2021)FNetSecurity and Communication Networks10.1155/2021/53957052021Online publication date: 1-Jan-2021
        • (2021)SNR: Squeezing Numerical Range Defuses Bit Error Vulnerability Surface in Deep Neural NetworksACM Transactions on Embedded Computing Systems10.1145/347700720:5s(1-25)Online publication date: 23-Sep-2021
        • (2021)Mondegreen: A Post-Processing Solution to Speech Recognition Error Correction for Voice Search QueriesProceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining10.1145/3447548.3467156(3569-3575)Online publication date: 14-Aug-2021
        • (2020)Dense Attention Fluid Network for Salient Object Detection in Optical Remote Sensing ImagesIEEE Transactions on Image Processing10.1109/TIP.2020.304208430(1305-1317)Online publication date: 22-Dec-2020
        • (undefined)The 'Glocal' Challenge: Impact of Culture Differences on Sharing Economy – An Empirical AnalysisSSRN Electronic Journal10.2139/ssrn.3278207

        View Options

        Get Access

        Login options

        Full Access

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media