Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2684822.2685296acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

On the Accuracy of Hyper-local Geotagging of Social Media Content

Published: 02 February 2015 Publication History

Abstract

Social media users share billions of items per year, only a small fraction of which is geotagged. We present a data-driven approach for identifying non-geotagged content items that can be associated with a hyper-local geographic area by modeling the location distributions of n-grams that appear in the text. We explore the trade-off between accuracy and coverage of this method. Further, we explore differences across content received from multiple platforms and devices, and show, for example, that content shared via different sources and applications produces significantly different geographic distributions, and that it is preferred to model and predict location for items according to their source. Our findings show the potential and the bounds of a data-driven approach to assigning location data to short social media texts, and offer implications for all applications that use data-driven approaches to locate content.

References

[1]
S. Ahern, M. Naaman, R. Nair, and J. H.-I. Yang. World explorer. In Proc. of the Joint Conf. on Digital Libraries (JCDL), 2007.
[2]
S. Chandra, L. Khan, and F. B. Muhaya. Estimating twitter user location using social interactions--a content based approach. In Proc. of the IEEE Conf. on Privacy Security, Risk and Trust (PASSAT) and IEEE Conf. on Social Computing (SocialCom), 2011.
[3]
Z. Cheng, J. Caverlee, and K. Lee. You are where you tweet. In Proc. of the ACM Conf. on Information and Knowledge Management (CIKM), 2010.
[4]
R. Compton, D. Jurgens, and D. Allen. Geotagging one hundred million Twitter accounts with total variation minimization. Available From CoRR, abs/1404.7152, 2014.
[5]
D. J. Crandall, L. Backstrom, D. Huttenlocher, and J. Kleinberg. Mapping the world's photos. In Proc. of the International Conf. on World Wide Web (WWW), 2009.
[6]
N. Dalvi, R. Kumar, and B. Pang. Object matching in tweets with spatial models. In Proc. of the ACM Conf. on Web Search and Data Mining (WSDM), 2012.
[7]
J. Eisenstein, A. Ahmed, and E. P. Xing. Sparse additive generative models of text. In Proc. of the International Conf. on Machine Learning, (ICML), 2011.
[8]
I. Grabovitch-Zuyev, Y. Kanza, E. Kravi, and B. Pat. On the correlation between textual content and geospatial locations in microblogs. In Proc. of the ACM Workshop on Managing and Mining Enriched Geo-Spatial Data (GeoRich), 2014.
[9]
B. Hecht and M. Stephens. A tale of cities: Urban biases in volunteered geographic information. In Proc. of the AAAI Conf. on Weblogs and Social Media (ICWSM), 2014.
[10]
L. Hong, A. Ahmed, S. Gurumurthy, A. J. Smola, and K. Tsioutsiouliklis. Discovering geographical topics in the twitter stream. In Proc. of the International Conf. on World Wide Web (WWW), 2012.
[11]
L. S. Kennedy and M. Naaman. Generating diverse and representative image search results for landmarks. In Proc. of the International Conf. on World Wide Web (WWW),2008.
[12]
S. Kinsella, V. Murdock, and N. O'Hare. I'm eating a sandwich in Glasgow. In Proc. of the ACM International Workshop on Search and Mining User-Generated Contents (SMUC), 2011.
[13]
C. C. Kling, J. Kunegis, S. Sizov, and S. Staab. Detecting non-gaussian geographical topics in tagged photo collections. In Proc. of the 7th ACM International Conf. on Web Search and Data Mining (WSDM), 2014.
[14]
K. Leetaru, S. Wang, A. Padmanabhan, and E. Shook. Mapping the global Twitter heartbeat: The geography of Twitter. First Monday, 18(5), May 2013.
[15]
C. Li and A. Sun. Fine-grained location extraction from tweets with temporal awareness. In Proc. of the ACM Conf. on Research & Development in Information Retrieval (SIGIR), 2014.
[16]
D. Lian and X. Xie. Mining check-in history for personalized location naming. ACM Transactions on Intelligent Systems and Technology (TIST), 5(2):32, 2014.
[17]
J. Mahmud, J. Nichols, and C. Drews. Home location identification of Twitter users. Available from CoRR, abs/1403.2345, 2014.
[18]
J. N. Matias and A. Monroy-Hernandez. NewsPad: Designing for collaborative storytelling in neighborhoods. In CHI'14 Extended Abstracts on Human Factors in Computing Systems, 2014.
[19]
R. Priedhorsky, A. Culotta, and S. Y. Del Valle. Inferring the origin locations of tweets with quantitative confidence. In Proc. of the ACM Conf. on Computer Supported Cooperative Work (CSCW), 2014.
[20]
D. Quercia, J. Ellis, L. Capra, and J. Crowcroft. Tracking gross community happiness from tweets. In Proc. of the ACM Conf. on Computer Supported Cooperative Work (CSCW), 2012.
[21]
D. Quercia, R. Schifanella, and L. M. Aiello. The shortest path to happiness: Recommending beautiful, quiet, and happy routes in the city. In Proc. of the ACM Conf. on Hypertext and Social Media, 2014.
[22]
T. Rattenbury, N. Good, and M. Naaman. Towards automatic extraction of event and place semantics from Flickr tags. In Proc. of the ACM Conf. on Research and Development in Information Retrieval (SIGIR), 2007.
[23]
S. Roller, M. Speriosu, S. Rallapalli, B. Wing, and J. Baldridge. Supervised text-based geolocation using language models on an adaptive grid. In Proc. of the Joint Conf. on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), 2012.
[24]
A. Sadilek, H. A. Kautz, and V. Silenzio. Modeling spread of disease from social interactions. In AAAI Conf. on Weblogs and Social Media (ICWSM), 2012.
[25]
A. Schulz, A. Hadjakos, H. Paulheim, J. Nachtwey, and M. Muhlhauser. A multi-indicator approach for geolocalization of tweets. In AAAI Conf. on Weblogs and Social Media (ICWSM), 2013.
[26]
P. Serdyukov, V. Murdock, and R. van Zwol. Placing Flickr photos on a map. In Proc. of the ACM Conf. on Research and Development in Information Retrieval (SIGIR), 2009.
[27]
B. Shaw, J. Shea, S. Sinha, and A. Hogue. Learning to rank for spatistemporal search. In Proc. of the ACM Conf. on Web Search and Data Mining (WSDM) 2013.
[28]
D. Tasse and J. I. Hong. Using social media data to understand cities. Technical report, Carnegie Mellon University, 2014.
[29]
B. Thomee and A. Rae. Uncovering locally characterizing regions within geotagged data. In Proc. of the International Conf. on World Wide Web (WWW), 2013.
[30]
B. Thomee, G. De Francisci Morales. Automatic discovery of global and local equivalence relationships in labeled geo-spatial data In Proc. of the ACM Conf. on Hypertext and Social Media, 2014.
[31]
B. Thomee, J.G. Moreno, D.A. Shamma. Who's time is it anyway?: investigating the accuracy of camera timestamps In Proc. of the ACM Conf. on Multimedia, 2014.
[32]
H. wen Chang, D. Lee, M. Eltaher, and J. Lee. @Phillies Tweeting from Philly? Predicting Twitter user locations with spatial word usage. In IEEE/ACM International Conf. on Advances in Social Networks Analysis and Mining (ASONAM), 2012.
[33]
B. P. Wing and J. Baldridge. Simple supervised document geolocation with geodesic grids. In Proc. of the Annual Meeting of the Assoc. for Computational Linguistics: Human Language Technologies (HLT), 2011.
[34]
C. Xia, R. Schwartz, K. Xie, A. Krebs, A. Langdon, J. Ting, and M. Naaman. CityBeat: Real-time social media visualization of hyper-local city data. In Companion Publication of the 23rd International Conf. on World Wide Web (WWW), 2014.

Cited By

View all
  • (2024)A Geospatial Perspective on Data Ownership, the Right to be Forgotten, Copyrights, and Plagiarism in Generative AIProceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems10.1145/3678717.3691269(477-480)Online publication date: 29-Oct-2024
  • (2024)Understanding the impact of geotagging on location inference models for accurate generalization to non-geotagged datasetsGeomatica10.1016/j.geomat.2024.10000476:1(100004)Online publication date: Jul-2024
  • (2023)Predicting Location of Tweets Using Machine Learning ApproachesApplied Sciences10.3390/app1305302513:5(3025)Online publication date: 26-Feb-2023
  • Show More Cited By

Index Terms

  1. On the Accuracy of Hyper-local Geotagging of Social Media Content

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      WSDM '15: Proceedings of the Eighth ACM International Conference on Web Search and Data Mining
      February 2015
      482 pages
      ISBN:9781450333177
      DOI:10.1145/2684822
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 02 February 2015

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. geotagging
      2. location-based services
      3. social media

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      WSDM 2015

      Acceptance Rates

      WSDM '15 Paper Acceptance Rate 39 of 238 submissions, 16%;
      Overall Acceptance Rate 498 of 2,863 submissions, 17%

      Upcoming Conference

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)28
      • Downloads (Last 6 weeks)6
      Reflects downloads up to 26 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)A Geospatial Perspective on Data Ownership, the Right to be Forgotten, Copyrights, and Plagiarism in Generative AIProceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems10.1145/3678717.3691269(477-480)Online publication date: 29-Oct-2024
      • (2024)Understanding the impact of geotagging on location inference models for accurate generalization to non-geotagged datasetsGeomatica10.1016/j.geomat.2024.10000476:1(100004)Online publication date: Jul-2024
      • (2023)Predicting Location of Tweets Using Machine Learning ApproachesApplied Sciences10.3390/app1305302513:5(3025)Online publication date: 26-Feb-2023
      • (2023)LocBERT: Improving Social Media User Location Prediction Using Fine-Tuned BERTDatabase and Expert Systems Applications - DEXA 2023 Workshops10.1007/978-3-031-39689-2_3(23-32)Online publication date: 21-Aug-2023
      • (2022)Towards the Inference of Travel Purpose with Heterogeneous Urban DataIEEE Transactions on Big Data10.1109/TBDATA.2019.29218238:1(166-177)Online publication date: 1-Feb-2022
      • (2022)Twitter Location Prediction usnig Machine Learning Algorithms2022 International Interdisciplinary Humanitarian Conference for Sustainability (IIHC)10.1109/IIHC55949.2022.10060688(1066-1070)Online publication date: 18-Nov-2022
      • (2022)Geo-based recommendation system utilising geo tagging and K-means clusteringSpatial Information Research10.1007/s41324-022-00495-w31:3(253-263)Online publication date: 16-Nov-2022
      • (2022)Individual mobility pattern in Malaysia during COVID‐19 Recovery Movement Control Order partial lockdownGeo: Geography and Environment10.1002/geo2.1139:1Online publication date: 20-Jun-2022
      • (2021)Understanding Natural Disaster Scenes from Mobile Images Using Deep LearningApplied Sciences10.3390/app1109395211:9(3952)Online publication date: 27-Apr-2021
      • (2021)Location Classification Based on TweetsProceedings of the 4th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery10.1145/3486635.3491075(51-60)Online publication date: 2-Nov-2021
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media