Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2442796.2442805acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

Multimodal geo-tagging in social media websites using hierarchical spatial segmentation

Published: 06 November 2012 Publication History

Abstract

These days the sharing of photographs and videos is very popular in social networks. Many of these social media websites such as Flickr, Facebook and Youtube allows the user to manually label their uploaded videos with geo-information using a interface for dragging them into the map. However, the manually labelling for a large set of social media is still borring and error-prone. For this reason we present a hierarchical, multi-modal approach for estimating the GPS information. Our approach makes use of external resources like gazetteers to extract toponyms in the metadata and of visual and textual features to identify similar content. First, the national borders detection recognizes the country and its dimension to speed up the estimation and to eliminate geographical ambiguity. Next, we use a database of more than 3.2 million Flickr images to group them together into geographical regions and to build a hierarchical model. A fusion of visual and textual methods for different granularities is used to classify the videos' location into possible regions. The Flickr videos are tagged with the geo-information of the most similar training image within the regions that is previously filtered by the probabilistic model for each test video. In comparison with existing GPS estimation and image retrieval approaches at the Placing Task 2011 we will show the effectiveness and high accuracy relative to the state-of-the art solutions.

References

[1]
http://translate.google.com.
[2]
http://www.geonames.org.
[3]
http://www.wikipedia.org.
[4]
http://code.google.com/apis/maps/index.html.
[5]
S. Agarwal, N. Snavely, I. Simon, S. Seitz, and R. Szeliski. Building rome in a day. In Computer Vision, 2009 IEEE 12th International Conference on.
[6]
E. Albuz, E. Kocalar, and A. Khokhar. Scalable color image indexing and retrieval using vector wavelets. Knowledge and Data Engineering, IEEE Transactions on, 13(5):851--861, 2001.
[7]
J. Baldridge. The OpenNLP Project. http://www.opennlp.com, 2005.
[8]
S. Chatzichristofis and Y. Boutalis. Cedd: Color and edge directivity descriptor: A compact descriptor for image indexing and retrieval. Computer Vision Systems, pages 312--322, 2008.
[9]
J. Choi, G. Friedland, V. Ekambaram, and K. Ramchandran. Multimodal location estimation of consumer media: Dealing with sparse training data. In proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2012), Melbourne, Australia, 2012.
[10]
D. Crandall, L. Backstrom, D. Huttenlocher, and J. Kleinberg. Mapping the World's Photos. In Proceedings of the 18th international conference on World wide web, pages 761--770. ACM, 2009.
[11]
H. Feichtinger and T. Strohmer. Gabor analysis and algorithms: Theory and applications. Birkhauser, 1998.
[12]
J. Hays and A. Efros. Im2gps: estimating geographic information from a single image. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, pages 1--8. Ieee, 2008.
[13]
J. Huang, S. Kumar, M. Mitra, W. Zhu, and R. Zabih. Image indexing using color correlograms. In Computer Vision and Pattern Recognition, 1997. Proceedings., 1997 IEEE Computer Society Conference on, pages 762--768. IEEE, 1997.
[14]
P. Kelm, S. Schmiedeke, and T. Sikora. A hierarchical, multi-modal approach for placing videos on the map using millions of flickr photographs. In ACM Multimedia 2011 (Workshop on Social and Behavioral Networked Media Access - SBNMA). ACM, Nov. 2011.
[15]
P. Kelm, S. Schmiedeke, and T. Sikora. Multi-modal, multi-resource methods for placing Flickr videos on the map. In Proceedings of the 1st ACM International Conference on Multimedia Retrieval, ICMR '11, New York, NY, USA, 2011. ACM.
[16]
C. Keßler, K. Janowicz, and M. Bishr. An agenda for the next generation gazetteer: Geographic information contribution and retrieval. In Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pages 91--100. ACM, 2009.
[17]
M. Lux and S. Chatzichristofis. Lire: lucene image retrieval: an extensible java cbir library. In Proceeding of the 16th ACM international conference on Multimedia, pages 1085--1088. ACM, 2008.
[18]
B. Manjunath, J. Ohm, V. Vasudevan, and A. Yamada. Color and texture descriptors. Circuits and Systems for Video Technology, IEEE Transactions on, 11(6):703--715, 2001.
[19]
O. A. B. Penatti, L. T. Li, J. Almeida, and R. da S. Torres. A visual approach for video geocoding using bag-of-scenes. In Proceedings of the 2nd ACM International Conference on Multimedia Retrieval, ICMR '12, pages 53:1--53:8, New York, NY, USA, 2012. ACM.
[20]
X. Sevillano, T. Piatrik, K. Chandramouli, Q. Zhang, and E. Izquierdoy. Geo-tagging online videos using semantic expansion and visual analysis. In Image Analysis for Multimedia Interactive Services (WIAMIS), 2012 13th International Workshop on, pages 1--4. IEEE, 2012.
[21]
I. Simon, N. Snavely, and S. Seitz. Scene summarization for online image collections. In Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on, pages 1--8. IEEE, 2007.
[22]
P. Smart, C. Jones, and F. Twaroch. Multi-source toponym data integration and mediation for a meta-gazetteer service. In Geographic Information Science, Lecture Notes in Computer Science.
[23]
H. Tamura, S. Mori, and T. Yamawaki. Textural features corresponding to visual perception. Systems, Man and Cybernetics, IEEE Transactions on, 8(6):460--473, 1978.

Cited By

View all
  • (2021)Enriching videos with automatic place recognition in google mapsMultimedia Tools and Applications10.1007/s11042-021-11253-981:16(23105-23121)Online publication date: 29-Jul-2021
  • (2015)Exploiting Spatial Relationship between Scenes for Hierarchical Video GeotaggingProceedings of the 5th ACM on International Conference on Multimedia Retrieval10.1145/2671188.2749354(363-370)Online publication date: 22-Jun-2015
  • (2014)Georeferencing Flickr Resources Based on Multimodal FeaturesMultimodal Location Estimation of Videos and Images10.1007/978-3-319-09861-6_8(127-152)Online publication date: 5-Oct-2014
  • Show More Cited By

Index Terms

  1. Multimodal geo-tagging in social media websites using hierarchical spatial segmentation

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    LBSN '12: Proceedings of the 5th ACM SIGSPATIAL International Workshop on Location-Based Social Networks
    November 2012
    67 pages
    ISBN:9781450316989
    DOI:10.1145/2442796
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 06 November 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. geotagging
    2. hierarchical segmentation
    3. placing task

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    SIGSPATIAL'12
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 8 of 15 submissions, 53%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 03 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Enriching videos with automatic place recognition in google mapsMultimedia Tools and Applications10.1007/s11042-021-11253-981:16(23105-23121)Online publication date: 29-Jul-2021
    • (2015)Exploiting Spatial Relationship between Scenes for Hierarchical Video GeotaggingProceedings of the 5th ACM on International Conference on Multimedia Retrieval10.1145/2671188.2749354(363-370)Online publication date: 22-Jun-2015
    • (2014)Georeferencing Flickr Resources Based on Multimodal FeaturesMultimodal Location Estimation of Videos and Images10.1007/978-3-319-09861-6_8(127-152)Online publication date: 5-Oct-2014
    • (2013)DCT-based features for categorisation of social media in compressed domain2013 IEEE 15th International Workshop on Multimedia Signal Processing (MMSP)10.1109/MMSP.2013.6659304(295-300)Online publication date: Sep-2013

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media