Abstract
While the geographical tag has brought a novel insight into the multimedia content analysis and understanding, how to improve the tagging accuracy has been rarely exploited. In this paper, we present a novel geographical retagging algorithm to improve the inaccurate geographical tags from an automatic photo content based association and refinement perspective. We do not resort to the time-consuming camera pose estimation and scene geometry recovery schemes like structure-from-motion. Instead, our algorithm is deployed based on a very simple neighbor statistical significance test, i.e., geographically nearby images, if near duplicate, should follow a more smooth affine transform comparing with those farther aways. Such an assumption is robust to noisy photo contents caused by multiple factors, such as indoor/outdoor changes, occlusions, or viewing angle changes. It is also very fast comparing to alternative approaches like structure-from-motion or simultaneous localization and matching. We have shown the accuracy, efficiency, and robustness of the proposed retagging algorithm for refining the geographical tags of Flickr images.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Kennedy, L., Naaman, M., Ahern, S.: How flickr helps us make sense of the world: context and content in community contributed media collections. ACM Multimedia (2007) 1,3
Cao, L.-L., Yu, J., Luo, J., Huang, T.S.: Enhancing Semantic and Geographic Annotation of Web Images via Logistic canonical correlation regression. ACM Multimedia (2009) 3
Ji, R., Xie, X., Yao, H., Ma, W.-Y.: Mining city landmarks from blogs by graph modeling. ACM Multimedia (2009) 3, 4
Zheng, Y.-T., Zhao, M., Song, Y., Adam, H.: Tour the world: building a web-scale landmark recognition engine. In: CVPR (2009) 1, 2, 4
Irschara, A., Zach, C., Frahm, J., Bischof, H.: From structure-from-motion point clouds to fast location recognition. In: CVPR (2009) 2, 3, 4
Xiao, J., Chen, J., Yeung, D.-Y., Quan, L.: Structuring Visual Words in 3D for Arbitrary-View Object Localization. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part III. LNCS, vol. 5304, pp. 725–737. Springer, Heidelberg (2008) 4
Jia, M., Fan, X., Xie, X., Li, M., Ma, W.-Y.: Photo-to-search: Using camera phones to inquire of the surrounding world. In: MDM (2006) 4
Ji, R., Yao, H., Wang, J., Xu, P., Liu, X.: Clustering-based subspace SVM ensemble for relevance feedback learning. In: ICME (2008) 4
Ji, R., Xie, X., Yao, H., Wu, Y., Ma, W.-Y.: Vocabulary tree incremental indexing for scalable location recognition. In: ICME (2008) 4
Wang, M., Ni, B., Hua, X.-S., Chua, T.-S.: Assistive tagging: A survey of multimedia tagging with human-computer joint exploration. ACM Computing Surveys (2012) 4
Wang, M., Hong, R., Li, G., Zha, Z.-J., Yan, S., Chua, T.-S.: Event driven Web Video Summarization by Tag Localization and Key-Shot Identification. TIP (2012) 4
Wang, M., Hong, R., Yuan, X.-T., Yan, S., Chua, T.-S.: Movie2Comics: Towards a Lively Video Content Presentation. TMM (2012) 4
Ji, R., Duan, L.-Y., Chen, J., Yao, H., Yuan, J., Rui, Y., Gao, W.: Location Discriminative Vocabulary Coding for Mobile Landmark Search. IJCV (2012) 4
Ji, R., Yao, H., Liu, W., Sun, X., Tian, Q.: Task Dependent Visual Codebook Compression. TIP (2012) 4
Ji, R., Duan, L.-Y., Yao, H., Xie, L., Rui, Y., Gao, W.: Learning to Distribute Vocabulary Indexing for Scalable Visual Search. TMM (2012) 4
Ji, R., Gao, Y., Zhong, B., Yao, H., Tian, Q.: Mining City Landmarks by Modeling Reconstruction Sparsity. TOMCCAP (2011) 4
Gao, Y., Tang, J., Hong, R., Dai, Q., Chua, T., Jain, R.: W2Go: A Travel Guidance System by Automatic Landmark Ranking. ACM Multimedia, 123–132 (2010) 4
Gao, Y., Wang, M., Tao, D., Ji, R., Dai, Q.: 3D Object Retrieval and Recognition with Hypergraph Analysis. TIP (2012) 4
Gao, Y., Wang, M., Zha, Z., Tian, Q., Dai, Q., Zhang, N.: Less is More: Efficient 3D Object Retrieval with Query View Selection. TMM (2011) 4
Schindler, G., Brown, M.: City-scale location recognition. In: CVPR (2007) 2, 4
Ji, R., Xie, X., Yao, H., Ma, W.-Y.: Mining city landmarks from blogs by graph modeling. ACM Multimedia, 105–114 (2009) 4
Cristani, M., Perina, A., Castellani, U., Murino, V.: Geolocated image analysis using latent representations. In: CVPR (2008) 2, 4
Crandall, D., Backstrom, L., Huttenlocher, D., Kleinberg, J.: Mapping the world’s photos. In: WWW (2009) 4
Hays, J., Efros, A.: IMG2GPS: estimating geographic information from a single image. In: CVPR (2008) 4
Kalogerakis, E., Vesselova, O., Hays, J., Efros, A., Hertzmann, A.: Image sequence geolocation with human travel priors. In: CVPR (2009) 4
Ji, R., Yao, H., Xie, X., Tian, Q.: Vocabulary Hierarchy Optimization and Transfer for Scalable Image Search. IEEE MM (2011) 4
Ji, R., Xie, X., Yao, H., Ma, W.-Y.: Mining City Landmarks by Graph Modeling. ACM Multimedia (2009) 4
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cao, L., Gao, Y., Liu, Q., Ji, R. (2013). Geographical Retagging. In: Li, S., et al. Advances in Multimedia Modeling. Lecture Notes in Computer Science, vol 7733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35728-2_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-35728-2_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35727-5
Online ISBN: 978-3-642-35728-2
eBook Packages: Computer ScienceComputer Science (R0)