Abstract
This paper presents a novel method of automatic image semantic annotation. Our approach is based on the Image-Keyword Document Model (IKDM) with image features discretization. According to IKDM, the image keyword annotation is conducted using image similarity measurement based on language model from text information retrieval domain. Through the experiments on a testing set of 5000 annotated images, our approach demonstrates great improvement of annotation performance compared with the known discretization-based image annotation model such as CMRM. Our approach also performs better in annotation time compared with the continuous model such as CRM.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Assfalg, J., Bertini, M., Colombo, C., Del Bimbo, A.: Semantic Annotation of Sports Videos. IEEE Multimedia (April-June 2002)
Barnard, K., Duygulu, P., Forsyth, D.: Clustering Art. In: Proceedings of IEEE ICPR (2001)
Blei, D., Jordan, M.I.: Modeling annotated data. In: Proc. of the 26th Intl. ACM SIGIR Conf., pp. 127–134 (2003)
Berman, A., Shapiro, L.G.: Efficient image retrieval with multiple distance measures. In: Storage and Retrieval for Image and Video Databases(SPIE), pp. 12–21 (1997)
Cusano, C., Ciocca, G., Schettini, R.: Image Annotation Using Svm. In: Proceedings of Internet imaging IV, vol. SPIE 5304 (2004)
Duygulu, P., Barnard, K., de Freitas, N., Forsyth, D.: Object recognition as machine translation:learning a lexicon for a fixed image vocabulary. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2353, pp. 97–112. Springer, Heidelberg (2002)
Fayyad, U., Irani, K.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Proc. 13th IJCAI, pp. 1022C–1027C (1993)
Fountain, S., Tan, T.: Content Based Annotation and Retrieval. In: RAIDER IRSG (1998)
Gupta, A., Weymouth, T.E., Jain, R.: Semantic queries with pictures: the VIMSYS model. In: VLDB, pp. 69–79 (1991)
Jaser, E., Kittler, J., Christmas, W.J.: Hierarchical Decision Making Scheme for Sports Video Categorisation with Temporal Post-Processing. In: CVPR, vol. II, pp. 908–913 (2004)
Jeon, J., Lavrenko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: Proc. of 26th ACM SIGIR, pp. 119–126 (2003)
Jin, R., Chai, J., Si, L.: Effective Automatic Image Annotation Via A Coherent Language Model and Active Learning. In: Proc. of ACM Multimedia (2004)
Lavrenko, V., Manmatha, R., Jeon, J.: A Model for Learning the Semantics of Pictures. In: Proceedings of Advances in Neural Information Processing (2003)
Zhang, L., Chen, L., Li, M., Zhang, H.: Automated annotation of human faces in family albums. In: Proc. of ACM Multimedia, pp. 355–358 (2003)
Mori, Y., Takahashi, H., Oka, R.: Image-to-word transformation based on dividing and vector quantizing images with words. In: Proc. of MISRM (1999)
Muller, H., Muller, W., Marchand-Maillet, S., Pun, T., Squire, D.: Strategies for Positive and Negative Relevance Feedback in Image Retrieval. In: ICPR, pp. 5043–5042 (2000)
Lew, M., Sebe, N., Eakins, J.: Challenges of image and video retrieval. In: Lew, M., Sebe, N., Eakins, J.P. (eds.) CIVR 2002. LNCS, vol. 2383, pp. 1–6. Springer, Heidelberg (2002)
Monay, F., Gatica-Perez, D.: On Image Auto- Annotation with Latent Space Models. In: Proceedings of ACM Multimedia Conf. (2003)
Naphade, M.R., Kozintsev, I.V., Huang, T.S.: A Factor Graph Framework for Semantic Video Inexing. IEEE Trans. on Circuits and Systems for Video Technology 12(1) (2002)
Wang, W., Zhang, A.: Evaluation of low-level features by decisive feature patterns. In: Proc. of IEEE ICME (2004)
Rui, Y., Huang, T.S.: A novel relevance feedback technique in image retrieval. In: Proc. of the 7th ACM Int.Conf. on Multimedia, pp. 67–70 (1999)
Smith, J.R., Chang, S.-F.: VisualSEEk: A Fully Automated Content-Based Image Query System. In: Proc. of ACM Multimedia, pp. 87–98 (1996)
Tao, J.L., Hung, Y.P.: A bayesian method for content-based image retrieval by use of relevance feedback. In: Chang, S.-K., Chen, Z., Lee, S.-Y. (eds.) VISUAL 2002. LNCS, vol. 2314, pp. 76–87. Springer, Heidelberg (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhou, X., Chen, L., Ye, J., Zhang, Q., Shi, B. (2005). Automatic Image Semantic Annotation Based on Image-Keyword Document Model. In: Leow, WK., Lew, M.S., Chua, TS., Ma, WY., Chaisorn, L., Bakker, E.M. (eds) Image and Video Retrieval. CIVR 2005. Lecture Notes in Computer Science, vol 3568. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11526346_22
Download citation
DOI: https://doi.org/10.1007/11526346_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27858-0
Online ISBN: 978-3-540-31678-7
eBook Packages: Computer ScienceComputer Science (R0)