Abstract
In this paper, we study computational models and techniques to merge textual and image features to classify images on the World Wide Web (WWW). A vector-based framework is used to index images on the basis of textual, pictorial and composite (textual-pictorial) information. The scheme makes use of weighted document terms and color invariant image features to obtain a highdimensional image descriptor in vector form to be used as an index. Experiments are conducted on a representative set of more than 100.000 images down loaded from the WWW together with their associated text. Performance evaluations are reported on the accuracy of merging textual and pictorial information for image classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Favella, J. and Meza, V., “Image-retrieval Agent: Integrating Image Content and Text”, IEEE Int. Sys., 1999.
T. Gevers and Arnold W.M. Smeulders, “PicToSeek: Combining Color and Shape Invariant Features for Image Retrieval”, IEEE Trans. on Image Processing, 9(1), pp. 102–120, 2000.
S. Sclaroff, M. La Cascia, S. Sethi, L. Taycher, “Unifying Textual and Visual Cues for Content-based Image Retrieval on the World Wide Web,” CVIU, 75(1/2), 1999.
J.R. Smith and S.-F. Chang, “VisualSEEK: A Fully Automated Content-based Image Query System,” ACM Multimedia, 1996.
A. Vailaya, M. Figueiredo, A. Jain, H. Zhang, “Content-based Hierarchical Classification of Vacation Images,” IEEE ICMCS, June 7-11 1999, 1999.
H.-H. Yu and W. Wolf, “Scene Classification Methods for Image and Video Databases”, Proc. SPIE on DISAS, 1995.
D. Zhong, H. j. Zhang, S.-F. Chang, “Clustering Methods for Video Browsing and Annotation”, Proc. SPIE on SRIVD, 1995.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gevers, T., Aldershoff, F., Geusebroek, JM. (2000). Integrating Visual and Textual Cues for Image Classification. In: Laurini, R. (eds) Advances in Visual Information Systems. VISUAL 2000. Lecture Notes in Computer Science, vol 1929. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-40053-2_37
Download citation
DOI: https://doi.org/10.1007/3-540-40053-2_37
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41177-2
Online ISBN: 978-3-540-40053-0
eBook Packages: Springer Book Archive