DOI: 10.1145/1374296.1374327

A learning approach to semantic image analysis

Published: 18 September 2006

Abstract

This paper presents a learning approach that couples Support Vector Machines (SVMs) with a Genetic Algorithm (GA) for knowledge-assisted semantic image analysis in specific domains. The explicitly defined domain knowledge comprises the objects of the domain of interest and their spatial relations. SVMs are trained on low-level features to capture implicit information about each object of interest, and they provide an initial annotation of the image regions based solely on visual features. To account for the inherent ambiguity of visual information, fuzzy spatial relations are supplied together with these initial annotations to a genetic algorithm, which decides on the globally most plausible annotation. Experiments with images from the beach vacation domain demonstrate the performance of the proposed approach.
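
The two-stage idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the confidence values stand in for SVM outputs, and the fuzzy "above" degrees, the set of plausible label pairs, the fitness function, and all GA settings (tournament selection, one-point crossover, elitism) are assumptions chosen only to show how a per-region annotation can be revised by a global search.

```python
import random

LABELS = ["sky", "sea", "sand"]

# Hypothetical per-region label confidences, standing in for SVM outputs.
# Regions are listed from the top of the image to the bottom.
CONFIDENCES = [
    {"sky": 0.6, "sea": 0.3, "sand": 0.1},  # region 0 (top)
    {"sky": 0.5, "sea": 0.4, "sand": 0.1},  # region 1 (middle), ambiguous
    {"sky": 0.1, "sea": 0.2, "sand": 0.7},  # region 2 (bottom)
]

# Hypothetical fuzzy degrees to which region i lies above region j.
ABOVE = {(0, 1): 0.9, (1, 2): 0.8, (0, 2): 1.0}

# Assumed domain knowledge: (upper label, lower label) pairs that are plausible.
PLAUSIBLE_ABOVE = {("sky", "sea"), ("sky", "sand"), ("sea", "sand")}


def fitness(annotation):
    """Sum of visual confidences plus the degrees of satisfied spatial relations."""
    visual = sum(CONFIDENCES[i][label] for i, label in enumerate(annotation))
    spatial = sum(
        degree
        for (i, j), degree in ABOVE.items()
        if (annotation[i], annotation[j]) in PLAUSIBLE_ABOVE
    )
    return visual + spatial


def initial_annotation():
    """Label each region independently by its highest-confidence class."""
    return [max(conf, key=conf.get) for conf in CONFIDENCES]


def tournament(population, rng, k=3):
    """Return the fittest of k randomly sampled individuals."""
    return max(rng.sample(population, k), key=fitness)


def evolve(pop_size=20, generations=30, mutation_rate=0.2, seed=0):
    """Search for a high-fitness annotation; elitism keeps the best found so far."""
    rng = random.Random(seed)
    n = len(CONFIDENCES)
    # Seed the population with the visual-only annotation plus random ones.
    population = [initial_annotation()] + [
        [rng.choice(LABELS) for _ in range(n)] for _ in range(pop_size - 1)
    ]
    for _ in range(generations):
        children = [max(population, key=fitness)]  # carry the elite over unchanged
        while len(children) < pop_size:
            a, b = tournament(population, rng), tournament(population, rng)
            cut = rng.randrange(1, n)  # one-point crossover
            child = a[:cut] + b[cut:]
            child = [
                rng.choice(LABELS) if rng.random() < mutation_rate else gene
                for gene in child
            ]
            children.append(child)
        population = children
    return max(population, key=fitness)


if __name__ == "__main__":
    print("visual only:", initial_annotation())
    print("after GA:   ", evolve())
```

In this toy setup the per-region argmax labels the two upper regions both as sky, which scores 3.6 under the assumed fitness. The annotation sky, sea, sand scores 4.4 because it satisfies all three spatial relations, so a global search has reason to prefer it even though the middle region's visual confidence favors sky.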

Cited By

  • (2009) Semantic Video Analysis and Understanding. Encyclopedia of Information Science and Technology, Second Edition, pp. 3419-3425. DOI: 10.4018/978-1-60566-026-4.ch543. Online publication date: 2009

Published In

MobiMedia '06: Proceedings of the 2nd international conference on Mobile multimedia communications
September 2006
281 pages
ISBN: 1595935177
DOI: 10.1145/1374296
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. fuzzy directional relations
  2. genetic algorithms (GAs)
  3. object recognition
  4. ontologies
  5. semantic image analysis
  6. support vector machines (SVMs)

Qualifiers

  • Research-article
