research-article

A learning approach to semantic image analysis

Authors:

G. Th. Papadopoulos,

S. DasiopoulouAuthors Info & Claims

MobiMedia '06: Proceedings of the 2nd international conference on Mobile multimedia communications

Article No.: 29, Pages 1 - 6

https://doi.org/10.1145/1374296.1374327

Published: 18 September 2006 Publication History

Abstract

In this paper, a learning approach coupling Support Vector Machines (SVMs) and a Genetic Algorithm (GA) is presented for knowledge-assisted semantic image analysis in specific domains. Explicitly defined domain knowledge under the proposed approach includes objects of the domain of interest and their spatial relations. SVMs are employed using low-level features to extract implicit information for each object of interest via training in order to provide an initial annotation of the image regions based solely on visual features. To account for the inherent visual information ambiguity, fuzzy spatial relations along with the previously computed initial annotations are supplied to a genetic algorithm, which decides on the globally most plausible annotation. Experiments with images of the beach vacation domain demonstrate the performance of the proposed approach.

References

[1]

A. W. M. Smeulders, M. Worring, S. Santini, A. Gupta and R. Jain, "Content-based image retrieval at the end of the early years", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, iss. 12, pp. 1349--1380, 2000.

Digital Library

[2]

W. Al-Khatib, Y. F. Day, A. Ghafoor, P. B. Berra, "Semantic Annotation of Images and Videos for Multimedia Analysis", 2nd European Semantic Web Conference (ESWC), Herakleion, Greece, 2005.

Digital Library

[3]

K. Petridis, S. Bloehdorn, C. Saathoff, N. Simou, S. Dasiopoulou, V. Tzouvaras, S. Handschuh, Y. Avrithis, I. Kompatsiaris and S. Staab, "Knowledge Representation and Semantic Annotation of Multimedia Content", IEE Proceedings on Vision Image and Signal Processing, Special issue on Knowledge-Based Digital Media Processing, Vol. 153, No. 3, pp. 255--262, June 2006.

[4]

J. Assfalg, M. Berlini, A. Del Bimbo, W. Nunziat, P. Pala, "Soccer Highlights Detection and Recognition using HMMs", IEEE International Conference on Multimedia & Expo (ICME), pp. 825--828, 2005.

[5]

L. Zhang, F. Z. Lin, B. Zhang, "Support Vector Machine Learning for Image Retrieval", International Conference on Image Processing, October, 2001.

[6]

S. Dasiopoulou, V. Mezaris, V. K. Papastathis, I. Kompatsiaris, M. G. Strintzis, "Knowledge-Assisted Semantic Video Object Detection", IEEE Transactions on Circuits and Systems for Video Technology, Special Issue on Analysis and Understanding for Video Adaptation, vol. 15, no. 10, pp. 1210--1224, 2005.

Digital Library

[7]

L. Hollink, S. Little, J. Hunter, "Evaluating the Application of Semantic Inferencing Rules to Image Annotation", 3rd International Conference on Knowledge Capture (K-CAP05), Banff, Canada, 2005.

Digital Library

[8]

K. I. Kim, K. Jung, S. H. Park, and H. J. Kim, "Support vector machines for texture classification", IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 24, no. 11, pp. 1542.1550, Nov. 2002.

Digital Library

[9]

O. Chapelle, P. Haffner and V. Vapnik, "Support vector machines for histogram-based image classification", IEEE Transactions on Neural Networks, vol. 10, no. 5, pp. 10551064, September 1999.

Digital Library

[10]

M. Mitchell, "An introduction to genetic algorithms", MIT Press, 1995.

Digital Library

[11]

N. Voisine, S. Dasiopoulou, F. Precioso, V. Mezaris, I. Kompatsiaris and M. G. Strintzis, "A Genetic Algorithm-based Approach to Knowledge-assisted Video Analysis", Proc. IEEE International Conference on Image Processing (ICIP 2005), Genova, 2005.

[12]

T. Adamek, N. O'Connor, N. Murphy, "Region-based Segmentation of Images Using Syntactic Visual Features", Workshop on Image Analysis for Multimedia Interactive Services, (WIAMIS), Montreux, Switzerland, 2005.

[13]

"MPEG-7 Visual Experimentation Model (XM)", Version 10.0, ISO/IEC/JTC1/SC29/WG11, Doc. N4062, Mar., 2001.

[14]

S. Skiadopoulos, C. Giannoukos, N. Sarkas, P. Vassiliadis, T. Sellis, M. Koubarakis, "2D topological and direction relations in the world of minimum bounding circles", IEEE Transactions on Knowledge and Data Engineering, vol. 17, iss. 12, pp. 1610--1623, 2005.

Digital Library

[15]

Y. Wang, F. Makedon, J. Ford, L. Shen, D. Golding, "Generating Fuzzy Semantic Metadata Describing Spatial Relations from Images using the R-Histogram", JCDL '04, June 7--11, Tucson, Arizona, USA, 2004.

[16]

S. Staab and R. Studer, "Handbook on ontologies", in Int. Handbooks on Information Systems. Berlin, Germany: Springer-Verlag, 2004.

Digital Library

[17]

D. Tax and R. Duin, "Using two-class classifiers for multi-class classification", in Proc. Int. Conf. Pattern Recognition, Quebec City, Canada, vol. 2, pp. 124127, 2002.

[18]

C.-C. Chang and C.-J. Lin., "LIBSVM: A library for support vector machines", http://www.csie.ntu.edu.tw/cjlin/libsvm, 2001.

[19]

D. Goldberg, K. Deb, "A comparative analysis of selection schemes used in genetic algorithms", In Foundations of Genetic Algorithms, G. Rawlins, 69--93, 1991.

Cited By

Mezaris VPapadopoulos G(2009)Semantic Video Analysis and UnderstandingEncyclopedia of Information Science and Technology, Second Edition10.4018/978-1-60566-026-4.ch543(3419-3425)Online publication date: 2009
https://doi.org/10.4018/978-1-60566-026-4.ch543

Index Terms

A learning approach to semantic image analysis
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
      2. Computer vision tasks
        Scene understanding
  2. Machine learning
2. Information systems
  1. Information storage systems
    1. Record storage systems
      1. Record storage alternatives

Recommendations

Feature weighting and SVM parameters optimization based on genetic algorithms for classification problems

Support Vector Machines (SVMs) are widely known as an efficient supervised learning model for classification problems. However, the success of an SVM classifier depends on the perfect choice of its parameters as well as the structure of the data. Thus, ...
A Statistical Learning Approach to Spatial Context Exploitation for Semantic Image Analysis
ICPR '10: Proceedings of the 2010 20th International Conference on Pattern Recognition

In this paper, a statistical learning approach to spatial context exploitation for semantic image analysis is presented. The proposed method constitutes an extension of the key parts of the authors' previous work on spatial context utilization, where a ...
Efficient chromosome encoding and problem-specific mutation methods for the flexible bay facility layout problem

Two chromosome encoding methods are compared for finding solutions to the nondeterministic polynomial-time hard flexible bay facilities layout problem via genetic algorithm (GA). Both methods capitalize on the random key GA approach to produce ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

MobiMedia '06: Proceedings of the 2nd international conference on Mobile multimedia communications

September 2006

281 pages

ISBN:1595935177

DOI:10.1145/1374296

Copyright © 2006 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 September 2006

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Sixth Framework Programme

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
294
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 02 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

Mezaris VPapadopoulos G(2009)Semantic Video Analysis and UnderstandingEncyclopedia of Information Science and Technology, Second Edition10.4018/978-1-60566-026-4.ch543(3419-3425)Online publication date: 2009
https://doi.org/10.4018/978-1-60566-026-4.ch543

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents