Hamming embedding similarity-based image classification

Authors: Rachid Benmokhtar, Hervé Jégou, and Patrick Gros

ICMR '12: Proceedings of the 2nd ACM International Conference on Multimedia Retrieval

June 2012

Article No.: 19, Pages 1 - 8

https://doi.org/10.1145/2324796.2324820

Published: 05 June 2012

Abstract

In this paper, we propose a novel image classification framework based on patch matching. More precisely, we adapt the Hamming Embedding technique, which was first introduced for image search to improve the bag-of-words representation. This matching technique allows fast comparison of descriptors based on their binary signatures, which refines the matching rule based on visual words and thereby limits the quantization error. Then, to allow the use of efficient linear kernel-based SVM classification, we propose a mapping method that casts the scores output by the Hamming Embedding matching technique into a proper similarity space. Comparative experiments against other existing encoding methods on two challenging datasets, PASCAL VOC 2007 and Caltech-256, show the value of the proposed scheme: it outperforms all methods based on patch matching and even provides competitive results compared with state-of-the-art coding techniques.
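The matching idea summarized in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the threshold value, and the use of raw match counts as features are assumptions made for illustration, and the paper's actual mapping into a similarity space is not reproduced here. The sketch assumes each descriptor has already been assigned a visual word index and a binary signature.

```python
import numpy as np


def hamming_match_score(words_a, sigs_a, words_b, sigs_b, threshold):
    """Count descriptor pairs that share a visual word and whose binary
    signatures differ in at most `threshold` bits (hypothetical helper)."""
    # Pairwise test: do descriptor i of image A and j of image B share a visual word?
    same_word = words_a[:, None] == words_b[None, :]
    # Pairwise Hamming distance between signatures stored as rows of 0/1 bits.
    dist = (sigs_a[:, None, :] != sigs_b[None, :, :]).sum(axis=2)
    return int(np.sum(same_word & (dist <= threshold)))


def similarity_features(query, references, threshold):
    """Describe an image by its match scores against a list of reference
    images; such a vector could then be fed to a linear classifier."""
    q_words, q_sigs = query
    return np.array(
        [hamming_match_score(q_words, q_sigs, w, s, threshold) for w, s in references],
        dtype=float,
    )


# Toy data: visual word indices and 4-bit signatures for two images.
words_a = np.array([0, 1, 1])
sigs_a = np.array([[0, 0, 0, 0], [1, 1, 0, 0], [1, 1, 1, 1]])
words_b = np.array([1, 0])
sigs_b = np.array([[1, 1, 0, 1], [0, 0, 1, 1]])

features = similarity_features(
    (words_a, sigs_a), [(words_b, sigs_b), (words_a, sigs_a)], threshold=1
)
```

In this toy setting, a pair of descriptors only counts as a match when the visual words agree and the signatures are close, which is the refinement over plain bag-of-words matching that the abstract describes. Raising the threshold admits more pairs and moves the behavior back toward visual-word-only matching.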

Index Terms

Hamming embedding similarity-based image classification
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision representations
        Image representations
2. Information systems
  1. Information retrieval

Information & Contributors

Information

Published In

ICMR '12: Proceedings of the 2nd ACM International Conference on Multimedia Retrieval

June 2012

489 pages

ISBN:9781450313292

DOI:10.1145/2324796

Conference Chairs: Horace H. S. Ip (City University of Hong Kong), Yong Rui (Microsoft, China)

Copyright © 2012 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Conference

ICMR '12

Sponsor:

SIGMM

ICMR '12: International Conference on Multimedia Retrieval

June 5 - 8, 2012

Hong Kong, China

Acceptance Rates

ICMR '12 Paper Acceptance Rate 50 of 145 submissions, 34%;

Overall Acceptance Rate 254 of 830 submissions, 31%

Cited By

Yoon H, Kang S, Kim S (2023). A non-verbal teaching behaviour analysis for improving pointing out gestures: The case of asynchronous video lecture analysis using deep learning. Journal of Computer Assisted Learning, 40:3 (1006-1018). Online publication date: 29-Dec-2023.
https://doi.org/10.1111/jcal.12933
Zheng L, Yang Y, Tian Q (2018). SIFT Meets CNN: A Decade Survey of Instance Retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40:5 (1224-1244). Online publication date: 1-May-2018.
https://doi.org/10.1109/TPAMI.2017.2709749
Douze M, Revaud J, Verbeek J, Jégou H, Schmid C (2016). Circulant Temporal Encoding for Video Retrieval and Temporal Alignment. International Journal of Computer Vision, 119:3 (291-306). Online publication date: 1-Sep-2016.
https://dl.acm.org/doi/10.1007/s11263-015-0875-0
Tolias G, Avrithis Y, Jégou H (2016). Image Search with Selective Match Kernels. International Journal of Computer Vision, 116:3 (247-261). Online publication date: 1-Feb-2016.
https://dl.acm.org/doi/10.1007/s11263-015-0810-4
Krutz D, Malachowsky S, Shihab E, Wainwright R, Corchado J, Bechini A, Hong J (2015). Examining the effectiveness of using concolic analysis to detect code clones. Proceedings of the 30th Annual ACM Symposium on Applied Computing (1610-1615). Online publication date: 13-Apr-2015.
https://dl.acm.org/doi/10.1145/2695664.2695929
Leveau V, Joly A, Buisson O, Valduriez P, Hauptmann A, Ngo C, Xue X, Jiang Y, Snoek C, Vasconcelos N (2015). Kernelizing Spatially Consistent Visual Matches for Fine-Grained Classification. Proceedings of the 5th ACM on International Conference on Multimedia Retrieval (155-162). Online publication date: 22-Jun-2015.
https://dl.acm.org/doi/10.1145/2671188.2749328
Takahashi T, Kurita T (2015). Mixture of Subspaces Image Representation and Compact Coding for Large-Scale Image Retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37:7 (1469-1479). Online publication date: 1-Jul-2015.
https://dl.acm.org/doi/10.1109/TPAMI.2014.2382092
Iscen A, Tolias G, Gosselin P, Jegou H (2015). A Comparison of Dense Region Detectors for Image Search and Fine-Grained Classification. IEEE Transactions on Image Processing, 24:8 (2369-2381). Online publication date: 1-Aug-2015.
https://dl.acm.org/doi/10.1109/TIP.2015.2423557
Shi M, Furon T, Jégou H, Hua K, Rui Y, Steinmetz R, Hanjalic A, Natsev A, Zhu W (2014). A Group Testing Framework for Similarity Search in High-dimensional Spaces. Proceedings of the 22nd ACM international conference on Multimedia (407-416). Online publication date: 3-Nov-2014.
https://dl.acm.org/doi/10.1145/2647868.2654895
Krapac J, Perronnin F, Furon T, Jégou H, Kankanhalli M, Rueger S, Manmatha R, Jose J, van Rijsbergen K (2014). Instance classification with prototype selection. Proceedings of International Conference on Multimedia Retrieval (431-434). Online publication date: 1-Apr-2014.
https://dl.acm.org/doi/10.1145/2578726.2578786
