Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2324796.2324820acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

Hamming embedding similarity-based image classification

Published: 05 June 2012 Publication History
  • Get Citation Alerts
  • Abstract

    In this paper, we propose a novel image classification framework based on patch matching. More precisely, we adapt the Hamming Embedding technique, first introduced for image search to improve the bag-of-words representation. This matching technique allows the fast comparison of descriptors based on their binary signatures, which refines the matching rule based on visual words and thereby limits the quantization error. Then, in order to allow the use of efficient and suitable linear kernel-based SVM classification, we propose a mapping method to cast the scores output by the Hamming Embedding matching technique into a proper similarity space. Comparative experiments of our proposed approach and other existing encoding methods on two challenging datasets PASCAL VOC 2007 and Caltech-256, report the interest of the proposed scheme, which outperforms all methods based on patch matching and even provide competitive results compared with the state-of-the-art coding techniques.

    References

    [1]
    R. Behmo, P. Marcombes, A. Dalalyan, and V. Prinet. Towards optimal naive bayes nearest neighbors. In ECCV, September 2010.
    [2]
    L. Bo and C. Sminchisescu. Efficient match kernels between sets of features for visual recognition. In NIPS, 2009.
    [3]
    O. Boiman, E. Shechman, and M. Irani. In defense of nearest neighbor based image classification. In CVPR, June 2008.
    [4]
    B. Caputo and L. Jie. A performance evaluation of exact and approximate match kernels for object recognition. Electronic Letters on Computer Vision and Image Analysis, 8(3):15--26, 2009.
    [5]
    K. Chatfield, V. Lempitsky, A. Vedaldi, and A. Zisserman. The devil is in the details: an evaluation of recent feature encoding methods. In BMVC, September 2011.
    [6]
    J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. ImageNet: A Large-Scale Hierarchical Image Database. In CVPR, June 2009.
    [7]
    O. Duchenne, A. Joulin, and J. Ponce. A graph-matching kernel for object categorization. In ICCV, September 2011.
    [8]
    M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The PASCAL Visual Object Classes Challenge 2007 (VOC2007) Results, 2007.
    [9]
    G. Griffin, A. Holub, and P. Perona. Caltech-256 object category dataset. Technical Report 7694, California Institute of Technology, 2007.
    [10]
    H. Harzallah, F. Jurie, and C. Schmid. Combining efficient object localization and image classification. In ICCV, September 2009.
    [11]
    T. Jaakkola and D. Haussler. Exploiting generative models in discriminative classifiers. In NIPS, 1998.
    [12]
    M. Jain, H. Jégou, and P. Gros. Asymmetric hamming embedding: taking the best of our bits for large scale image search. In ACM Multimedia, November 2011.
    [13]
    H. Jégou, M. Douze, and C. Schmid. Hamming embedding and weak geometric consistency for large scale image search. In ECCV, October 2008.
    [14]
    H. Jégou, M. Douze, and C. Schmid. On the burstiness of visual elements. In CVPR, June 2009.
    [15]
    H. Jégou, M. Douze, and C. Schmid. Improving bag-of-features for large scale image search. IJCV, 87(3):316--336, February 2010.
    [16]
    H. Jégou, M. Douze, C. Schmid, and P. Pérez. Aggregating local descriptors into a compact image representation. In CVPR, June 2010.
    [17]
    J. Kim and K. Grauman. Asymmetric region-to-image matching for comparing images with generic object categories. In CVPR, June 2010.
    [18]
    S. Lazebnik, C. Schmid, and J. Ponce. Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In CVPR, June 2006.
    [19]
    D. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 60(2):91--110, 2004.
    [20]
    F. Perronnin and C. R. Dance. Fisher kernels on visual vocabularies for image categorization. In CVPR, June 2007.
    [21]
    F. Perronnin, J. Sánchez, and T. Mensink. Improving the fisher kernel for large-scale image classification. In ECCV, September 2010.
    [22]
    J. Sivic and A. Zisserman. Video Google: A text retrieval approach to object matching in videos. In ICCV, pages 1470--1477, October 2003.
    [23]
    T. Tuytelaars, M. Fritz, K. Saenko, and T. Darrel. The NBNN kernel. In ICCV, September 2011.
    [24]
    J. van Gemert, C. Veenman, A. Smeulders, and J. Geusebroek. Visual word ambiguity. PAMI, 32(7):1271--1283, July 2010.
    [25]
    J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, and Y. Gong. Locality-constrained linear coding for image classification. In CVPR, June 2010.
    [26]
    J. Yang, Y. Li, Y. Tian, L. Duan, and W. Gao. Group sensitive multiple kernel learning for object categorization. In ICCV, September 2009.
    [27]
    J. Yang, K. Yu, Y. Gong, and T. Huang. Linear spatial pyramid matching using sparse coding for image classification. In CVPR, pages 1794--1801, 2009.
    [28]
    J. Zhang, M. Marszalek, S. Lazebnik, and C. Schmid. Local features and kernels for classification of texture and object categories: A comprehensive study. IJCV, 73:213--238, June 2007.
    [29]
    X. Zhou, K. Yu, T. Zhang, and T. S. Huang. Image classification using super-vector coding of local image descriptors. In ECCV, September 2010.

    Cited By

    View all
    • (2023)A non‐verbal teaching behaviour analysis for improving pointing out gestures: The case of asynchronous video lecture analysis using deep learningJournal of Computer Assisted Learning10.1111/jcal.1293340:3(1006-1018)Online publication date: 29-Dec-2023
    • (2018)SIFT Meets CNN: A Decade Survey of Instance RetrievalIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2017.270974940:5(1224-1244)Online publication date: 1-May-2018
    • (2016)Circulant Temporal Encoding for Video Retrieval and Temporal AlignmentInternational Journal of Computer Vision10.1007/s11263-015-0875-0119:3(291-306)Online publication date: 1-Sep-2016
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ICMR '12: Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
    June 2012
    489 pages
    ISBN:9781450313292
    DOI:10.1145/2324796
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 05 June 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Hamming embedding
    2. evaluation
    3. image classification
    4. similarity-based learning

    Qualifiers

    • Research-article

    Conference

    ICMR '12
    Sponsor:

    Acceptance Rates

    ICMR '12 Paper Acceptance Rate 50 of 145 submissions, 34%;
    Overall Acceptance Rate 254 of 830 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)20
    • Downloads (Last 6 weeks)2

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)A non‐verbal teaching behaviour analysis for improving pointing out gestures: The case of asynchronous video lecture analysis using deep learningJournal of Computer Assisted Learning10.1111/jcal.1293340:3(1006-1018)Online publication date: 29-Dec-2023
    • (2018)SIFT Meets CNN: A Decade Survey of Instance RetrievalIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2017.270974940:5(1224-1244)Online publication date: 1-May-2018
    • (2016)Circulant Temporal Encoding for Video Retrieval and Temporal AlignmentInternational Journal of Computer Vision10.1007/s11263-015-0875-0119:3(291-306)Online publication date: 1-Sep-2016
    • (2016)Image Search with Selective Match KernelsInternational Journal of Computer Vision10.1007/s11263-015-0810-4116:3(247-261)Online publication date: 1-Feb-2016
    • (2015)Examining the effectiveness of using concolic analysis to detect code clonesProceedings of the 30th Annual ACM Symposium on Applied Computing10.1145/2695664.2695929(1610-1615)Online publication date: 13-Apr-2015
    • (2015)Kernelizing Spatially Consistent Visual Matches for Fine-Grained ClassificationProceedings of the 5th ACM on International Conference on Multimedia Retrieval10.1145/2671188.2749328(155-162)Online publication date: 22-Jun-2015
    • (2015)Mixture of Subspaces Image Representation and Compact Coding for Large-Scale Image RetrievalIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2014.238209237:7(1469-1479)Online publication date: 1-Jul-2015
    • (2015)A Comparison of Dense Region Detectors for Image Search and Fine-Grained ClassificationIEEE Transactions on Image Processing10.1109/TIP.2015.242355724:8(2369-2381)Online publication date: 1-Aug-2015
    • (2014)A Group Testing Framework for Similarity Search in High-dimensional SpacesProceedings of the 22nd ACM international conference on Multimedia10.1145/2647868.2654895(407-416)Online publication date: 3-Nov-2014
    • (2014)Instance classification with prototype selectionProceedings of International Conference on Multimedia Retrieval10.1145/2578726.2578786(431-434)Online publication date: 1-Apr-2014
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media