Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2964284.2964329acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Academic Coupled Dictionary Learning for Sketch-based Image Retrieval

Published: 01 October 2016 Publication History
  • Get Citation Alerts
  • Abstract

    In the last few years, the query-by-visual-example paradigm gained popularity, specially for content based retrieval systems. As sketches represent a natural way of expressing a synthetic query, recent research efforts focused on developing algorithmic solutions to address the sketch-based image retrieval (SBIR) problem. Within this context, we propose a novel approach for SBIR that, unlike previous methods, is able to exploit the visual complexity inherently present in sketches and images. We introduce academic learning, a paradigm in which the sample learning order is constructed both from the data, as in self-paced learning, and from partial curricula. We propose an instantiation of this paradigm within the framework of coupled dictionary learning to address the SBIR task. We also present an efficient algorithm to learn the dictionaries and the codes, and to pace the learning combining the reconstruction error, the prior knowledge suggested by the partial curricula and the cross-domain code coherence. In order to evaluate the proposed approach, we report an extensive experimental validation showing that the proposed method outperforms the state-of-the-art in coupled dictionary learning and in SBIR on three different publicly available datasets.

    References

    [1]
    B. Alexe, T. Deselaers, and V. Ferrari. Measuring the objectness of image windows. IEEE Trans. on PAMI, 34(11):2189--2202, 2012.
    [2]
    A. Beck and M. Teboulle. A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J. Imag. Sciences, 2(1):183--202, 2009.
    [3]
    Y. Bengio, J. Louradour, R. Collobert, and J. Weston. Curriculum learning. In ICML, 2009.
    [4]
    A. D. Bimbo and P. Pala. Visual image retrieval by elastic matching of user sketches. IEEE Trans. on PAMI, 19(2):121--132, 1997.
    [5]
    Y. Cao, H. Wang, C. Wang, Z. Li, L. Zhang, and L. Zhang. Mindfinder: interactive sketch-based image search on millions of images. In ACM Multimedia, 2010.
    [6]
    A. Chalechale, G. Naghdy, and A. Mertins. Sketch-based image matching using angular partitioning. IEEE Trans. on Systems, Man, and Cybernetics, 35(1):28--41, 2005.
    [7]
    M. Eitz, J. Hays, and M. Alexa. How do humans sketch objects? ACM Trans. on Graphics, 31(4):44, 2012.
    [8]
    M. Eitz, K. Hildebrand, T. Boubekeur, and M. Alexa. An evaluation of descriptors for large-scale image retrieval from sketched feature lines. Computers & Graphics, 34(5):482--498, 2010.
    [9]
    M. Eitz, K. Hildebrand, T. Boubekeur, and M. Alexa. Sketch-based image retrieval: Benchmark and bag-of-features descriptors!. IEEE Trans. on Visualization and Computer Graphics, 17(11):1624--1636, 2011.
    [10]
    M. Everingham, L. Van Gool, C. K. Williams, J. Winn, and A. Zisserman. The pascal visual object classes (VOC) challenge. IJCV, 88(2):303--338, 2010.
    [11]
    J. Guo, C. Wang, and H. Chao. Building effective representations for sketch recognition. In AAAI, 2015.
    [12]
    R. Hu and J. Collomosse. A performance evaluation of gradient field hog descriptor for sketch based image retrieval. CVIU, 117(7):790--806, 2013.
    [13]
    D.-A. Huang and Y.-C. F. Wang. Coupled dictionary and feature space learning with applications to cross-domain image synthesis and recognition. In ICCV, 2013.
    [14]
    Y. Jia. Caffe: An open source convolutional architecture for fast feature embedding, 2013.
    [15]
    L. Jiang, D. Meng, T. Mitamura, and A. G. Hauptmann. Easy samples first: Self-paced reranking for zero-example multimedia search. In ACM Multimedia, 2014.
    [16]
    L. Jiang, D. Meng, S.-I. Yu, Z. Lan, S. Shan, and A. Hauptmann!. Self-paced learning with diversity!. In NIPS, 2014.
    [17]
    L. Jiang, D. Meng, Q. Zhao, S. Shan, and A. G. Hauptmann. Self-paced curriculum learning. In AAAI, 2015.
    [18]
    M. P. Kumar, B. Packer, and D. Koller. Self-paced learning for latent variable models. In NIPS, 2010.
    [19]
    H. Lee, A. Battle, R. Raina, and A. Y. Ng. Efficient sparse coding algorithms. In NIPS, 2006.
    [20]
    Y. J. Lee and K. Grauman. Learning the easy things first: Self-paced visual category discovery. In CVPR, 2011.
    [21]
    Y. Li, T. M. Hospedales, Y.-Z. Song, and S. Gong. Fine-grained sketch-based image retrieval by matching deformable part models. In BMVC, 2014.
    [22]
    Y.-L. Lin, C.-Y. Huang, H.-J. Wang, and W.-C. Hsu. 3d sub-query expansion for improving sketch-based multi-view image retrieval. In ICCV, 2013.
    [23]
    J. Mairal, F. Bach, J. Ponce, G. Sapiro, and A. Zisserman. Non-local sparse models for image restoration. In ICCV, 2009.
    [24]
    Y. Matsui. Challenge for manga processing: Sketch-based manga retrieval. In ACM Multimedia, 2015.
    [25]
    A. Pentina, V. Sharmanska, and C. H. Lampert. Curriculum learning of multiple tasks. CVPR, 2015.
    [26]
    Y. Qi, Y.-Z. Song, T. Xiang, H. Zhang, T. Hospedales, Y. Li, and J. Guo. Making better use of edges via perceptual grouping. In CVPR, 2015.
    [27]
    J. M. Saavedra, J. M. Barrios, and S. Orand. Sketch based image retrieval using learned keyshapes (LKS). In BMVC, 2015.
    [28]
    J. M. Saavedra and B. Bustos. An improved histogram of edge local orientations for sketch-based image retrieval. In Pattern Recognition, pages 432--441. Springer, 2010.
    [29]
    J. M. Saavedra and B. Bustos. Sketch-based image retrieval using keyshapes. Multimedia Tools and Applications, 73(3):2033--2062, 2014.
    [30]
    A. Sharma and D. W. Jacobs. Bypassing synthesis: Pls for face recognition with pose, low-resolution and sketch. In CVPR, 2011.
    [31]
    X. Sun, C. Wang, C. Xu, and L. Zhang. Indexing billions of images for sketch-based retrieval. In ACM Multimedia, 2013.
    [32]
    J. S. Supancic and D. Ramanan. Self-paced learning for long-term tracking. In CVPR, 2013.
    [33]
    X. Tang and X. Wang. Face sketch recognition. IEEE Trans. Cir. Sys. Video Tech., 14(1):50--57, 2004.
    [34]
    Y. Tang, Y.-B. Yang, and Y. Gao. Self-paced dictionary learning for image classification. In ACM Multimedia, 2012.
    [35]
    J. B. Tenenbaum and W. T. Freeman. Separating style and content with bilinear models. Neural Computation, 12(6):1247--1283, 2000.
    [36]
    S. Wang, L. Zhang, Y. Liang, and Q. Pan. Semi-coupled dictionary learning with applications to image super-resolution and photo-sketch synthesis. In CVPR, 2012.
    [37]
    X. Wang and X. Tang. Face photo-sketch synthesis and recognition. IEEE Trans. on PAMI, 31(11):1955--1967, 2009.
    [38]
    C. Xiao, C. Wang, L. Zhang, and L. Zhang. Sketch-based image retrieval via shape words. In ACM ICMR, 2015.
    [39]
    C. Xu, D. Tao, and C. Xu. Multi-view self-paced learning for clustering. In IJCAI, 2015.
    [40]
    D. Xu, E. Ricci, Y. Yan, J. Song, and N. Sebe. Learning deep representations of appearance and motion for anomalous event detection. In BMVC, 2015.
    [41]
    Y. Yan, Y. Yang, H. Shen, D. Meng, G. Liu, A. Hauptmann, and N. Sebe. Complex event detection via event oriented dictionary learning. In AAAI, 2015.
    [42]
    J. Yang, J. Wright, T. S. Huang, and Y. Ma. Image super-resolution via sparse representation. IEEE Trans. on Image Processing, 19(11):2861--2873, 2010.
    [43]
    S. M. Yoon, G.-J. Yoon, and T. Schreck. User-drawn sketch-based 3d object retrievalusing sparse coding. Multimedia Tools App., 74(13):4707--4722, 2015.
    [44]
    X. Zhai, Y. Peng, and J. Xiao. Heterogeneous metric learning with joint graph regularization for cross-media retrieval. In AAAI, 2013.
    [45]
    Q. Zhao, D. Meng, L. Jiang, Q. Xie, Z. Xu, and A. G. Hauptmann. Self-paced learning for matrix factorization. In AAAI, 2015.

    Cited By

    View all
    • (2024)A Systematic Literature Review of Deep Learning Approaches for Sketch-Based Image Retrieval: Datasets, Metrics, and Future DirectionsIEEE Access10.1109/ACCESS.2024.335793912(14847-14869)Online publication date: 2024
    • (2020)Semantic Enhanced Sketch Based Image Retrieval with Incomplete Multimodal Query2020 IEEE Sixth International Conference on Multimedia Big Data (BigMM)10.1109/BigMM50055.2020.00022(86-93)Online publication date: Sep-2020
    • (2019)Image Classification via Hierarchical Dictionary Learning2019 Chinese Control And Decision Conference (CCDC)10.1109/CCDC.2019.8832839(4630-4634)Online publication date: Jun-2019
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '16: Proceedings of the 24th ACM international conference on Multimedia
    October 2016
    1542 pages
    ISBN:9781450336031
    DOI:10.1145/2964284
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 October 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. dictionary learning
    2. self-paced and curriculum learning
    3. sketch-based image retrieval

    Qualifiers

    • Research-article

    Conference

    MM '16
    Sponsor:
    MM '16: ACM Multimedia Conference
    October 15 - 19, 2016
    Amsterdam, The Netherlands

    Acceptance Rates

    MM '16 Paper Acceptance Rate 52 of 237 submissions, 22%;
    Overall Acceptance Rate 995 of 4,171 submissions, 24%

    Upcoming Conference

    MM '24
    The 32nd ACM International Conference on Multimedia
    October 28 - November 1, 2024
    Melbourne , VIC , Australia

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)2

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A Systematic Literature Review of Deep Learning Approaches for Sketch-Based Image Retrieval: Datasets, Metrics, and Future DirectionsIEEE Access10.1109/ACCESS.2024.335793912(14847-14869)Online publication date: 2024
    • (2020)Semantic Enhanced Sketch Based Image Retrieval with Incomplete Multimodal Query2020 IEEE Sixth International Conference on Multimedia Big Data (BigMM)10.1109/BigMM50055.2020.00022(86-93)Online publication date: Sep-2020
    • (2019)Image Classification via Hierarchical Dictionary Learning2019 Chinese Control And Decision Conference (CCDC)10.1109/CCDC.2019.8832839(4630-4634)Online publication date: Jun-2019
    • (2019)Group sparse based locality --- sensitive dictionary learning for video semantic analysisMultimedia Tools and Applications10.1007/s11042-018-6417-378:6(6721-6744)Online publication date: 1-Mar-2019
    • (2018)Interactive Sports AnalyticsACM Transactions on Computer-Human Interaction10.1145/318559625:2(1-32)Online publication date: 11-Apr-2018
    • (2018)Cross-Paced Representation Learning With Partial Curricula for Sketch-Based Image RetrievalIEEE Transactions on Image Processing10.1109/TIP.2018.283738127:9(4410-4421)Online publication date: Sep-2018
    • (2018)Adaptive codebook modeling based multiple objects detection2018 Chinese Control And Decision Conference (CCDC)10.1109/CCDC.2018.8407540(2471-2475)Online publication date: Jun-2018
    • (2018)A time-series matching approach for symmetric-invariant boundary image matchingMultimedia Tools and Applications10.1007/s11042-017-5323-477:16(20979-21001)Online publication date: 1-Aug-2018
    • (2018)3D CAD model retrieval based on the softassign quadratic assignment algorithmMultimedia Tools and Applications10.1007/s11042-017-5197-577:13(16249-16265)Online publication date: 1-Jul-2018
    • (2017)Sketch based image retrieval via image-aided cross domain learning2017 IEEE International Conference on Image Processing (ICIP)10.1109/ICIP.2017.8296970(3685-3689)Online publication date: Sep-2017
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media