Academic Coupled Dictionary Learning for Sketch-based Image Retrieval

Authors: Xavier Alameda-Pineda, Elisa Ricci, and Nicu Sebe

MM '16: Proceedings of the 24th ACM international conference on Multimedia

October 2016

Pages 1326 - 1335

https://doi.org/10.1145/2964284.2964329

Published: 01 October 2016

Abstract

In the last few years, the query-by-visual-example paradigm has gained popularity, especially for content-based retrieval systems. As sketches represent a natural way of expressing a synthetic query, recent research efforts have focused on developing algorithmic solutions to the sketch-based image retrieval (SBIR) problem. Within this context, we propose a novel approach for SBIR that, unlike previous methods, is able to exploit the visual complexity inherently present in sketches and images. We introduce academic learning, a paradigm in which the sample learning order is constructed both from the data, as in self-paced learning, and from partial curricula. We propose an instantiation of this paradigm within the framework of coupled dictionary learning to address the SBIR task. We also present an efficient algorithm to learn the dictionaries and the codes, and to pace the learning by combining the reconstruction error, the prior knowledge suggested by the partial curricula, and the cross-domain code coherence. To evaluate the proposed approach, we report an extensive experimental validation showing that the proposed method outperforms the state of the art in coupled dictionary learning and in SBIR on three publicly available datasets.
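To make the pacing idea in the abstract more concrete, the following is a minimal sketch of self-paced coupled dictionary learning. It is not the authors' algorithm. It rests on several assumptions: ridge (L2) codes stand in for sparse codes, pacing selects a growing fraction of the lowest-loss sketch-image pairs, and the partial curricula are left out entirely. All function names and parameters (`fit`, `reg`, `gamma`, and so on) are hypothetical.

```python
import numpy as np

def update_codes(D, X, other_codes, reg, gamma):
    # Ridge codes with a coherence pull towards the other domain's codes:
    # argmin_A ||X - D A||^2 + reg ||A||^2 + gamma ||A - other_codes||^2
    k = D.shape[1]
    lhs = D.T @ D + (reg + gamma) * np.eye(k)
    rhs = D.T @ X + gamma * other_codes
    return np.linalg.solve(lhs, rhs)

def update_dictionary(X, A, v, eps=1e-8):
    # Weighted least squares over the selected samples, then unit-norm columns.
    k = A.shape[0]
    Aw = A * v  # v has one 0/1 weight per sample (column)
    D = (X @ Aw.T) @ np.linalg.inv(A @ Aw.T + eps * np.eye(k))
    return D / np.maximum(np.linalg.norm(D, axis=0), eps)

def sample_losses(Xs, Xi, Ds, Di, As, Ai, gamma):
    # Per-pair loss: both reconstruction errors plus cross-domain code coherence.
    rec_s = np.sum((Xs - Ds @ As) ** 2, axis=0)
    rec_i = np.sum((Xi - Di @ Ai) ** 2, axis=0)
    coherence = np.sum((As - Ai) ** 2, axis=0)
    return rec_s + rec_i + gamma * coherence

def self_paced_weights(losses, fraction):
    # Assumed pacing rule: keep the given fraction of lowest-loss pairs.
    threshold = np.quantile(losses, fraction)
    return (losses <= threshold).astype(float)

def fit(Xs, Xi, k=8, n_iter=6, reg=0.1, gamma=1.0, seed=0):
    # Xs: sketch features (d_s x n); Xi: image features (d_i x n); columns are pairs.
    rng = np.random.default_rng(seed)
    Ds = rng.standard_normal((Xs.shape[0], k))
    Di = rng.standard_normal((Xi.shape[0], k))
    Ds /= np.linalg.norm(Ds, axis=0)
    Di /= np.linalg.norm(Di, axis=0)
    n = Xs.shape[1]
    As, Ai = np.zeros((k, n)), np.zeros((k, n))
    v = np.ones(n)
    # The selected fraction grows from half of the pairs to all of them.
    for fraction in np.linspace(0.5, 1.0, n_iter):
        As = update_codes(Ds, Xs, Ai, reg, gamma)
        Ai = update_codes(Di, Xi, As, reg, gamma)
        losses = sample_losses(Xs, Xi, Ds, Di, As, Ai, gamma)
        v = self_paced_weights(losses, fraction)
        Ds = update_dictionary(Xs, As, v)
        Di = update_dictionary(Xi, Ai, v)
    return Ds, Di, As, Ai, v
```

In this sketch, easy pairs (low reconstruction error and coherent codes) shape the dictionaries first, and harder pairs enter as the fraction grows. A curriculum-aware variant would additionally bias the selection with prior knowledge about sample difficulty, which is not modelled here.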

Index Terms

Academic Coupled Dictionary Learning for Sketch-based Image Retrieval
1. Computing methodologies
  1. Machine learning
2. Information systems
  1. Information retrieval
    1. Retrieval models and ranking
    2. Specialized information retrieval
      1. Multimedia and multimodal retrieval
        Image search

Published In

MM '16: Proceedings of the 24th ACM international conference on Multimedia

October 2016

1542 pages

ISBN:9781450336031

DOI:10.1145/2964284

General Chairs: Alan Hanjalic (Delft University of Technology), Cees Snoek (Qualcomm Research Netherlands / University of Amsterdam), Marcel Worring (University of Amsterdam)

Moderator: Dick Bulterman (CWI / VU University Amsterdam)

Program Chairs: Benoit Huet (EURECOM), Aisling Kelliher (Virginia Tech), Yiannis Kompatsiaris (CERTH-ITI), Jin Li (Microsoft)

Copyright © 2016 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

MM '16

Sponsor:

SIGMM

MM '16: ACM Multimedia Conference

October 15 - 19, 2016

Amsterdam, The Netherlands

Acceptance Rates

MM '16 Paper Acceptance Rate 52 of 237 submissions, 22%;

Overall Acceptance Rate 995 of 4,171 submissions, 24%
