Article

Efficient clothing retrieval with semantic-preserving visual phrases

Authors:

Hanqing Lu

ACCV'12: Proceedings of the 11th Asian conference on Computer Vision - Volume Part II

Pages 420 - 431

https://doi.org/10.1007/978-3-642-37444-9_33

Published: 05 November 2012

Abstract

In this paper, we address the problem of large-scale cross-scenario clothing retrieval with semantic-preserving visual phrases (SPVP). Since human parts are important cues for clothing detection and segmentation, we first detect human parts as the semantic context and refine the part regions with sparse background reconstruction. The semantic parts are then encoded into the vocabulary tree under the bag-of-visual-words (BOW) framework, and the contextual constraints among visual words in different human parts are exploited through the SPVP. Moreover, the SPVP is integrated into the inverted index structure to accelerate retrieval. Experiments and comparisons on our clothing dataset indicate that the SPVP significantly enhances the discriminative power of local features with only a slight increase in memory usage or runtime compared to the BOW model. In these experiments, the approach outperforms both the state-of-the-art approach and two clothing search engines.
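
The abstract does not give the details of the SPVP formulation, so the following is only a minimal sketch of the general idea it describes: attaching a human-part label to each visual word and storing the result in an inverted index. The composite key `(part_id, word_id)`, the function names, the smoothed IDF weighting, and the toy data are all assumptions made for illustration, not the authors' method.

```python
import math
from collections import Counter, defaultdict


def build_inverted_index(database):
    """Map each (part_id, word_id) key to {image_id: term count}.

    `database` is a hypothetical {image_id: [(part_id, word_id), ...]} dict,
    where word_id would come from quantizing a local descriptor.
    """
    index = defaultdict(dict)
    for image_id, features in database.items():
        for key, count in Counter(features).items():
            index[key][image_id] = count
    return index


def search(index, num_images, query_features, top_k=3):
    """Score database images by a TF-IDF dot product over shared keys."""
    scores = defaultdict(float)
    for key, q_count in Counter(query_features).items():
        postings = index.get(key)
        if not postings:
            continue  # this part-specific word never occurs in the database
        # Smoothed IDF (an assumption) so the weight is always positive.
        idf = math.log(1.0 + num_images / len(postings))
        for image_id, d_count in postings.items():
            scores[image_id] += (q_count * idf) * (d_count * idf)
    # Highest score first; ties broken by image id for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]


# Toy data: visual word 3 appears on different parts in different images.
database = {
    "shop_a": [("torso", 3), ("torso", 7), ("left_arm", 3)],
    "shop_b": [("torso", 5), ("left_arm", 7), ("left_leg", 3)],
    "shop_c": [("left_leg", 3), ("left_leg", 9)],
}
index = build_inverted_index(database)
results = search(index, len(database), [("torso", 3), ("torso", 7)])
print(results)
```

With a plain BOW key (the word id alone), word 3 would match all three toy images. Keying on the part as well restricts the match to images where the word occurs on the same part, which is one simple way to read "contextual constraint" in this setting.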

Index Terms

1. Information systems
  1. Information retrieval

Published In

ACCV'12: Proceedings of the 11th Asian conference on Computer Vision - Volume Part II

November 2012

810 pages

ISBN:9783642374432

Editors:
Kyoung Mu Lee, Department of Electrical and Computer Engineering, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, Korea
Yasuyuki Matsushita, Department of Electrical and Computer Engineering, Microsoft Research Asia, No. 5, Danling st., Haidian district, Beijing, P.R. China
James M. Rehg, School of Interactive Computing, Georgia Institute of Technology, 801 Atlantic Drive, CCB 315, Atlanta, GA, USA
Zhanyi Hu, Institute of Automation, National Laboratory of Pattern Recognition, Chinese Academy of Sciences, Zhong Quan Cun East Road 95, Haidian District, Beijing, P.R. China

Sponsors

DCC: Daejeon Convention Center
DMCITY: Daejeon Metropolitan City
KTO: Korea Tourism Organization
KAIST
DIME: Daejeon International Marketing Enterprise

Publisher

Springer-Verlag

Berlin, Heidelberg

Qualifiers

Article

Cited By

Feng Z, Yu Z, Jing Y, Wu S, Song M, Yang Y, Jiang J (2019). Interpretable Partitioned Embedding for Intelligent Multi-item Fashion Outfit Composition. ACM Transactions on Multimedia Computing, Communications, and Applications 15:2s (1-20). doi:10.1145/3326332. Online publication date: 29-Jul-2019
https://dl.acm.org/doi/10.1145/3326332
Chen Y, Shen W, Li Q, Wei Z (2019). A Dustbin Category Based Feedback Incremental Learning Strategy for Hierarchical Image Classification. Pattern Recognition and Computer Vision (480-491). doi:10.1007/978-3-030-31654-9_41. Online publication date: 8-Nov-2019
https://dl.acm.org/doi/10.1007/978-3-030-31654-9_41
Feng Z, Yu Z, Yang Y, Jing Y, Jiang J, Song M, Aizawa K, Lew M, Satoh S (2018). Interpretable Partitioned Embedding for Customized Multi-item Fashion Outfit Composition. Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval (143-151). doi:10.1145/3206025.3206048. Online publication date: 5-Jun-2018
https://dl.acm.org/doi/10.1145/3206025.3206048
Sun G, Cheng Z, Wu X, Peng Q (2018). Personalized clothing recommendation combining user social circle and fashion style consistency. Multimedia Tools and Applications 77:14 (17731-17754). doi:10.1007/s11042-017-5245-1. Online publication date: 1-Jul-2018
https://dl.acm.org/doi/10.1007/s11042-017-5245-1
Lasserre J, Bracher C, Vollgraf R (2018). Street2Fashion2Shop: Enabling Visual Search in Fashion e-Commerce Using Studio Images. Pattern Recognition Applications and Methods (3-26). doi:10.1007/978-3-030-05499-1_1. Online publication date: 16-Jan-2018
https://dl.acm.org/doi/10.1007/978-3-030-05499-1_1
Yan S, Liu Z, Luo P, Qiu S, Wang X, Tang X, Liu Q, Lienhart R, Wang H, Chen S, Boll S, Chen P, Friedland G, Li J, Yan S (2017). Unconstrained Fashion Landmark Detection via Hierarchical Recurrent Transformer Networks. Proceedings of the 25th ACM international conference on Multimedia (172-180). doi:10.1145/3123266.3123276. Online publication date: 23-Oct-2017
https://dl.acm.org/doi/10.1145/3123266.3123276
Jiang S, Wu Y, Fu Y, Hanjalic A, Snoek C, Worring M, Bulterman D, Huet B, Kelliher A, Kompatsiaris Y, Li J (2016). Deep Bi-directional Cross-triplet Embedding for Cross-Domain Clothing Retrieval. Proceedings of the 24th ACM international conference on Multimedia (52-56). doi:10.1145/2964284.2967182. Online publication date: 1-Oct-2016
https://dl.acm.org/doi/10.1145/2964284.2967182
Li Z, Li Y, Gao Y, Liu Y (2016). Fast Cross-Scenario Clothing Retrieval Based on Indexing Deep Features. 17th Pacific-Rim Conference on Advances in Multimedia Information Processing - Volume 9916 (107-118). doi:10.1007/978-3-319-48890-5_11. Online publication date: 15-Sep-2016
https://dl.acm.org/doi/10.1007/978-3-319-48890-5_11
Mizuochi M, Kanezaki A, Harada T, Hua K, Rui Y, Steinmetz R, Hanjalic A, Natsev A, Zhu W (2014). Clothing Retrieval Based on Local Similarity with Multiple Images. Proceedings of the 22nd ACM international conference on Multimedia (1165-1168). doi:10.1145/2647868.2655021. Online publication date: 3-Nov-2014
https://dl.acm.org/doi/10.1145/2647868.2655021
Moreira M, dos Santos J, Veloso A, Kankanhalli M, Rueger S, Manmatha R, Jose J, van Rijsbergen K (2014). Learning to Rank Similar Apparel Styles with Economically-Efficient Rule-Based Active Learning. Proceedings of International Conference on Multimedia Retrieval (361-368). doi:10.1145/2578726.2578773. Online publication date: 1-Apr-2014
https://dl.acm.org/doi/10.1145/2578726.2578773
