Article (DOI: 10.1007/978-3-642-37444-9_33)

Efficient clothing retrieval with semantic-preserving visual phrases

Published: 05 November 2012

Abstract

In this paper, we address the problem of large-scale cross-scenario clothing retrieval with semantic-preserving visual phrases (SPVP). Since human parts are important cues for clothing detection and segmentation, we first detect human parts as the semantic context and refine the part regions with sparse background reconstruction. The semantic parts are then encoded into the vocabulary tree under the bag-of-visual-words (BOW) framework, and the contextual constraints among visual words in different human parts are exploited through the SPVP. Moreover, the SPVP is integrated into the inverted index structure to accelerate retrieval. Experiments and comparisons on our clothing dataset indicate that the SPVP significantly enhances the discriminative power of local features, with only a slight increase in memory usage and runtime compared to the BOW model. In these experiments, the approach outperforms both the state-of-the-art approach and two clothing search engines.
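
The idea of attaching part context to visual words inside an inverted index can be illustrated with a toy sketch. This is not the paper's SPVP formulation: the class name, the part labels, the integer word ids, and the TF-IDF-style scoring are all assumptions made for illustration. Each posting is keyed by a (part, visual word) pair, so a word only matches when it occurs on the same human part.

```python
from collections import defaultdict
import math


class PartAwareInvertedIndex:
    """Toy inverted index keyed by (part_id, visual_word) pairs (hypothetical)."""

    def __init__(self):
        # key -> {image_id: number of occurrences of that key in the image}
        self.postings = defaultdict(dict)
        self.num_images = 0

    def add_image(self, image_id, features):
        """Index one image given its (part_id, visual_word) pairs."""
        self.num_images += 1
        for key in features:
            counts = self.postings[key]
            counts[image_id] = counts.get(image_id, 0) + 1

    def query(self, features, top_k=5):
        """Rank indexed images by IDF-weighted overlap with the query."""
        query_counts = defaultdict(int)
        for key in features:
            query_counts[key] += 1

        scores = defaultdict(float)
        for key, q_tf in query_counts.items():
            posting = self.postings.get(key)
            if not posting:
                continue  # no indexed image has this word on this part
            # Assumed weighting: rarer (part, word) pairs count more.
            idf = math.log(1 + self.num_images / len(posting))
            for image_id, tf in posting.items():
                scores[image_id] += min(q_tf, tf) * idf
        # Highest score first; ties broken by image id for determinism.
        return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]


index = PartAwareInvertedIndex()
index.add_image("shop_a", [("torso", 17), ("torso", 42), ("left_arm", 17)])
index.add_image("shop_b", [("left_arm", 17), ("left_arm", 42), ("torso", 8)])

# Words 17 and 42 occur in both images, but only shop_a has them on the torso.
results = index.query([("torso", 17), ("torso", 42)])
```

With a plain BOW key (the word id alone), both images would match the query words 17 and 42; keying on the part as well keeps only the image whose matching words lie on the same part.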


    Published In

    ACCV'12: Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
    November 2012
    810 pages
    ISBN: 9783642374432
    Editors: Kyoung Mu Lee, Yasuyuki Matsushita, James M. Rehg, Zhanyi Hu

    Sponsors

    • DCC: Daejeon Convention Center
    • DMCITY: Daejeon Metropolitan City
    • KTO: Korea Tourism Organization
    • KAIST
    • DIME: Daejeon International Marketing Enterprise

    Publisher

    Springer-Verlag

    Berlin, Heidelberg

    Cited By

    • (2019) Interpretable Partitioned Embedding for Intelligent Multi-item Fashion Outfit Composition. ACM Transactions on Multimedia Computing, Communications, and Applications 15:2s (1-20). DOI: 10.1145/3326332. Online publication date: 29-Jul-2019
    • (2019) A Dustbin Category Based Feedback Incremental Learning Strategy for Hierarchical Image Classification. Pattern Recognition and Computer Vision (480-491). DOI: 10.1007/978-3-030-31654-9_41. Online publication date: 8-Nov-2019
    • (2018) Interpretable Partitioned Embedding for Customized Multi-item Fashion Outfit Composition. Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval (143-151). DOI: 10.1145/3206025.3206048. Online publication date: 5-Jun-2018
    • (2018) Personalized clothing recommendation combining user social circle and fashion style consistency. Multimedia Tools and Applications 77:14 (17731-17754). DOI: 10.1007/s11042-017-5245-1. Online publication date: 1-Jul-2018
    • (2018) Street2Fashion2Shop: Enabling Visual Search in Fashion e-Commerce Using Studio Images. Pattern Recognition Applications and Methods (3-26). DOI: 10.1007/978-3-030-05499-1_1. Online publication date: 16-Jan-2018
    • (2017) Unconstrained Fashion Landmark Detection via Hierarchical Recurrent Transformer Networks. Proceedings of the 25th ACM international conference on Multimedia (172-180). DOI: 10.1145/3123266.3123276. Online publication date: 23-Oct-2017
    • (2016) Deep Bi-directional Cross-triplet Embedding for Cross-Domain Clothing Retrieval. Proceedings of the 24th ACM international conference on Multimedia (52-56). DOI: 10.1145/2964284.2967182. Online publication date: 1-Oct-2016
    • (2016) Fast Cross-Scenario Clothing Retrieval Based on Indexing Deep Features. 17th Pacific-Rim Conference on Advances in Multimedia Information Processing - Volume 9916 (107-118). DOI: 10.1007/978-3-319-48890-5_11. Online publication date: 15-Sep-2016
    • (2014) Clothing Retrieval Based on Local Similarity with Multiple Images. Proceedings of the 22nd ACM international conference on Multimedia (1165-1168). DOI: 10.1145/2647868.2655021. Online publication date: 3-Nov-2014
    • (2014) Learning to Rank Similar Apparel Styles with Economically-Efficient Rule-Based Active Learning. Proceedings of International Conference on Multimedia Retrieval (361-368). DOI: 10.1145/2578726.2578773. Online publication date: 1-Apr-2014
