Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1007/978-3-642-33712-3_44guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Describing clothing by semantic attributes

Published: 07 October 2012 Publication History

Abstract

Describing clothing appearance with semantic attributes is an appealing technique for many important applications. In this paper, we propose a fully automated system that is capable of generating a list of nameable attributes for clothes on human body in unconstrained images. We extract low-level features in a pose-adaptive manner, and combine complementary features for learning attribute classifiers. Mutual dependencies between the attributes are then explored by a Conditional Random Field to further improve the predictions from independent classifiers. We validate the performance of our system on a challenging clothing attribute dataset, and introduce a novel application of dressing style analysis that utilizes the semantic attributes produced by our system.

References

[1]
Kumar, N., Belhumeur, P.N., Nayar, S.K.: FaceTracer: A Search Engine for Large Collections of Images with Faces. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part IV. LNCS, vol. 5305, pp. 340-353. Springer, Heidelberg (2008).
[2]
Anguelov, D., Lee, K., Gokturk, S.B., Sumengen, B.: Contextual identity recognition in personal photo albums. In: CVPR (2007).
[3]
Lin, D., Kapoor, A., Hua, G., Baker, S.: Joint People, Event, and Location Recognition in Personal Photo Collections Using Cross-Domain Context. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 243-256. Springer, Heidelberg (2010).
[4]
Gallagher, A.C., Chen, T.: Clothing cosegmentation for recognizing people. In: CVPR (2008).
[5]
Cao, L., Dikmen, M., Fu, Y., Huang, T.S.: Gender recognition from body. ACM Multimedia (2008).
[6]
Bourdev, L., Maji, S., Malik, J.: Describing people: Poselet-based attribute classification. In: ICCV (2011).
[7]
Eichner, M., Marin-Jimenez, M., Zisserman, A., Ferrari, V.: Articulated human pose estimation and search in (almost) unconstrained still images. Technical Report 272, ETH Zurich, D-ITET, BIWI (2010).
[8]
Ferrari, V., Zisserman, A.: Learning visual attributes. In: NIPS (2007).
[9]
Farhadi, A., Endres, I., Hoiem, D., Forsyth, D.: Describing objects by their attributes. In: CVPR (2009).
[10]
Russakovsky, O., Fei-Fei, L.: Attribute learning in large-scale datasets. In: ECCV (2010).
[11]
Parikh, D., Grauman, K.: Interactively building a discriminative vocabulary of nameable attributes. In: CVPR (2011).
[12]
Kumar, N., Berg, A.C., Belhumeur, P.N., Nayar, S.K.: Attribute and simile classifiers for face verification. In: ICCV (2009).
[13]
Siddiquie, B., Feris, R.S., Davis, L.S.: Image ranking and retrieval based on multiattribute queries. In: CVPR (2011).
[14]
Berg, T.L., Berg, A.C., Shih, J.: Automatic Attribute Discovery and Characterization from Noisy Web Data. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 663-676. Springer, Heidelberg (2010).
[15]
Kulkarni, G., Premraj, V., Dhar, S., Li, S., Berg, A., Choi, Y., Berg, T.: Baby talk: Understanding and generating image descriptions. In: CVPR (2011).
[16]
Farhadi, A., Hejrati, M., Sadeghi, M.A., Young, P., Rashtchian, C., Hockenmaier, J., Forsyth, D.: Every Picture Tells a Story: Generating Sentences from Images. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 15-29. Springer, Heidelberg (2010).
[17]
Bourdev, L., Malik, J.: Poselets: Body part detectors trained using 3d human pose annotations. In: ICCV (2009).
[18]
Song, Z., Wang, M., Hua, X., Yan, S.: Predicting occupation via human clothing and contexts. In: ICCV (2011).
[19]
Yang, M., Yu, K.: Real-time clothing recognition in surveillance videos. In: ICIP (2011).
[20]
Zhang, W., Begole, B., Chu, M., Liu, J., Yee, N.: Real-time clothes comparison based on multi-view vision. In: ICDSC (2008).
[21]
Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part based models. PAMI (2009).
[22]
Yang, Y., Ramanan, D.: Articulated pose estimation with flexible mixtures-of-parts. In: CVPR (2011).
[23]
Shotton, J., Fitzgibbon, A., Cook, M., Blake, A.: Real-time human pose recognition in parts from single depth images. In: CVPR (2011).
[24]
Viola, P., Jones, M.: Robust real-time object detection. IJCV (2001).
[25]
Rother, C., Kolmogorov, V., Blake, A.: Grabcut - interactive foreground extraction using iterated graph cuts. In: SIGGRAPH (2004).
[26]
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. IJCV (2004).
[27]
Varma, M., Zisserman, A.: A statistical approach to texture classification from single images. IJCV (2005).
[28]
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: CVPR (2006).
[29]
Chang, C.C., Lin, C.J.: LIBSVM: A library for support vector machines. ACM Trans. on Intel. Sys. and Tech. (2011).
[30]
Xiao, J., Hays, J., Ehinger, K.A., Torralba, A., Oliva, A.: Sun database: Large scale scene recognition from abbey to zoo. In: CVPR (2010).
[31]
Tappen, M.F., Freeman, W.T.: Comparison of graph cuts with belief propagation for stereo, using identical mrf parameters. In: ICCV (2003).
[32]
Gallagher, A.C., Chen, T.: Understanding images of groups of people. In: CVPR (2009).

Cited By

View all
  • (2023)Multimodal Fashion Knowledge Extraction as CaptioningProceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region10.1145/3624918.3625315(52-62)Online publication date: 26-Nov-2023
  • (2023)A Review of Modern Fashion Recommender SystemsACM Computing Surveys10.1145/362473356:4(1-37)Online publication date: 21-Oct-2023
  • (2023)CoTel: Ontology-Neural Co-Enhanced Text LabelingProceedings of the ACM Web Conference 202310.1145/3543507.3583533(1897-1906)Online publication date: 30-Apr-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
ECCV'12: Proceedings of the 12th European conference on Computer Vision - Volume Part III
October 2012
880 pages
ISBN:9783642337116
  • Editors:
  • Andrew Fitzgibbon,
  • Svetlana Lazebnik,
  • Pietro Perona,
  • Yoichi Sato,
  • Cordelia Schmid

Sponsors

  • TOYOTA: TOYOTA
  • Google Inc.
  • IBMR: IBM Research
  • NVIDIA
  • Microsoft Reasearch: Microsoft Reasearch

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 07 October 2012

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 04 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Multimodal Fashion Knowledge Extraction as CaptioningProceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region10.1145/3624918.3625315(52-62)Online publication date: 26-Nov-2023
  • (2023)A Review of Modern Fashion Recommender SystemsACM Computing Surveys10.1145/362473356:4(1-37)Online publication date: 21-Oct-2023
  • (2023)CoTel: Ontology-Neural Co-Enhanced Text LabelingProceedings of the ACM Web Conference 202310.1145/3543507.3583533(1897-1906)Online publication date: 30-Apr-2023
  • (2023)AABLSTM: A Novel Multi-task Based CNN-RNN Deep Model for Fashion AnalysisACM Transactions on Multimedia Computing, Communications, and Applications10.1145/351902919:1(1-18)Online publication date: 5-Jan-2023
  • (2023)Fashion Label Relation Networks for Attribute RecognitionArtificial Intelligence10.1007/978-981-99-8850-1_28(340-351)Online publication date: 22-Jul-2023
  • (2023)Transpose and Mask: Simple and Effective Logit-Based Knowledge Distillation for Multi-attribute and Multi-label ClassificationPattern Recognition and Computer Vision10.1007/978-981-99-8549-4_23(273-284)Online publication date: 13-Oct-2023
  • (2023)PAR Contest 2023: Pedestrian Attributes Recognition with Multi-task LearningComputer Analysis of Images and Patterns10.1007/978-3-031-44237-7_1(3-12)Online publication date: 25-Sep-2023
  • (2022)Unsupervised multi-modal modeling of fashion styles with visual attributesApplied Soft Computing10.1016/j.asoc.2021.108214115:COnline publication date: 1-Jan-2022
  • (2022)A Hybrid Recommender System with Implicit Feedbacks in Fashion RetailAIxIA 2022 – Advances in Artificial Intelligence10.1007/978-3-031-27181-6_15(212-224)Online publication date: 28-Nov-2022
  • (2022)Webly Supervised Concept Expansion for General Purpose Vision ModelsComputer Vision – ECCV 202210.1007/978-3-031-20059-5_38(662-681)Online publication date: 23-Oct-2022
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media