Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3474085.3475691acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Collocation and Try-on Network: Whether an Outfit is Compatible

Published: 17 October 2021 Publication History

Abstract

Whether an outfit is compatible? Using machine learning methods to assess an outfit's compatibility, namely, fashion compatibility modeling (FCM), has recently become a popular yet challenging topic. However, current FCM studies still perform far from satisfactory, because they only consider the collocation compatibility modeling, while neglecting the natural human habits that people generally evaluate outfit compatibility from both the collocation (discrete assess) and the try-on (unified assess) perspectives. In light of the above analysis, we propose a Collocation and Try-On Network (CTO-Net) for FCM, combining both the collocation and try-on compatibilities. In particular, for the collocation perspective, we devise a disentangled graph learning scheme, where the collocation compatibility is disentangled into multiple fine-grained compatibilities between items; regarding the try-on perspective, we propose an integrated distillation learning scheme to unify all item information in the whole outfit to evaluate the compatibility based on the latent try-on representation. To further enhance the collocation and try-on compatibilities, we exploit the mutual learning strategy to obtain a more comprehensive judgment. Extensive experiments on the real-world dataset demonstrate that our CTO-Net significantly outperforms the state-of-the-art methods. In particular, compared with the competitive counterparts, our proposed CTO-Net significantly improves AUC accuracy from 83.2% to 87.8% and MRR from 15.4% to 21.8%. We have released our source codes and trained models to benefit other researchers.1

References

[1]
Taleb Alashkar, Songyao Jiang, Shuyang Wang, and Yun Fu. 2017. Examples-Rules Guided Deep Neural Network for Makeup Recommendation. In Proceedings of the AAAI Conference on Artificial Intelligence. AAAI, 941--947.
[2]
Suthee Chaidaroon, Yi Fang, Min Xie, and Alessandro Magnani. 2019. Neural Compatibility Ranking for Text-based Fashion Matching. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 1229--1232.
[3]
Zeyu Cui, Zekun Li, Shu Wu, Xiao-Yu Zhang, and Liang Wang. 2019. Dressing as a whole: Outfit compatibility learning based on node-wise graph neural networks. In Proceedings of the ACM International Conference on World Wide Web. ACM, 307--317.
[4]
Xue Dong, Xuemeng Song, Fuli Feng, Peiguang Jing, Xin-Shun Xu, and Liqiang Nie. 2019. Personalized Capsule Wardrobe Creation with Garment and User Modeling. In Proceedings of the 27th ACM International Conference on Multimedia. ACM, 302--310.
[5]
Xue Dong, Jianlong Wu, Xuemeng Song, Hongjun Dai, and Liqiang Nie. 2020. Fashion Compatibility Modeling through a Multi-modal Try-on-guided Scheme. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 771--780.
[6]
Zan Gao, Yin-ming Li, Wei-li Guan, Wei-zhi Nie, Zhi-yong Cheng, and An-an Liu. 2020. Pairwise view weighted graph network for view-based 3d model retrieval. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 129--138.
[7]
Xianjing Han, Xuemeng Song, Jianhua Yin, Yinglong Wang, and Liqiang Nie. 2019. Prototype-guided Attribute-wise Interpretable Scheme for Clothing Matching. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 785--794.
[8]
Xintong Han, Zuxuan Wu, Yu-Gang Jiang, and Larry S. Davis. 2017. Learning Fashion Compatibility with Bidirectional LSTMs. In Proceedings of the ACM International Conference on Multimedia. ACM, 1078--1086.
[9]
Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. 2015. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015).
[10]
Linmei Hu, Siyong Xu, Chen Li, Cheng Yang, Chuan Shi, Nan Duan, Xing Xie, and Ming Zhou. 2020. Graph Neural News Recommendation with Unsupervised Preference Disentanglement. In Proceedings of the Annual Meeting of the Association for Computational Linguistics. ACL, 4255--4264.
[11]
Zhiting Hu, Xuezhe Ma, Zhengzhong Liu, Eduard Hovy, H., and Eric Xing. 2016. Harnessing Deep Neural Networks with Logic Rules. In Proceedings of the Annual Meeting of the Association for Computational Linguistics. ACL, 2410--2420.
[12]
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[13]
Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In International Conference on Learning Representations. ICLR.
[14]
Wei-Hong Li and Hakan Bilen. 2020. Knowledge Distillation for Multi-task Learning. In Proceedings of the European Conference on Computer Vision. Springer, 163--176.
[15]
Yuncheng Li, Liangliang Cao, Jiang Zhu, and Jiebo Luo. 2017. Mining Fashion Outfit Composition Using an End-to-End Deep Learning Approach on Set Data. IEEE Transactions on Multimedia, Vol. 19, 8 (2017), 1946--1955.
[16]
Zhi Li, Bo Wu, Qi Liu, Likang Wu, Hongke Zhao, and Tao Mei. 2020. Learning the Compositional Visual Coherence for Complementary Recommendations. In Proceedings of the International Joint Conference on Artificial Intelligence. IJCAI, 3536--3543.
[17]
Min Lin, Qiang Chen, and Shuicheng Yan. 2013. Network in network. arXiv preprint arXiv:1312.4400.
[18]
Yen-Liang Lin, Son Tran, and Larry S. Davis. 2020. Fashion Outfit Complementary Item Retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 3311--3319.
[19]
Fan Liu, Zhiyong Cheng, Changchang Sun, Yinglong Wang, Liqiang Nie, and Mohan Kankanhalli. 2019 a. User Diverse Preference Modeling by Multimodal Attentive Metric Learning. In Proceedings of the ACM International Conference on Multimedia. ACM, 1526--1534.
[20]
Jinhuan Liu, Xuemeng Song, Zhaochun Ren, Liqiang Nie, Zhaopeng Tu, and Jun Ma. 2020. Auxiliary Template-Enhanced Generative Compatibility Modeling. In Proceedings of the International Joint Conference on Artificial Intelligence. IJCAI, 3508--3514.
[21]
Meng Liu, Liqiang Nie, Xiang Wang, Qi Tian, and Baoquan Chen. 2019 b. Online Data Organizer: Micro-Video Categorization by Structure-Guided Multimodal Dictionary Learning. IEEE Transactions on Image Processing, Vol. 28, 3 (2019), 1235--1247.
[22]
Meng Liu, Xiang Wang, Liqiang Nie, Qi Tian, Baoquan Chen, and Tat-Seng Chua. 2018. Cross-Modal Moment Localization in Videos. In Proceedings of the ACM International Conference on Multimedia. ACM, 843--851.
[23]
Julian McAuley, Christopher Targett, Qinfeng Shi, and Anton Van Den Hengel. 2015. Image-based recommendations on styles and substitutes. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 43--52.
[24]
Liqiang Nie, Yongqi Li, Fuli Feng, Xuemeng Song, Meng Wang, and Yinglong Wang. 2020. Large-Scale Question Tagging via Joint Question-Topic Embedding Learning. ACM Transactions on Information Systems, Vol. 38, 2 (2020).
[25]
Mathias Niepert, Mohamed Ahmed, and Konstantin Kutzkov. 2016. Learning Convolutional Neural Networks for Graphs. In Proceedings of the International Conference on Machine Learning. ACM, 2014--2023.
[26]
Yehuda Koren Paolo Cremonesi and Roberto Turrin. 2010. Performance of recommender algorithms on top-n recommendation tasks. In Proceedings of the ACM Conference on Recommender Systems. ACM, 39--46.
[27]
Xuemeng Song, Fuli Feng, Xianjing Han, Xin Yang, Wei Liu, and Liqiang Nie. 2018. Neural compatibility modeling with attentive knowledge distillation. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 5--14.
[28]
Xuemeng Song, Fuli Feng, Jinhuan Liu, Zekun Li, Liqiang Nie, and Jun Ma. 2017. NeuroStylist: Neural Compatibility Modeling for Clothing Matching. In Proceedings of the ACM International Conference on Multimedia. ACM, 753--761.
[29]
Reuben Tan, Mariya I. Vasileva, Kate Saenko, and Bryan A. Plummer. 2019. Learning Similarity Conditions Without Explicit Supervision. In Proceedings of the IEEE International Conference on Computer Vision. IEEE, 10373--10382.
[30]
Mariya I. Vasileva, Bryan A. Plummer, Krishna Dusad, Shreya Rajpal, Ranjitha Kumar, and David Forsyth. 2018. Learning Type-Aware Embeddings for Fashion Compatibility. In Proceedings of the European Conference on Computer Vision. Springer, 390--405.
[31]
Xiang Wang, Xiangnan He, Meng Wang, Fuli Feng, and Tat-Seng Chua. 2019 a. Neural Graph Collaborative Filtering. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 165--174.
[32]
Xionghui Wang, Jian-Fang Hu, Jian-Huang Lai, Jianguo Zhang, and Wei-Shi Zheng. 2019 b. Progressive Teacher-Student Learning for Early Action Prediction. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 3556--3565.
[33]
Xiang Wang, Hongye Jin, An Zhang, Xiangnan He, Tong Xu, and Tat-Seng Chua. 2020. Disentangled Graph Collaborative Filtering. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 1001--1010.
[34]
Xin Wang, Bo Wu, and Yueqi Zhong. 2019 c. Outfit Compatibility Prediction and Diagnosis with Multi-Layered Comparison Network. In Proceedings of the ACM International Conference on Multimedia. ACM, 329--337.
[35]
Yinwei Wei, Xiang Wang, Weili Guan, Liqiang Nie, Zhouchen Lin, and Baoquan Chen. 2019 a. Neural multimodal cooperative learning toward micro-video understanding. IEEE Transactions on Image Processing, Vol. 29 (2019), 1--14.
[36]
Yinwei Wei, Xiang Wang, Liqiang Nie, Xiangnan He, Richang Hong, and Tat-Seng Chua. 2019 b. MMGCN: Multi-modal graph convolution network for personalized recommendation of micro-video. In Proceedings of the 27th ACM International Conference on Multimedia. ACM, 1437--1445.
[37]
Li Xingchen, Wang Xiang, Xiangnan He, Long Chen, Jun Xiao, and Tat-Seng Chua. 2020. Hierarchical Fashion Graph Network for Personalized Outfit Recommendation. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 159--168.
[38]
Xin Yang, Xuemeng Song, Xianjing Han, Haokun Wen, Jie Nie, and Liqiang Nie. 2020. Generative Attribute Manipulation Scheme for Flexible Fashion Search. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 941--950.
[39]
Jun Yu, Hao Zhou, Yibing Zhan, and Dacheng Tao. 2021. Deep Graph-neighbor Coherence Preserving Network for Unsupervised Cross-modal Hashing. In Proceedings of the AAAI Conference on Artificial Intelligence. AAAI, 4626--4634.
[40]
Mingkuan Yuan and Yuxin Peng. 2019. Ckd: Cross-task knowledge distillation for text-to-image synthesis. IEEE Transactions on Multimedia, Vol. 22, 8 (2019), 1955--1968.
[41]
Yibing Zhan, Jun Yu, Zhou Yu, Rong Zhang, Dacheng Tao, and Qi Tian. 2018. Comprehensive distance-preserving autoencoders for cross-modal retrieval. In Proceedings of the ACM international conference on Multimedia. ACM, 1137--1145.
[42]
Liheng Zhang, Guo-Jun Qi, Liqiang Wang, and Jiebo Luo. 2019. AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transformations Rather Than Data. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2547--2555.
[43]
Ying Zhang, Tao Xiang, Timothy M Hospedales, and Huchuan Lu. 2018. Deep mutual learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 4320--4328.
[44]
Na Zheng, Xuemeng Song, Zhaozheng Chen, Linmei Hu, Da Cao, and Liqiang Nie. 2019. Virtually Trying on New Clothing with Arbitrary Poses. In Proceedings of the ACM International Conference on Multimedia. ACM, 266--274.
[45]
Junyan Zhu, Taesung Park, Phillip Isola, and Alexei A Efros. 2017. Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision. IEEE, 2223--2232.

Cited By

View all
  • (2024)A Review of Explainable Fashion Compatibility Modeling MethodsACM Computing Surveys10.1145/366461456:11(1-29)Online publication date: 28-Jun-2024
  • (2024)Self-Training Boosted Multi-Factor Matching Network for Composed Image RetrievalIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.334643446:5(3665-3678)Online publication date: May-2024
  • (2024)DMAP: Decoupling-Driven Multi-Level Attribute Parsing for Interpretable Outfit CollocationIEEE Transactions on Multimedia10.1109/TMM.2024.340254126(9988-10000)Online publication date: 2024
  • Show More Cited By

Index Terms

  1. Collocation and Try-on Network: Whether an Outfit is Compatible

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      MM '21: Proceedings of the 29th ACM International Conference on Multimedia
      October 2021
      5796 pages
      ISBN:9781450386517
      DOI:10.1145/3474085
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 17 October 2021

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. disentangled graph neural network
      2. fashion compatibility modeling
      3. try-on knowledge distillation

      Qualifiers

      • Research-article

      Funding Sources

      • the Key R&D Program of Shandong (Major scientific and technological innovation projects)
      • the Shandong Provincial Natural Science Foundation
      • the new AI project towards the integration of education and industry in QLUT
      • the National Natural Science Foundation of China
      • CCF-Baidu Open Fund

      Conference

      MM '21
      Sponsor:
      MM '21: ACM Multimedia Conference
      October 20 - 24, 2021
      Virtual Event, China

      Acceptance Rates

      Overall Acceptance Rate 995 of 4,171 submissions, 24%

      Upcoming Conference

      MM '24
      The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)72
      • Downloads (Last 6 weeks)16
      Reflects downloads up to 15 Oct 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)A Review of Explainable Fashion Compatibility Modeling MethodsACM Computing Surveys10.1145/366461456:11(1-29)Online publication date: 28-Jun-2024
      • (2024)Self-Training Boosted Multi-Factor Matching Network for Composed Image RetrievalIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.334643446:5(3665-3678)Online publication date: May-2024
      • (2024)DMAP: Decoupling-Driven Multi-Level Attribute Parsing for Interpretable Outfit CollocationIEEE Transactions on Multimedia10.1109/TMM.2024.340254126(9988-10000)Online publication date: 2024
      • (2023)Computational Technologies for Fashion Recommendation: A SurveyACM Computing Surveys10.1145/362710056:5(1-45)Online publication date: 25-Nov-2023
      • (2023)Stylized Data-to-text Generation: A Case Study in the E-Commerce DomainACM Transactions on Information Systems10.1145/360337442:1(1-24)Online publication date: 18-Aug-2023
      • (2023)Target-Guided Composed Image RetrievalProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3611817(915-923)Online publication date: 26-Oct-2023
      • (2023)Deep Multimodal Complementarity LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.316518034:12(10213-10224)Online publication date: Dec-2023
      • (2022)Research and implementation of a trend prediction model based on trend similarity for the changing trends of fashion elements in clothingProceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering10.1145/3573428.3573685(1460-1466)Online publication date: 21-Oct-2022
      • (2022)Partially Supervised Compatibility ModelingIEEE Transactions on Image Processing10.1109/TIP.2022.318729031(4733-4745)Online publication date: 2022
      • (2022)Sat: Self-Adaptive Training for Fashion Compatibility Prediction2022 IEEE International Conference on Image Processing (ICIP)10.1109/ICIP46576.2022.9897313(2431-2435)Online publication date: 16-Oct-2022
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media