research-article

Collocation and Try-on Network: Whether an Outfit is Compatible

Authors:

Liqiang NieAuthors Info & Claims

MM '21: Proceedings of the 29th ACM International Conference on Multimedia

Pages 309 - 317

https://doi.org/10.1145/3474085.3475691

Published: 17 October 2021 Publication History

Abstract

Whether an outfit is compatible? Using machine learning methods to assess an outfit's compatibility, namely, fashion compatibility modeling (FCM), has recently become a popular yet challenging topic. However, current FCM studies still perform far from satisfactory, because they only consider the collocation compatibility modeling, while neglecting the natural human habits that people generally evaluate outfit compatibility from both the collocation (discrete assess) and the try-on (unified assess) perspectives. In light of the above analysis, we propose a Collocation and Try-On Network (CTO-Net) for FCM, combining both the collocation and try-on compatibilities. In particular, for the collocation perspective, we devise a disentangled graph learning scheme, where the collocation compatibility is disentangled into multiple fine-grained compatibilities between items; regarding the try-on perspective, we propose an integrated distillation learning scheme to unify all item information in the whole outfit to evaluate the compatibility based on the latent try-on representation. To further enhance the collocation and try-on compatibilities, we exploit the mutual learning strategy to obtain a more comprehensive judgment. Extensive experiments on the real-world dataset demonstrate that our CTO-Net significantly outperforms the state-of-the-art methods. In particular, compared with the competitive counterparts, our proposed CTO-Net significantly improves AUC accuracy from 83.2% to 87.8% and MRR from 15.4% to 21.8%. We have released our source codes and trained models to benefit other researchers.1

References

[1]

Taleb Alashkar, Songyao Jiang, Shuyang Wang, and Yun Fu. 2017. Examples-Rules Guided Deep Neural Network for Makeup Recommendation. In Proceedings of the AAAI Conference on Artificial Intelligence. AAAI, 941--947.

Digital Library

[2]

Suthee Chaidaroon, Yi Fang, Min Xie, and Alessandro Magnani. 2019. Neural Compatibility Ranking for Text-based Fashion Matching. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 1229--1232.

Digital Library

[3]

Zeyu Cui, Zekun Li, Shu Wu, Xiao-Yu Zhang, and Liang Wang. 2019. Dressing as a whole: Outfit compatibility learning based on node-wise graph neural networks. In Proceedings of the ACM International Conference on World Wide Web. ACM, 307--317.

Digital Library

[4]

Xue Dong, Xuemeng Song, Fuli Feng, Peiguang Jing, Xin-Shun Xu, and Liqiang Nie. 2019. Personalized Capsule Wardrobe Creation with Garment and User Modeling. In Proceedings of the 27th ACM International Conference on Multimedia. ACM, 302--310.

Digital Library

[5]

Xue Dong, Jianlong Wu, Xuemeng Song, Hongjun Dai, and Liqiang Nie. 2020. Fashion Compatibility Modeling through a Multi-modal Try-on-guided Scheme. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 771--780.

Digital Library

[6]

Zan Gao, Yin-ming Li, Wei-li Guan, Wei-zhi Nie, Zhi-yong Cheng, and An-an Liu. 2020. Pairwise view weighted graph network for view-based 3d model retrieval. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 129--138.

Digital Library

[7]

Xianjing Han, Xuemeng Song, Jianhua Yin, Yinglong Wang, and Liqiang Nie. 2019. Prototype-guided Attribute-wise Interpretable Scheme for Clothing Matching. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 785--794.

Digital Library

[8]

Xintong Han, Zuxuan Wu, Yu-Gang Jiang, and Larry S. Davis. 2017. Learning Fashion Compatibility with Bidirectional LSTMs. In Proceedings of the ACM International Conference on Multimedia. ACM, 1078--1086.

Digital Library

[9]

Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. 2015. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015).

[10]

Linmei Hu, Siyong Xu, Chen Li, Cheng Yang, Chuan Shi, Nan Duan, Xing Xie, and Ming Zhou. 2020. Graph Neural News Recommendation with Unsupervised Preference Disentanglement. In Proceedings of the Annual Meeting of the Association for Computational Linguistics. ACL, 4255--4264.

[11]

Zhiting Hu, Xuezhe Ma, Zhengzhong Liu, Eduard Hovy, H., and Eric Xing. 2016. Harnessing Deep Neural Networks with Logic Rules. In Proceedings of the Annual Meeting of the Association for Computational Linguistics. ACL, 2410--2420.

[12]

Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).

[13]

Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In International Conference on Learning Representations. ICLR.

[14]

Wei-Hong Li and Hakan Bilen. 2020. Knowledge Distillation for Multi-task Learning. In Proceedings of the European Conference on Computer Vision. Springer, 163--176.

Digital Library

[15]

Yuncheng Li, Liangliang Cao, Jiang Zhu, and Jiebo Luo. 2017. Mining Fashion Outfit Composition Using an End-to-End Deep Learning Approach on Set Data. IEEE Transactions on Multimedia, Vol. 19, 8 (2017), 1946--1955.

Digital Library

[16]

Zhi Li, Bo Wu, Qi Liu, Likang Wu, Hongke Zhao, and Tao Mei. 2020. Learning the Compositional Visual Coherence for Complementary Recommendations. In Proceedings of the International Joint Conference on Artificial Intelligence. IJCAI, 3536--3543.

[17]

Min Lin, Qiang Chen, and Shuicheng Yan. 2013. Network in network. arXiv preprint arXiv:1312.4400.

[18]

Yen-Liang Lin, Son Tran, and Larry S. Davis. 2020. Fashion Outfit Complementary Item Retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 3311--3319.

[19]

Fan Liu, Zhiyong Cheng, Changchang Sun, Yinglong Wang, Liqiang Nie, and Mohan Kankanhalli. 2019 a. User Diverse Preference Modeling by Multimodal Attentive Metric Learning. In Proceedings of the ACM International Conference on Multimedia. ACM, 1526--1534.

Digital Library

[20]

Jinhuan Liu, Xuemeng Song, Zhaochun Ren, Liqiang Nie, Zhaopeng Tu, and Jun Ma. 2020. Auxiliary Template-Enhanced Generative Compatibility Modeling. In Proceedings of the International Joint Conference on Artificial Intelligence. IJCAI, 3508--3514.

[21]

Meng Liu, Liqiang Nie, Xiang Wang, Qi Tian, and Baoquan Chen. 2019 b. Online Data Organizer: Micro-Video Categorization by Structure-Guided Multimodal Dictionary Learning. IEEE Transactions on Image Processing, Vol. 28, 3 (2019), 1235--1247.

Digital Library

[22]

Meng Liu, Xiang Wang, Liqiang Nie, Qi Tian, Baoquan Chen, and Tat-Seng Chua. 2018. Cross-Modal Moment Localization in Videos. In Proceedings of the ACM International Conference on Multimedia. ACM, 843--851.

Digital Library

[23]

Julian McAuley, Christopher Targett, Qinfeng Shi, and Anton Van Den Hengel. 2015. Image-based recommendations on styles and substitutes. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 43--52.

Digital Library

[24]

Liqiang Nie, Yongqi Li, Fuli Feng, Xuemeng Song, Meng Wang, and Yinglong Wang. 2020. Large-Scale Question Tagging via Joint Question-Topic Embedding Learning. ACM Transactions on Information Systems, Vol. 38, 2 (2020).

Digital Library

[25]

Mathias Niepert, Mohamed Ahmed, and Konstantin Kutzkov. 2016. Learning Convolutional Neural Networks for Graphs. In Proceedings of the International Conference on Machine Learning. ACM, 2014--2023.

Digital Library

[26]

Yehuda Koren Paolo Cremonesi and Roberto Turrin. 2010. Performance of recommender algorithms on top-n recommendation tasks. In Proceedings of the ACM Conference on Recommender Systems. ACM, 39--46.

Digital Library

[27]

Xuemeng Song, Fuli Feng, Xianjing Han, Xin Yang, Wei Liu, and Liqiang Nie. 2018. Neural compatibility modeling with attentive knowledge distillation. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 5--14.

Digital Library

[28]

Xuemeng Song, Fuli Feng, Jinhuan Liu, Zekun Li, Liqiang Nie, and Jun Ma. 2017. NeuroStylist: Neural Compatibility Modeling for Clothing Matching. In Proceedings of the ACM International Conference on Multimedia. ACM, 753--761.

Digital Library

[29]

Reuben Tan, Mariya I. Vasileva, Kate Saenko, and Bryan A. Plummer. 2019. Learning Similarity Conditions Without Explicit Supervision. In Proceedings of the IEEE International Conference on Computer Vision. IEEE, 10373--10382.

[30]

Mariya I. Vasileva, Bryan A. Plummer, Krishna Dusad, Shreya Rajpal, Ranjitha Kumar, and David Forsyth. 2018. Learning Type-Aware Embeddings for Fashion Compatibility. In Proceedings of the European Conference on Computer Vision. Springer, 390--405.

[31]

Xiang Wang, Xiangnan He, Meng Wang, Fuli Feng, and Tat-Seng Chua. 2019 a. Neural Graph Collaborative Filtering. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 165--174.

Digital Library

[32]

Xionghui Wang, Jian-Fang Hu, Jian-Huang Lai, Jianguo Zhang, and Wei-Shi Zheng. 2019 b. Progressive Teacher-Student Learning for Early Action Prediction. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 3556--3565.

[33]

Xiang Wang, Hongye Jin, An Zhang, Xiangnan He, Tong Xu, and Tat-Seng Chua. 2020. Disentangled Graph Collaborative Filtering. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 1001--1010.

Digital Library

[34]

Xin Wang, Bo Wu, and Yueqi Zhong. 2019 c. Outfit Compatibility Prediction and Diagnosis with Multi-Layered Comparison Network. In Proceedings of the ACM International Conference on Multimedia. ACM, 329--337.

Digital Library

[35]

Yinwei Wei, Xiang Wang, Weili Guan, Liqiang Nie, Zhouchen Lin, and Baoquan Chen. 2019 a. Neural multimodal cooperative learning toward micro-video understanding. IEEE Transactions on Image Processing, Vol. 29 (2019), 1--14.

[36]

Yinwei Wei, Xiang Wang, Liqiang Nie, Xiangnan He, Richang Hong, and Tat-Seng Chua. 2019 b. MMGCN: Multi-modal graph convolution network for personalized recommendation of micro-video. In Proceedings of the 27th ACM International Conference on Multimedia. ACM, 1437--1445.

Digital Library

[37]

Li Xingchen, Wang Xiang, Xiangnan He, Long Chen, Jun Xiao, and Tat-Seng Chua. 2020. Hierarchical Fashion Graph Network for Personalized Outfit Recommendation. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 159--168.

Digital Library

[38]

Xin Yang, Xuemeng Song, Xianjing Han, Haokun Wen, Jie Nie, and Liqiang Nie. 2020. Generative Attribute Manipulation Scheme for Flexible Fashion Search. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 941--950.

Digital Library

[39]

Jun Yu, Hao Zhou, Yibing Zhan, and Dacheng Tao. 2021. Deep Graph-neighbor Coherence Preserving Network for Unsupervised Cross-modal Hashing. In Proceedings of the AAAI Conference on Artificial Intelligence. AAAI, 4626--4634.

[40]

Mingkuan Yuan and Yuxin Peng. 2019. Ckd: Cross-task knowledge distillation for text-to-image synthesis. IEEE Transactions on Multimedia, Vol. 22, 8 (2019), 1955--1968.

[41]

Yibing Zhan, Jun Yu, Zhou Yu, Rong Zhang, Dacheng Tao, and Qi Tian. 2018. Comprehensive distance-preserving autoencoders for cross-modal retrieval. In Proceedings of the ACM international conference on Multimedia. ACM, 1137--1145.

Digital Library

[42]

Liheng Zhang, Guo-Jun Qi, Liqiang Wang, and Jiebo Luo. 2019. AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transformations Rather Than Data. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2547--2555.

[43]

Ying Zhang, Tao Xiang, Timothy M Hospedales, and Huchuan Lu. 2018. Deep mutual learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 4320--4328.

[44]

Na Zheng, Xuemeng Song, Zhaozheng Chen, Linmei Hu, Da Cao, and Liqiang Nie. 2019. Virtually Trying on New Clothing with Arbitrary Poses. In Proceedings of the ACM International Conference on Multimedia. ACM, 266--274.

Digital Library

[45]

Junyan Zhu, Taesung Park, Phillip Isola, and Alexei A Efros. 2017. Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision. IEEE, 2223--2232.

Cited By

Selwon KSzymański J(2024)A Review of Explainable Fashion Compatibility Modeling MethodsACM Computing Surveys10.1145/366461456:11(1-29)Online publication date: 28-Jun-2024
https://dl.acm.org/doi/10.1145/3664614
Wen HSong XYin JWu JGuan WNie L(2024)Self-Training Boosted Multi-Factor Matching Network for Composed Image RetrievalIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.334643446:5(3665-3678)Online publication date: May-2024
https://doi.org/10.1109/TPAMI.2023.3346434
Su ZChen YZhang FWang RZhou FLin G(2024)DMAP: Decoupling-Driven Multi-Level Attribute Parsing for Interpretable Outfit CollocationIEEE Transactions on Multimedia10.1109/TMM.2024.340254126(9988-10000)Online publication date: 2024
https://doi.org/10.1109/TMM.2024.3402541
Show More Cited By

Index Terms

Collocation and Try-on Network: Whether an Outfit is Compatible
1. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
  2. World Wide Web

Recommendations

Fashion Compatibility Modeling through a Multi-modal Try-on-guided Scheme
SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval

Recent years have witnessed a growing trend of fashion compatibility modeling, which scores the matching degree of the given outfit and then provides people with some dressing advice. Existing methods have primarily solved this problem by analyzing the ...
MCCP: multi-modal fashion compatibility and conditional preference model for personalized clothing recommendation
Abstract
Personalized clothing recommendation remains challenging due to the richness of fashion item representations, the non-uniqueness of fashion compatibility relationship and the complicated conditions of user preference. To address these problems, a ...
Limb-Aware Virtual Try-On Network With Progressive Clothing Warping
Image-based virtual try-on aims to transfer an in-shop clothing image to a person image. Most existing methods adopt a single global deformation to perform clothing warping directly, which lacks fine-grained modeling of in-shop clothing and leads to ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '21: Proceedings of the 29th ACM International Conference on Multimedia

October 2021

5796 pages

ISBN:9781450386517

DOI:10.1145/3474085

General Chairs:
Heng Tao Shen
University of Electronic Science&Technology of China, China
,
Yueting Zhuang
Zhejiang University, China
,
John R. Smith
IBM, USA
,
Program Chairs:
Yang Yang
University of Electronic Science and Technology of China, China
,
Pablo Cesar
CWI&TU Delft, The Netherlands
,
Florian Metze
FACEBOOK, Inc., USA
,
Balakrishnan Prabhakaran
University of Texas at Dallas, USA

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2021

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

the Key R&D Program of Shandong (Major scientific and technological innovation projects)
the Shandong Provincial Natural Science Foundation
the new AI project towards the integration of education and industry in QLUT
the National Natural Science Foundation of China
CCF-Baidu Open Fund

Conference

MM '21

Sponsor:

SIGMM

MM '21: ACM Multimedia Conference

October 20 - 24, 2021

Virtual Event, China

Acceptance Rates

Overall Acceptance Rate 995 of 4,171 submissions, 24%

Upcoming Conference

MM '24

Sponsor:
sigmm

The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

14
Total Citations
View Citations
330
Total Downloads

Downloads (Last 12 months)72
Downloads (Last 6 weeks)16

Reflects downloads up to 15 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

Selwon KSzymański J(2024)A Review of Explainable Fashion Compatibility Modeling MethodsACM Computing Surveys10.1145/366461456:11(1-29)Online publication date: 28-Jun-2024
https://dl.acm.org/doi/10.1145/3664614
Wen HSong XYin JWu JGuan WNie L(2024)Self-Training Boosted Multi-Factor Matching Network for Composed Image RetrievalIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.334643446:5(3665-3678)Online publication date: May-2024
https://doi.org/10.1109/TPAMI.2023.3346434
Su ZChen YZhang FWang RZhou FLin G(2024)DMAP: Decoupling-Driven Multi-Level Attribute Parsing for Interpretable Outfit CollocationIEEE Transactions on Multimedia10.1109/TMM.2024.340254126(9988-10000)Online publication date: 2024
https://doi.org/10.1109/TMM.2024.3402541
Ding YLai ZMok PChua T(2023)Computational Technologies for Fashion Recommendation: A SurveyACM Computing Surveys10.1145/362710056:5(1-45)Online publication date: 25-Nov-2023
https://dl.acm.org/doi/10.1145/3627100
Jing LSong XLin XZhao ZZhou WNie L(2023)Stylized Data-to-text Generation: A Case Study in the E-Commerce DomainACM Transactions on Information Systems10.1145/360337442:1(1-24)Online publication date: 18-Aug-2023
https://dl.acm.org/doi/10.1145/3603374
Wen HZhang XSong XWei YNie LEl Saddik AMei TCucchiara RBertini MTobon Vallejo DAtrey PHossain M(2023)Target-Guided Composed Image RetrievalProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3611817(915-923)Online publication date: 26-Oct-2023
https://dl.acm.org/doi/10.1145/3581783.3611817
Wang DZhao TYu WChawla NJiang M(2023)Deep Multimodal Complementarity LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.316518034:12(10213-10224)Online publication date: Dec-2023
https://doi.org/10.1109/TNNLS.2022.3165180
Zhu MZhan S(2022)Research and implementation of a trend prediction model based on trend similarity for the changing trends of fashion elements in clothingProceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering10.1145/3573428.3573685(1460-1466)Online publication date: 21-Oct-2022
https://dl.acm.org/doi/10.1145/3573428.3573685
Guan WWen HSong XWang CYeh CChang XNie L(2022)Partially Supervised Compatibility ModelingIEEE Transactions on Image Processing10.1109/TIP.2022.318729031(4733-4745)Online publication date: 2022
https://doi.org/10.1109/TIP.2022.3187290
Xiao LYamasaki T(2022)Sat: Self-Adaptive Training for Fashion Compatibility Prediction2022 IEEE International Conference on Image Processing (ICIP)10.1109/ICIP46576.2022.9897313(2431-2435)Online publication date: 16-Oct-2022
https://doi.org/10.1109/ICIP46576.2022.9897313
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents