Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3366423.3380096acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

OutfitNet: Fashion Outfit Recommendation with Attention-Based Multiple Instance Learning

Published: 20 April 2020 Publication History

Abstract

Recommending fashion outfits to users presents several challenges. First of all, an outfit consists of multiple fashion items, and each user emphasizes different parts of an outfit when considering whether they like it or not. Secondly, a user’s liking for a fashion outfit considers not only the aesthetics of each item but also the compatibility among them. Lastly, fashion outfit data is often sparse in terms of the relationship between users and fashion outfits. Not to mention, we can only obtain what the users like, but not what they dislike.
To address the above challenges, in this paper, we formulate the fashion outfit recommendation problem as a multiple-instance-learning (MIL) problem. We propose OutfitNet, a fashion outfit recommendation framework that includes two stages. The first stage is a Fashion Item Relevancy network (FIR), which learns the compatibility between fashion items and further generates relevancy embedding of fashion items. In the second stage, an Outfit Preference network (OP) learns the users’ tastes for fashion outfits using visual information. OutfitNet takes in multiple fashion items in a fashion outfit as input, learns the compatibility among fashion items, the users’ tastes toward each item, as well as the users’ attention on different items in the outfit with the attention mechanism.
Quantitatively, our experiments show that OutfitNet outperforms state-of-the-art models in two tasks: fill-in-the-blank (FITB) and personalized outfit recommendation. Qualitatively, we demonstrate that the learned personalized item scores and attention scores capture well the users’ fashion tastes, and the learned fashion item embeddings capture well the compatibility relationships among fashion items. We also leverage the learned fashion item embedding and propose a simple fashion outfit generation framework, which is shown to produce high-quality fashion outfit combinations.

References

[1]
Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, Manjunath Kudlur, Josh Levenberg, Rajat Monga, Sherry Moore, Derek Gordon Murray, Benoit Steiner, Paul A. Tucker, Vijay Vasudevan, Pete Warden, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016. TensorFlow: A System for Large-Scale Machine Learning. In 12th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2016, Savannah, GA, USA, November 2-4, 2016.265–283. https://www.usenix.org/conference/osdi16/technical-sessions/presentation/abadi
[2]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural Machine Translation by Jointly Learning to Align and Translate. CoRR abs/1409.0473(2014). arxiv:1409.0473http://arxiv.org/abs/1409.0473
[3]
Jane Bromley, Isabelle Guyon, Yann LeCun, Eduard Säckinger, and Roopak Shah. 1994. Signature verification using a” siamese” time delay neural network. In Advances in neural information processing systems. 737–744.
[4]
Ângelo Cardoso, Fabio Daolio, and Saúl Vargas. 2018. Product Characterisation towards Personalisation: Learning Attributes from Unstructured Data to Recommend Fashion Products. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, KDD 2018, London, UK, August 19-23, 2018. 80–89. https://doi.org/10.1145/3219819.3219888
[5]
Kuang-Yu Chang, Kung-Hung Lu, and Chu-Song Chen. 2017. Aesthetic Critiques Generation for Photos. In IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, October 22-29, 2017. 3534–3543. https://doi.org/10.1109/ICCV.2017.380
[6]
Wen Chen, Pipei Huang, Jiaming Xu, Xin Guo, Cheng Guo, Fei Sun, Chao Li, Andreas Pfadler, Huan Zhao, and Binqiang Zhao. 2019. POG: Personalized Outfit Generation for Fashion Recommendation at Alibaba iFashion. arXiv preprint arXiv:1905.01866(2019).
[7]
Jorge Alé Chilet, Cuicui Chen, and Yusan Lin. 2016. Analyzing social media marketing in the high-end fashion industry using Named Entity Recognition. In Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. IEEE Press, 621–622.
[8]
John Collomosse, Tu Bui, Michael J. Wilber, Chen Fang, and Hailin Jin. 2017. Sketching with Style: Visual Search with Sketches and Aesthetic Context. In IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, October 22-29, 2017. 2679–2687. https://doi.org/10.1109/ICCV.2017.290
[9]
Zeyu Cui, Zekun Li, Shu Wu, Xiaoyu Zhang, and Liang Wang. 2019. Dressing as a Whole: Outfit Compatibility Learning Based on Node-wise Graph Neural Networks. In The World Wide Web Conference, WWW 2019, San Francisco, CA, USA, May 13-17, 2019. 307–317. https://doi.org/10.1145/3308558.3313444
[10]
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Fei-Fei Li. 2009. ImageNet: A large-scale hierarchical image database. In 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 20-25 June 2009, Miami, Florida, USA. 248–255. https://doi.org/10.1109/CVPRW.2009.5206848
[11]
Xintong Han, Zuxuan Wu, Yu-Gang Jiang, and Larry S. Davis. 2017. Learning Fashion Compatibility with Bidirectional LSTMs. In Proceedings of the 2017 ACM on Multimedia Conference, MM 2017, Mountain View, CA, USA, October 23-27, 2017. 1078–1086. https://doi.org/10.1145/3123266.3123394
[12]
Wei-Lin Hsiao and Kristen Grauman. 2017. Learning the Latent ”Look”: Unsupervised Discovery of a Style-Coherent Embedding from Fashion Images. In IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, October 22-29, 2017. 4213–4222. https://doi.org/10.1109/ICCV.2017.451
[13]
Gao Huang, Zhuang Liu, Laurens van der Maaten, and Kilian Q. Weinberger. 2017. Densely Connected Convolutional Networks. In 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, July 21-26, 2017. 2261–2269. https://doi.org/10.1109/CVPR.2017.243
[14]
Maximilian Ilse, Jakub M. Tomczak, and Max Welling. 2018. Attention-based Deep Multiple Instance Learning. In Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018. 2132–2141. http://proceedings.mlr.press/v80/ilse18a.html
[15]
Wang-Cheng Kang, Chen Fang, Zhaowen Wang, and Julian McAuley. 2017. Visually-Aware Fashion Recommendation and Design with Generative Image Models. In 2017 IEEE International Conference on Data Mining, ICDM 2017, New Orleans, LA, USA, November 18-21, 2017. 207–216. https://doi.org/10.1109/ICDM.2017.30
[16]
Wang-Cheng Kang, Eric Kim, Jure Leskovec, Charles Rosenberg, and Julian J. McAuley. 2018. Complete the Look: Scene-based Complementary Product Recommendation. CoRR abs/1812.01748(2018). arxiv:1812.01748http://arxiv.org/abs/1812.01748
[17]
Yuncheng Li, Liangliang Cao, Jiang Zhu, and Jiebo Luo. 2017. Mining Fashion Outfit Composition Using an End-to-End Deep Learning Approach on Set Data. IEEE Trans. Multimedia 19, 8 (2017), 1946–1955. https://doi.org/10.1109/TMM.2017.2690144
[18]
Yujie Lin, Pengjie Ren, Zhumin Chen, Zhaochun Ren, Jun Ma, and Maarten de Rijke. 2018. Explainable Fashion Recommendation with Joint Outfit Matching and Comment Generation. CoRR abs/1806.08977(2018). arxiv:1806.08977http://arxiv.org/abs/1806.08977
[19]
Yusan Lin, Heng Xu, Yilu Zhou, and Wang-Chien Lee. 2015. Styles in the fashion social network: an analysis on Lookbook. nu. In International Conference on Social Computing, Behavioral-Cultural Modeling, and Prediction. Springer, 356–361.
[20]
Yusan Lin, Yilu Zhou, and Heng Xu. 2014. The hidden influence network in the fashion industry. In The 24th Annual Workshop on Information Technologies and Systems (WITS), Vol. 1.
[21]
Yusan Lin, Yilu Zhou, and Heng Xu. 2015. Text-Generated Fashion Influence Model: An Empirical Study on Style.com. In 48th Hawaii International Conference on System Sciences, HICSS 2015, Kauai, Hawaii, USA, January 5-8, 2015. 3642–3650. https://doi.org/10.1109/HICSS.2015.438
[22]
Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of machine learning research 9, Nov (2008), 2579–2605.
[23]
Oded Maron and Tomás Lozano-Pérez. 1997. A Framework for Multiple-Instance Learning. In Advances in Neural Information Processing Systems 10, [NIPS Conference, Denver, Colorado, USA, 1997]. 570–576. http://papers.nips.cc/paper/1346-a-framework-for-multiple-instance-learning
[24]
Jian Ren, Xiaohui Shen, Zhe L. Lin, Radomír Mech, and David J. Foran. 2017. Personalized Image Aesthetics. In IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, October 22-29, 2017. 638–647. https://doi.org/10.1109/ICCV.2017.76
[25]
Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2009. BPR: Bayesian Personalized Ranking from Implicit Feedback. In UAI 2009, Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, Montreal, QC, Canada, June 18-21, 2009. 452–461. https://dslpitt.org/uai/displayArticleDetails.jsp?mmnu=1&smnu=2&article_id=1630&proceeding_id=25
[26]
Florian Schroff, Dmitry Kalenichenko, and James Philbin. 2015. Facenet: A unified embedding for face recognition and clustering. In Proceedings of the IEEE conference on computer vision and pattern recognition. 815–823.
[27]
Xuemeng Song, Fuli Feng, Xianjing Han, Xin Yang, Wei Liu, and Liqiang Nie. 2018. Neural Compatibility Modeling with Attentive Knowledge Distillation. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, SIGIR 2018, Ann Arbor, MI, USA, July 08-12, 2018. 5–14. https://doi.org/10.1145/3209978.3209996
[28]
Xuemeng Song, Fuli Feng, Jinhuan Liu, Zekun Li, Liqiang Nie, and Jun Ma. 2017. NeuroStylist: Neural Compatibility Modeling for Clothing Matching. In Proceedings of the 2017 ACM on Multimedia Conference, MM 2017, Mountain View, CA, USA, October 23-27, 2017. 753–761. https://doi.org/10.1145/3123266.3123314
[29]
Mariya I. Vasileva, Bryan A. Plummer, Krishna Dusad, Shreya Rajpal, Ranjitha Kumar, and David A. Forsyth. 2018. Learning Type-Aware Embeddings for Fashion Compatibility. In Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8-14, 2018, Proceedings, Part XVI. 405–421. https://doi.org/10.1007/978-3-030-01270-0_24
[30]
Sirion Vittayakorn, Kota Yamaguchi, Alexander C Berg, and Tamara L Berg. 2015. Runway to realway: Visual analysis of fashion. In 2015 IEEE Winter Conference on Applications of Computer Vision. IEEE, 951–958.
[31]
Wenguan Wang and Jianbing Shen. 2017. Deep Cropping via Attention Box Prediction and Aesthetics Assessment. In IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, October 22-29, 2017. 2205–2213. https://doi.org/10.1109/ICCV.2017.240
[32]
Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron C. Courville, Ruslan Salakhutdinov, Richard S. Zemel, and Yoshua Bengio. 2015. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention. In Proceedings of the 32nd International Conference on Machine Learning, ICML 2015, Lille, France, 6-11 July 2015. 2048–2057. http://jmlr.org/proceedings/papers/v37/xuc15.html
[33]
Ruiping Yin, Kan Li, Jie Lu, and Guangquan Zhang. 2019. Enhancing Fashion Recommendation with Visual Compatibility Relationship. In The World Wide Web Conference, WWW 2019, San Francisco, CA, USA, May 13-17, 2019. 3434–3440. https://doi.org/10.1145/3308558.3313739
[34]
Wenhui Yu, Huidi Zhang, Xiangnan He, Xu Chen, Li Xiong, and Zheng Qin. 2018. Aesthetic-based Clothing Recommendation. In Proceedings of the 2018 World Wide Web Conference on World Wide Web, WWW 2018, Lyon, France, April 23-27, 2018. 649–658. https://doi.org/10.1145/3178876.3186146
[35]
Manzil Zaheer, Satwik Kottur, Siamak Ravanbakhsh, Barnabás Póczos, Ruslan R. Salakhutdinov, and Alexander J. Smola. 2017. Deep Sets. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 4-9 December 2017, Long Beach, CA, USA. 3394–3404. http://papers.nips.cc/paper/6931-deep-sets

Cited By

View all
  • (2024)A Review of Explainable Fashion Compatibility Modeling MethodsACM Computing Surveys10.1145/366461456:11(1-29)Online publication date: 28-Jun-2024
  • (2024)Revolutionizing Fashion Recommendations: A Deep Dive into Deep Learning-based Recommender SystemsProceedings of the 7th International Conference on Networking, Intelligent Systems and Security10.1145/3659677.3659678(1-8)Online publication date: 18-Apr-2024
  • (2024)Less is More: A Streamlined Graph-Based Fashion Outfit Recommendation without Multimodal DependencyProceedings of the 39th ACM/SIGAPP Symposium on Applied Computing10.1145/3605098.3636168(492-495)Online publication date: 8-Apr-2024
  • Show More Cited By

Index Terms

  1. OutfitNet: Fashion Outfit Recommendation with Attention-Based Multiple Instance Learning
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          WWW '20: Proceedings of The Web Conference 2020
          April 2020
          3143 pages
          ISBN:9781450370233
          DOI:10.1145/3366423
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 20 April 2020

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. fashion item relevancy
          2. fashion outfit generation
          3. fashion outfit recommendation

          Qualifiers

          • Research-article
          • Research
          • Refereed limited

          Conference

          WWW '20
          Sponsor:
          WWW '20: The Web Conference 2020
          April 20 - 24, 2020
          Taipei, Taiwan

          Acceptance Rates

          Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)104
          • Downloads (Last 6 weeks)5
          Reflects downloads up to 10 Nov 2024

          Other Metrics

          Citations

          Cited By

          View all
          • (2024)A Review of Explainable Fashion Compatibility Modeling MethodsACM Computing Surveys10.1145/366461456:11(1-29)Online publication date: 28-Jun-2024
          • (2024)Revolutionizing Fashion Recommendations: A Deep Dive into Deep Learning-based Recommender SystemsProceedings of the 7th International Conference on Networking, Intelligent Systems and Security10.1145/3659677.3659678(1-8)Online publication date: 18-Apr-2024
          • (2024)Less is More: A Streamlined Graph-Based Fashion Outfit Recommendation without Multimodal DependencyProceedings of the 39th ACM/SIGAPP Symposium on Applied Computing10.1145/3605098.3636168(492-495)Online publication date: 8-Apr-2024
          • (2024)AI in fashion: a literature reviewElectronic Commerce Research10.1007/s10660-024-09872-zOnline publication date: 19-Jun-2024
          • (2023)Reproducibility in multiple instance learningProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3666718(13530-13544)Online publication date: 10-Dec-2023
          • (2023)Fashion clothing matching by global-local feature optimizationJournal of Image and Graphics10.11834/jig.21117028:4(1104-1118)Online publication date: 2023
          • (2023)Computational Technologies for Fashion Recommendation: A SurveyACM Computing Surveys10.1145/362710056:5(1-45)Online publication date: 25-Nov-2023
          • (2023)A Review of Modern Fashion Recommender SystemsACM Computing Surveys10.1145/362473356:4(1-37)Online publication date: 21-Oct-2023
          • (2023)Deep Multimodal Complementarity LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.316518034:12(10213-10224)Online publication date: Dec-2023
          • (2023)Personalized Fashion Recommendation With Discrete Content-Based Tensor FactorizationIEEE Transactions on Multimedia10.1109/TMM.2022.318674425(5053-5064)Online publication date: 2023
          • Show More Cited By

          View Options

          Get Access

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format.

          HTML Format

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media