Multi-label Image Set Recognition in Visually-Aware Recommender Systems

Demochkin, Kirill; Savchenko, Andrey V.

doi:10.1007/978-3-030-37334-4_26

Kirill Demochkin²² &
Andrey V. Savchenko²²

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11832))

Included in the following conference series:

International Conference on Analysis of Images, Social Networks and Texts

1091 Accesses

Abstract

In this paper we focus on the problem of multi-label image recognition for visually-aware recommender systems. We propose a two stage approach in which a deep convolutional neural network is firstly fine-tuned on a part of the training set. Secondly, an attention-based aggregation network is trained to compute the weighted average of visual features in an input image set. Our approach is implemented as a mobile fashion recommender system application. It is experimentally show on the Amazon Fashion dataset that our approach achieves an F1-measure of 0.58 for 15 recommendations, which is twice as good as the 0.25 F1-measure for conventional averaging of feature vectors.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

User Preference Prediction in a Set of Photos Based on Neural Aggregation Network

Neural Attention Mechanism and Linear Squeezing of Descriptors in Image Classification for Visual Recommender Systems

Article 01 October 2020

Visual Hybrid Recommendation Systems Based on the Content-Based Filtering

References

Shankar, D., Narumanchi, S., Ananya, H., Kompalli, P., Chaudhury, K.: Deep learning based large scale visual recommendation and search for e-commerce. arXiv preprint arXiv:1703.02344 (2017)
Bokde, D., Girase, S., Mukhopadhyay, D.: Matrix factorization model in collaborative filtering algorithms: a survey. Procedia Comput. Sci. 49, 136–146 (2015)
Article Google Scholar
Zhou, Y., Wilkinson, D., Schreiber, R., Pan, R.: Large-scale parallel collaborative filtering for the Netflix Prize. In: Fleischer, R., Xu, J. (eds.) AAIM 2008. LNCS, vol. 5034, pp. 337–348. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-68880-8_32
Chapter Google Scholar
Park, D.H., Kim, H.K., Choi, I.Y., Kim, J.K.: A literature review and classification of recommender systems research. Expert Syst. Appl. 39(11), 10059–10072 (2012)
Article Google Scholar
McAuley, J., Targett, C., Shi, Q., Van Den Hengel, A.: Image-based recommendations on styles and substitutes. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 43–52. ACM (2015)
Google Scholar
de Barros Costa, E., Rocha, H.J.B., Silva, E.T., Lima, N.C., Cavalcanti, J.: Understanding and personalising clothing recommendation for women. In: Rocha, Á., Correia, A.M., Adeli, H., Reis, L.P., Costanzo, S. (eds.) WorldCIST 2017. AISC, vol. 569, pp. 841–850. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-56535-4_82
Chapter Google Scholar
Yang, Z., Su, Z., Yang, Y., Lin, G.: From recommendation to generation: a novel fashion clothing advising framework. In: 2018 7th International Conference on Digital Home (ICDH), pp. 180–186. IEEE (2018)
Google Scholar
Andreeva, E., Ignatov, D.I., Grachev, A., Savchenko, A.V.: Extraction of visual features for recommendation of products via deep learning. In: van der Aalst, W.M.P., et al. (eds.) AIST 2018. LNCS, vol. 11179, pp. 201–210. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-11027-7_20
Chapter Google Scholar
Kang, W.C., Fang, C., Wang, Z., McAuley, J.: Visually-aware fashion recommendation and design with generative image models. In: 2017 IEEE International Conference on Data Mining (ICDM), pp. 207–216. IEEE (2017)
Google Scholar
Packer, C., McAuley, J., Ramisa, A.: Visually-aware personalized recommendation using interpretable image representations. arXiv preprint arXiv:1806.09820 (2018)
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016)
MATH Google Scholar
Miech, A., Laptev, I., Sivic, J.: Learnable pooling with context gating for video classification. arXiv preprint arXiv:1706.06905 (2017)
Yang, J., et al.: Neural aggregation network for video face recognition. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5216–5225. IEEE (2017)
Google Scholar
Iandola, F., Han, S., Moskewicz, M., Ashraf, K., Dally, W., Keutzer, K.: SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and $<$0.5 mb model size. arXiv preprint arXiv:1602.07360 (2016)
Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017)
Google Scholar
Savchenko, A.: Sequential three-way decisions in multi-category image recognition with deep features based on distance factor. Inf. Sci. 489, 18–36 (2019)
Article MathSciNet Google Scholar
Howard, A., et al.: MobileNets: efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017)

Download references

Acknowledgements

The paper was prepared within the framework of the Academic Fund Program at the National Research University Higher School of Economics (HSE) in 2019 (grant No. 19-04-0004) and by the Russian Academic Excellence Project 5-100.

Author information

Authors and Affiliations

Laboratory of Algorithms and Technologies for Network Analysis, National Research University Higher School of Economics, Nizhny Novgorod, Russia
Kirill Demochkin & Andrey V. Savchenko

Authors

Kirill Demochkin
View author publications
You can also search for this author in PubMed Google Scholar
Andrey V. Savchenko
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kirill Demochkin .

Editor information

Editors and Affiliations

RWTH Aachen University, Aachen, Germany
Wil M. P. van der Aalst
University of Ljubljana, Ljubljana, Slovenia
Vladimir Batagelj
National Research University Higher School of Economics, Moscow, Russia
Dmitry I. Ignatov
Krasovskii Institute of Mathematics and Mechanics, Yekaterinburg, Russia
Michael Khachay
National Research University Higher School of Economics, Moscow, Russia
Valentina Kuskova
University of Oslo, Oslo, Norway
Andrey Kutuzov
National Research University Higher School of Economics, Moscow, Russia
Sergei O. Kuznetsov
National Research University Higher School of Economics, Moscow, Russia
Irina A. Lomazova
Lomonosov Moscow State University, Moscow, Russia
Natalia Loukachevitch
LORIA, Vandœuvre-lès-Nancy, France
Amedeo Napoli
University of Florida, Gainesville, FL, USA
Panos M. Pardalos
Ca Foscari University of Venice, Venice, Italy
Marcello Pelillo
National Research University Higher School of Economics, Nizhny Novgorod, Russia
Andrey V. Savchenko
Kazan Federal University, Kazan, Russia
Elena Tutubalina

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Demochkin, K., Savchenko, A.V. (2019). Multi-label Image Set Recognition in Visually-Aware Recommender Systems. In: van der Aalst, W., et al. Analysis of Images, Social Networks and Texts. AIST 2019. Lecture Notes in Computer Science(), vol 11832. Springer, Cham. https://doi.org/10.1007/978-3-030-37334-4_26

Download citation

DOI: https://doi.org/10.1007/978-3-030-37334-4_26
Published: 15 December 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37333-7
Online ISBN: 978-3-030-37334-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Multi-label Image Set Recognition in Visually-Aware Recommender Systems

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

User Preference Prediction in a Set of Photos Based on Neural Aggregation Network

Neural Attention Mechanism and Linear Squeezing of Descriptors in Image Classification for Visual Recommender Systems

Visual Hybrid Recommendation Systems Based on the Content-Based Filtering

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Multi-label Image Set Recognition in Visually-Aware Recommender Systems

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

User Preference Prediction in a Set of Photos Based on Neural Aggregation Network

Neural Attention Mechanism and Linear Squeezing of Descriptors in Image Classification for Visual Recommender Systems

Visual Hybrid Recommendation Systems Based on the Content-Based Filtering

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation