research-article

Adversarial Distillation for Efficient Recommendation with External Knowledge

Authors:

Yongfeng Zhang,

Hongyuan Zha

ACM Transactions on Information Systems (TOIS), Volume 37, Issue 1

Article No.: 12, Pages 1 - 28

https://doi.org/10.1145/3281659

Published: 13 December 2018

Abstract

Integrating external knowledge into recommender systems has attracted increasing attention in both industry and academia. Recent methods mostly harness the power of neural networks for effective knowledge representation to improve recommendation performance. However, the heavy deep architectures in existing models are usually embedded in the recommendation model itself, which may greatly increase model complexity and lower runtime efficiency.

To harness the power of deep learning for external knowledge modeling while maintaining model efficiency at test time, we reformulate the problem of recommendation with external knowledge as a generalized distillation framework. The general idea is to move the complex deep architecture into a separate model that is used only in the training phase and discarded at test time. In particular, in the training phase, the external knowledge is processed by a comprehensive teacher model to produce valuable information that teaches a simple and efficient student model. Once the framework is learned, the teacher model is discarded, and only the succinct yet enhanced student model is used to make fast predictions at test time. In this article, we specify the external knowledge as user reviews, and to leverage them effectively, we further extend the traditional generalized distillation framework by designing a Selective Distillation Network (SDNet) with adversarial adaption and orthogonality constraint strategies to make it more robust to noisy information.
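As a rough illustration of the teacher-student idea described above, the following Python sketch trains a small matrix-factorization student against both the observed ratings and a teacher's predictions. It shows plain generalized distillation only, not SDNet: the adversarial adaption and orthogonality constraint are omitted. The function names, the imitation weight `lam`, the squared-error blend, and the toy teacher predictions are all assumptions made for illustration; the teacher is assumed to be a review-aware model whose predictions were computed beforehand.

```python
import random


def distillation_loss(pred, rating, teacher_pred, lam):
    """Blend of hard-target error and teacher-imitation error (assumed form)."""
    return (1 - lam) * (rating - pred) ** 2 + lam * (teacher_pred - pred) ** 2


def predict(P, Q, u, i):
    """Student prediction: inner product of user and item factors."""
    return sum(pu * qi for pu, qi in zip(P[u], Q[i]))


def init_student(n_users, n_items, dim=4, seed=0):
    """Small random user factors P and item factors Q."""
    rng = random.Random(seed)
    P = [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(n_users)]
    Q = [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(n_items)]
    return P, Q


def train_student(P, Q, ratings, teacher_preds, lam=0.5, lr=0.02, epochs=500):
    """SGD on the distillation loss; the teacher is consulted only here."""
    for _ in range(epochs):
        for u, i, r in ratings:
            pred = predict(P, Q, u, i)
            t = teacher_preds[(u, i)]
            # Negative gradient of the loss w.r.t. pred (factor 2 folded into lr).
            err = (1 - lam) * (r - pred) + lam * (t - pred)
            for k in range(len(P[u])):
                pu, qi = P[u][k], Q[i][k]
                P[u][k] += lr * err * qi
                Q[i][k] += lr * err * pu
    return P, Q


def mean_loss(P, Q, ratings, teacher_preds, lam):
    """Average distillation loss over the training triples."""
    total = sum(
        distillation_loss(predict(P, Q, u, i), r, teacher_preds[(u, i)], lam)
        for u, i, r in ratings
    )
    return total / len(ratings)


# Toy data (hypothetical): (user, item, rating) triples and teacher outputs.
toy_ratings = [(0, 0, 5.0), (0, 1, 3.0), (1, 0, 4.0), (1, 2, 2.0), (2, 1, 4.0)]
toy_teacher = {(0, 0): 4.5, (0, 1): 3.5, (1, 0): 4.0, (1, 2): 2.5, (2, 1): 3.5}

P, Q = init_student(n_users=3, n_items=3)
train_student(P, Q, toy_ratings, toy_teacher, lam=0.5)
print(predict(P, Q, 0, 0))  # test-time prediction uses the student alone
```

At test time only `predict` is called, so the cost of a prediction is a single inner product regardless of how expensive the teacher was to run during training.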

Extensive experiments verify that our model not only improves rating prediction performance but also significantly reduces prediction time compared with several state-of-the-art methods.

Index Terms

1. Information systems
  1. World Wide Web
    1. Web searching and information discovery
      1. Collaborative filtering
      2. Personalization

Published In

ACM Transactions on Information Systems Volume 37, Issue 1

January 2019

435 pages

ISSN:1046-8188

EISSN:1558-2868

DOI:10.1145/3289475

Editor:
Maarten de Rijke
University of Amsterdam, The Netherlands

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 December 2018

Accepted: 01 September 2018

Revised: 01 August 2018

Received: 01 April 2018

Published in TOIS Volume 37, Issue 1

Qualifiers

Research-article
Research
Refereed
