DOI: 10.1145/3383313.3412236

Progressive Layered Extraction (PLE): A Novel Multi-Task Learning (MTL) Model for Personalized Recommendations

Published: 22 September 2020

Abstract

Multi-task learning (MTL) has been applied successfully in many recommendation applications. However, MTL models often suffer performance degeneration from negative transfer, caused by the complex and competing task correlations in real-world recommender systems. Moreover, through extensive experiments with state-of-the-art (SOTA) MTL models, we have observed an interesting seesaw phenomenon: the performance of one task is often improved at the cost of hurting some other tasks. To address these issues, we propose a Progressive Layered Extraction (PLE) model with a novel sharing-structure design. PLE separates shared components and task-specific components explicitly and adopts a progressive routing mechanism to extract and separate deeper semantic knowledge gradually, improving the efficiency of joint representation learning and information routing across tasks in a general setup. We apply PLE to both complicatedly correlated and normally correlated tasks, ranging from two-task to multi-task cases, on a real-world Tencent video recommendation dataset with 1 billion samples, and the results show that PLE significantly outperforms SOTA MTL models under different task correlations and task-group sizes. Furthermore, online evaluation of PLE on a large-scale content recommendation platform at Tencent shows a 2.23% increase in view count and a 1.84% increase in watch time over SOTA MTL models, a significant improvement that demonstrates the effectiveness of PLE. Finally, extensive offline experiments on public benchmark datasets demonstrate that PLE can eliminate the seesaw phenomenon in a variety of scenarios beyond recommendation. PLE has now been deployed successfully in the online video recommender system at Tencent.
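The core idea in the abstract, explicitly separated shared and task-specific experts fused per task by a learned gate, can be sketched in a few lines. This is a minimal NumPy illustration of one such gating layer, not the paper's implementation: the layer name `cgc_layer`, the ReLU experts, and all sizes and random weights are our assumptions for illustration only.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

rng = np.random.default_rng(0)

d_in, d_expert = 16, 8          # input and expert output widths (arbitrary)
n_shared, n_specific, n_tasks = 2, 2, 2

# Expert pools: one shared pool plus one private pool per task.
# Random matrices stand in for trained expert weights.
shared_W = [rng.normal(size=(d_in, d_expert)) for _ in range(n_shared)]
task_W = [[rng.normal(size=(d_in, d_expert)) for _ in range(n_specific)]
          for _ in range(n_tasks)]
# One gate per task, weighting the shared experts plus that task's own experts.
gate_W = [rng.normal(size=(d_in, n_shared + n_specific)) for _ in range(n_tasks)]

def cgc_layer(x):
    """One gated extraction layer: each task fuses the shared experts and
    its own task-specific experts via a task-specific softmax gate."""
    shared_out = [np.maximum(x @ W, 0.0) for W in shared_W]    # ReLU experts
    outputs = []
    for t in range(n_tasks):
        own_out = [np.maximum(x @ W, 0.0) for W in task_W[t]]
        experts = np.stack(shared_out + own_out, axis=1)       # (B, E, d_expert)
        gate = softmax(x @ gate_W[t])                          # (B, E)
        outputs.append((gate[:, :, None] * experts).sum(axis=1))
    return outputs                                             # one tensor per task

x = rng.normal(size=(4, d_in))
y = cgc_layer(x)
print(len(y), y[0].shape)  # 2 (4, 8)
```

The full PLE model stacks several such extraction layers and routes the fused representations forward, so shared and task-specific knowledge is separated progressively rather than in a single step; task-specific towers then consume the final per-task outputs.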



Published In

RecSys '20: Proceedings of the 14th ACM Conference on Recommender Systems
September 2020
796 pages
ISBN:9781450375832
DOI:10.1145/3383313

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. Multi-task Learning
  2. Recommender System
  3. Seesaw Phenomenon

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

RecSys '20: Fourteenth ACM Conference on Recommender Systems
September 22 - 26, 2020
Virtual Event, Brazil

Acceptance Rates

Overall Acceptance Rate 254 of 1,295 submissions, 20%
