Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3178876.3186155acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article
Public Access

AdaError: An Adaptive Learning Rate Method for Matrix Approximation-based Collaborative Filtering

Published: 23 April 2018 Publication History

Abstract

Gradient-based learning methods such as stochastic gradient descent are widely used in matrix approximation-based collaborative filtering algorithms to train recommendation models based on observed user-item ratings. One major difficulty in existing gradient-based learning methods is determining proper learning rates, since model convergence would be inaccurate or very slow if the learning rate is too large or too small, respectively. This paper proposes AdaError, an adaptive learning rate method for matrix approximation-based collaborative filtering. AdaError eliminates the need of manually tuning the learning rates by adaptively adjusting the learning rates based on the noisiness level of user-item ratings, using smaller learning rates for noisy ratings so as to reduce their impact on the learned models. Our theoretical and empirical analysis shows that AdaError can improve the generalization performance of the learned models. Experimental studies on the MovieLens and Netflix datasets also demonstrate that AdaError outperforms state-of-the-art adaptive learning rate methods in matrix approximation-based collaborative filtering. Furthermore, by applying AdaError to the standard matrix approximation method, we can achieve statistically significant improvements over state-of-the-art collaborative filtering methods in both rating prediction accuracy and top-N recommendation accuracy.

References

[1]
Xavier Amatriain, Josep M. Pujol, and Nuria Oliver. 2009. I Like It... I Like It Not: Evaluating User Ratings Noise in Recommender Systems. In Proceedings of the 17th International Conference on User Modeling, Adaptation, and Personalization (UMAP '09). Springer, 247--258.
[2]
Xavier Amatriain, Josep M. Pujol, Nava Tintarev, and Nuria Oliver. 2009. Rate It Again: Increasing Recommendation Accuracy by User Re-rating. In Proceedings of the Third ACM Conference on Recommender Systems (RecSys '09). ACM, 173--180.
[3]
Alex Beutel, Amr Ahmed, and Alexander J. Smola. 2015. ACCAMS: Additive Co-Clustering to Approximate Matrices Succinctly. In Proceedings of the 24th International Conference on World Wide Web (WWW '15). 119--129.
[4]
Alex Beutel, Ed H. Chi, Zhiyuan Cheng, Hubert Pham, and John Anderson. 2017. Beyond Globally Optimal: Focused Learning for Improved Recommendations. In Proceedings of the 26th International Conference on World Wide Web (WWW '17). 203--212.
[5]
Daniel Billsus and Michael J Pazzani. 1998. Learning Collaborative Information Filters. In Proceedings of the Fifteenth International Conference on Machine Learning (ICML '98), Vol. 98. 46--54.
[6]
Olivier Bousquet and Andr´e Elisseeff. 2001. Algorithmic Stability and Generalization Performance. In Advances in Neural Information Processing Systems. 196--202.
[7]
Emmanuel J. Cand
[8]
es and Yaniv Plan. 2010. Matrix Completion With Noise. Proc. IEEE 98, 6 (2010), 925--936.
[9]
Chao Chen, Dongsheng Li, Yingying Zhao, Qin Lv, and Li Shang. 2015. WEMAREC: Accurate and Scalable Recommendation through Weighted and Ensemble Matrix Approximation. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval. 303--312.
[10]
Peizhe Cheng, Shuaiqiang Wang, Jun Ma, Jiankai Sun, and Hui Xiong. 2017. Learning to Recommend Accurate and Diverse Items. In Proceedings of the 26th International Conference on World Wide Web (WWW '17). 183--192.
[11]
Dan Cosley, Shyong K. Lam, Istvan Albert, Joseph A. Konstan, and John Riedl. 2003. Is Seeing Believing?: How Recommender System Interfaces Affect Users? Opinions. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '03). ACM, 585--592.
[12]
Jeffrey Dean, Greg Corrado, Rajat Monga, Kai Chen, Matthieu Devin, Mark Mao, Andrew Senior, Paul Tucker, Ke Yang, Quoc V Le, et al. 2012. Large scale distributed deep networks. In Advances in neural information processing systems. 1223--1231.
[13]
John Duchi, Elad Hazan, and Yoram Singer. 2011. Adaptive subgradient methods for online learning and stochastic optimization. Journal of Machine Learning Research 12, Jul (2011), 2121--2159.
[14]
Moritz Hardt, Benjamin Recht, and Yoram Singer. 2016. Train Faster, Generalize Better: Stability of Stochastic Gradient Descent. In Proceedings of the 33rd International Conference on International Conference on Machine Learning (ICML'16). JMLR.org, 1225--1234.
[15]
Elad Hazan and Satyen Kale. 2014. Beyond the regret minimization barrier: optimal algorithms for stochastic strongly-convex optimization. Journal of Machine Learning Research 15, 1 (2014), 2489--2512.
[16]
Liang Hu, Jian Cao, Guandong Xu, Longbing Cao, Zhiping Gu, and Can Zhu. 2013. Personalized Recommendation via Crossdomain Triadic Factorization. In Proceedings of the 22Nd International Conference on World Wide Web (WWW '13). ACM, 595--606.
[17]
Yifan Hu, Yehuda Koren, and Chris Volinsky. 2008. Collaborative Filtering for Implicit Feedback Datasets. In Proceedings of the Eighth IEEE International Conference on Data Mining (ICDM '08). 263--272.
[18]
Robert A Jacobs. 1988. Increased rates of convergence through learning rate adaptation. Neural networks 1, 4 (1988), 295--307.
[19]
Nicolas Jones, Armelle Brun, and Anne Boyer. 2011. Comparisons Instead of Ratings: Towards More Stable Preferences. In Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intel ligence and Intel ligent Agent Technology (WI-IAT '11). IEEE, 451--456.
[20]
Santosh Kabbur, Xia Ning, and George Karypis. 2013. FISM: Factored Item Similarity Models for top-N Recommender Systems. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '13). ACM, 659--667.
[21]
Diederik Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[22]
Yehuda Koren. 2008. Factorization Meets the Neighborhood: A Multifaceted Collaborative Filtering Model. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '08). ACM, 426--434.
[23]
Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer 42, 8 (2009), 30--37.
[24]
Balaji Lakshminarayanan, Guillaume Bouchard, and Cedric Archambeau. 2011. Robust Bayesian matrix factorisation. In Proceedings of the International Conference on Artificial Intel ligence and Statistics (AISTATS). 425--433.
[25]
Joonseok Lee, Seungyeon Kim, Guy Lebanon, and Yoram Singer. 2013. Local low-rank matrix approximation. In Proceedings of The 30th International Conference on Machine Learning (ICML '13). 82--90.
[26]
Dongsheng Li, Chao Chen, Qin Lv, Li Shang, Stephen M. Chu, and Hongyuan Zha. 2017. ERMMA: Expected Risk Minimization for Matrix Approximation-based Recommender Systems. In Proceedings of the Thirty-First AAAI Conference on Artificial Intel ligence (AAAI '17). 1403--1409.
[27]
Dongsheng Li, Chao Chen, Qin Lv, Junchi Yan, Li Shang, and Stephen M. Chu. 2016. Low-rank matrix approximation with stability. In Proceedings of The 33rd International Conference on Machine Learning (ICML '16). 295--303.
[28]
Lester W Mackey, Michael I Jordan, and Ameet Talwalkar. 2011. Divide-and-conquer matrix factorization. In Advances in Neural Information Processing Systems. 1134--1142.
[29]
Bhaskar Mehta, Thomas Hofmann, and Wolfgang Nejdl. 2007. Robust Collaborative Filtering. In Proceedings of the 2007 ACM Conference on Recommender Systems (RecSys '07). ACM, 49-- 56.
[30]
Andrew Y. Ng. 2004. Feature Selection, L1 vs. L2 Regularization, and Rotational Invariance. In Proceedings of the Twenty-first International Conference on Machine Learning (ICML '04). 78--85.
[31]
Tien T. Nguyen, Daniel Kluver, Ting-Yu Wang, Pik-Mai Hui, Michael D. Ekstrand, Martijn C. Willemsen, and John Riedl. 2013. Rating Support Interfaces to Improve User Experience and Recommender Accuracy. In Proceedings of the 7th ACM Conference on Recommender Systems (RecSys '13). ACM, 149-- 156.
[32]
Xia Ning and George Karypis. 2011. SLIM: Sparse Linear Methods for Top-N Recommender Systems. In Proceedings of the 2011 IEEE 11th International Conference on Data Mining (ICDM '11). 497--506.
[33]
Michael O'Mahony, Neil Hurley, Nicholas Kushmerick, and Gu´enol´e Silvestre. 2004. Collaborative Recommendation: A Robustness Analysis. ACM Trans. Internet Technol. 4, 4 (2004), 344--377.
[34]
Michael P. O'Mahony, Neil J. Hurley, and Gu´enol´e C.M. Silvestre. 2006. Detecting Noise in Recommender System Databases. In Proceedings of the 11th International Conference on Intel ligent User Interfaces (IUI '06). ACM, 109--115.
[35]
Arkadiusz Paterek. 2007. Improving regularized singular value decomposition for collaborative filtering. In Proceedings of KDD cup and workshop, Vol. 2007. 5--8.
[36]
Steffen Rendle and Christoph Freudenthaler. 2014. Improving Pairwise Learning for Item Recommendation from Implicit Feedback. In Proceedings of the 7th ACM International Conference on Web Search and Data Mining (WSDM '14). 273--282.
[37]
Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2009. BPR: Bayesian personalized ranking from implicit feedback. In Proceedings of the twenty-fifth conference on uncertainty in artificial intel ligence. 452--461.
[38]
Paul Resnick and Rahul Sami. 2007. The Influence Limiter: Provably Manipulation-resistant Recommender Systems. In Proceedings of the 2007 ACM Conference on Recommender Systems (RecSys '07). ACM, 25--32.
[39]
Ruslan Salakhutdinov and Andriy Mnih. 2008. Bayesian Probabilistic Matrix Factorization Using Markov Chain Monte Carlo. In Proceedings of the 25th International Conference on Machine Learning (ICML '08). ACM, 880--887.
[40]
Ruslan Salakhutdinov and Andriy Mnih. 2008. Probabilistic matrix factorization. In Advances in neural information processing systems. 1257--1264.
[41]
Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl. 2000. Application of Dimensionality Reduction in Recommender System - A Case Study. In ACM WebKDD 2000 Workshop. ACM SIGKDD.
[42]
Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl. 2002. Incremental Singular Value Decomposition Algorithms for Highly Scalable Recommender Systems. In Proceedings of the 5th International Conference in Computers and Information Technology.
[43]
Madeleine Udell, Corinne Horn, Reza Zadeh, and Stephen Boyd. 2016. Generalized Low Rank Models. Foundations and Trends in Machine Learning 9, 1 (2016), 1--118.
[44]
Linli Xu, Zaiyi Chen, Qi Zhou, Enhong Chen, Nicholas Jing Yuan, and Xing Xie. 2016. Aligned Matrix Completion: Integrating Consistency and Independency in Multiple Domains. In 2016 IEEE 16th International Conference on Data Mining (ICDM). 529--538.
[45]
Ting Yuan, Jian Cheng, Xi Zhang, Shuang Qiu, and Hanqing Lu. 2014. Recommendation by Mining Multiple User Behaviors with Group Sparsity. In Proceedings of the 28th AAAI Conference on Artificial Intel ligence (AAAI '14). 222--228.
[46]
Matthew D Zeiler. 2012. ADADELTA: an adaptive learning rate method. arXiv preprint arXiv:1212.5701 (2012).
[47]
Yongfeng Zhang, Min Zhang, Yiqun Liu, Shaoping Ma, and Shi Feng. 2013. Localized Matrix Factorization for Recommendation Based on Matrix Block Diagonal Forms. In Proceedings of the 22Nd International Conference on World Wide Web (WWW '13). ACM, 1511--1520.

Cited By

View all
  • (2024)Parallel Fractional Stochastic Gradient Descent With Adaptive Learning for Recommender SystemsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2022.318521235:3(470-483)Online publication date: Mar-2024
  • (2024)Who Wants to Shop With You: Joint Product–Participant Recommendation for Group-Buying ServiceIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.319000335:2(2353-2363)Online publication date: Feb-2024
  • (2024)Data and resource aware incremental ML training in support of pervasive applicationsComputing10.1007/s00607-024-01338-2106:11(3727-3753)Online publication date: 16-Aug-2024
  • Show More Cited By

Index Terms

  1. AdaError: An Adaptive Learning Rate Method for Matrix Approximation-based Collaborative Filtering

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      WWW '18: Proceedings of the 2018 World Wide Web Conference
      April 2018
      2000 pages
      ISBN:9781450356398
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      • IW3C2: International World Wide Web Conference Committee

      In-Cooperation

      Publisher

      International World Wide Web Conferences Steering Committee

      Republic and Canton of Geneva, Switzerland

      Publication History

      Published: 23 April 2018

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. collaborative filtering
      2. matrix approximation

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      WWW '18
      Sponsor:
      • IW3C2
      WWW '18: The Web Conference 2018
      April 23 - 27, 2018
      Lyon, France

      Acceptance Rates

      WWW '18 Paper Acceptance Rate 170 of 1,155 submissions, 15%;
      Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)227
      • Downloads (Last 6 weeks)47
      Reflects downloads up to 16 Oct 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Parallel Fractional Stochastic Gradient Descent With Adaptive Learning for Recommender SystemsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2022.318521235:3(470-483)Online publication date: Mar-2024
      • (2024)Who Wants to Shop With You: Joint Product–Participant Recommendation for Group-Buying ServiceIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.319000335:2(2353-2363)Online publication date: Feb-2024
      • (2024)Data and resource aware incremental ML training in support of pervasive applicationsComputing10.1007/s00607-024-01338-2106:11(3727-3753)Online publication date: 16-Aug-2024
      • (2023)A Differential Evolution-Enhanced Position-Transitional Approach to Latent Factor AnalysisIEEE Transactions on Emerging Topics in Computational Intelligence10.1109/TETCI.2022.31866737:2(389-401)Online publication date: Apr-2023
      • (2023)Overview of Recommender SystemsRecommender Systems10.1007/978-981-99-8964-5_1(1-30)Online publication date: 24-Nov-2023
      • (2022)Position-Transitional Particle Swarm Optimization-Incorporated Latent Factor AnalysisIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.303332434:8(3958-3970)Online publication date: 1-Aug-2022
      • (2022)Accurate Latent Factor Analysis via Dynamic-Neighbor-cooperated Hierarchical Particle Swarm Optimizers2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)10.1109/ICNSC55942.2022.10004108(1-6)Online publication date: 15-Dec-2022
      • (2022)Accurate Latent Factor Analysis via Dynamic-Neighbor-cooperated Hierarchical Particle Swarm Optimizers2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)10.1109/ICNSC55942.2022.10004091(1-6)Online publication date: 15-Dec-2022
      • (2022)Learning Rate-Free Latent Factor Analysis via PSOLatent Factor Analysis for High-dimensional and Sparse Matrices10.1007/978-981-19-6703-0_2(11-27)Online publication date: 16-Nov-2022
      • (2021)XPL-CFProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3482221(2847-2851)Online publication date: 26-Oct-2021
      • Show More Cited By

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media