short-paper

Open access

Everyone’s a Winner! On Hyperparameter Tuning of Recommendation Models

Authors:

Faisal Shehzad,

Dietmar JannachAuthors Info & Claims

RecSys '23: Proceedings of the 17th ACM Conference on Recommender Systems

Pages 652 - 657

https://doi.org/10.1145/3604915.3609488

Published: 14 September 2023 Publication History

All formats PDF

Abstract

The performance of a recommender system algorithm in terms of common offline accuracy measures often strongly depends on the chosen hyperparameters. Therefore, when comparing algorithms in offline experiments, we can obtain reliable insights regarding the effectiveness of a newly proposed algorithm only if we compare it to a number of state-of-the-art baselines that are carefully tuned for each of the considered datasets. While this fundamental principle of any area of applied machine learning is undisputed, we find that the tuning process for the baselines in the current literature is barely documented in much of today’s published research. Ultimately, in case the baselines are actually not carefully tuned, progress may remain unclear. In this paper, we exemplify through a computational experiment involving seven recent deep learning models how every method in such an unsound comparison can be reported to be outperforming the state-of-the-art. Finally, we iterate appropriate research practices to avoid unreliable algorithm comparisons in the future.

References

[1]

Charu C. Aggarwal. 2016. Recommender Systems - The Textbook. Springer.

Digital Library

[2]

Vito Walter Anelli, Alejandro Bellogín, Antonio Ferrara, Daniele Malitesta, Felice Antonio Merra, Claudio Pomo, Francesco Maria Donini, and Tommaso Di Noia. 2021. Elliot: A Comprehensive and Rigorous Framework for Reproducible Recommender Systems Evaluation. In SIGIR ’21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2405–2414.

Digital Library

[3]

Vito Walter Anelli, Alejandro Bellogin, Tommaso Di Noia, Dietmar Jannach, and Claudio Pomo. 2022. Top-N Recommendation Algorithms: A Quest for the State-of-the-Art. In 30th ACM Conference on User Modeling, Adaptation and Personalization (UMAP 2022).

[4]

Timothy G. Armstrong, Alistair Moffat, William Webber, and Justin Zobel. 2009. Improvements That Don’t Add Up: Ad-hoc Retrieval Results Since 1998. In Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM ’09). 601–610.

Digital Library

[5]

Christine Bauer, Maik Fröbe, Dietmar Jannach, Udo Kruschwitz, Paolo Rosso, Damiano Spina, and Nava Tintarev. 2023. Overcoming Methodological Challenges in Information Retrieval and Recommender Systems through Awareness and Education. In Dagstuhl Seminar 23031: Frontiers of Information Access Experimentation for Research and Education, Christine Bauer, Ben Carterette, Nicola Ferro, and Norbert Fuhr (Eds.). https://doi.org/10.48550/arXiv.2305.01509

[6]

Dong-Kyu Chae, Jin-Soo Kang, Sang-Wook Kim, and Jung-Tae Lee. 2018. CFGAN: A Generic Collaborative Filtering Framework based on Generative Adversarial Networks. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 137–146.

Digital Library

[7]

Paul Covington, Jay Adams, and Emre Sargin. 2016. Deep Neural Networks for YouTube Recommendations. In Proceedings of the 10th ACM Conference on Recommender Systems(RecSys 2016). 191–198.

Digital Library

[8]

Paolo Cremonesi and Dietmar Jannach. 2021. Progress in Recommender Systems Research: Crisis? What crisis?AI Magazine 42, 3 (2021), 43–54.

[9]

Maurizio Ferrari Dacrema, Simone Boglio, Paolo Cremonesi, and Dietmar Jannach. 2021. A Troubling Analysis of Reproducibility and Progress in Recommender Systems Research. ACM Transactions on Information Systems (TOIS) 39 (2021). Issue 2.

[10]

Bracha Shapira Francesco Ricci, Lior Rokach (Ed.). 2023. Recommender Systems Handbook (3 ed.). Springer.

[11]

Odd Erik Gundersen, Yolanda Gil, and David W. Aha. 2018. On Reproducible AI: Towards Reproducible Research, Open Science, and Digital Scholarship in AI Publications. AI Mag. 39, 3 (2018), 56–68.

Digital Library

[12]

Xiangnan He, Kuan Deng, Xiang Wang, Yan Li, Yongdong Zhang, and Meng Wang. 2020. LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 639–648.

Digital Library

[13]

Xiangnan He, Xiaoyu Du, Xiang Wang, Feng Tian, Jinhui Tang, and Tat-Seng Chua. 2018. Outer Product-Based Neural Collaborative Filtering. 18 (2018), 2227–2233.

[14]

Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua. 2017. Neural Collaborative Filtering. In Proceedings of the 26th International Conference on World Wide Web. 173–182.

Digital Library

[15]

Donghyun Kim, Chanyoung Park, Jinoh Oh, Sungyoung Lee, and Hwanjo Yu. 2016. Convolutional Matrix Factorization for Document Context-aware Recommendation. In Proceedings of the 10th ACM Conference on Recommender Systems. 233–240.

Digital Library

[16]

Pigi Kouki, Ilias Fountalis, Nikolaos Vasiloglou, Xiquan Cui, Edo Liberty, and Khalifeh Al Jadda. 2020. From the Lab to Production: A Case study of Session-based Recommendations in the Home-Improvement Domain. In Proceedings of the 14th ACM Conference on Recommender Systems (Virtual Event, Brazil) (RecSys 2020). 140–149.

Digital Library

[17]

Dawen Liang, Rahul G Krishnan, Matthew D Hoffman, and Tony Jebara. 2018. Variational Autoencoders for Collaborative Filtering. In Proceedings of the 2018 World Wide Web Conference. 689–698.

Digital Library

[18]

Jimmy Lin. 2019. The Neural Hype and Comparisons Against Weak Baselines. ACM SIGIR Forum 52, 2 (2019), 40–51.

Digital Library

[19]

Malte Ludewig, Sara Latifi, Noemi Mauro, and Dietmar Jannach. 2021. Empirical Analysis of Session-Based Recommendation Algorithms. User Modeling and User-Adapted Interaction 31 (2021), 149–181.

Digital Library

[20]

Qingsong Lv, Ming Ding, Qiang Liu, Yuxiang Chen, Wenzheng Feng, Siming He, Chang Zhou, Jianguo Jiang, Yuxiao Dong, and Jie Tang. 2021. Are We Really Making Much Progress? Revisiting, Benchmarking and Refining Heterogeneous Graph Neural Networks. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (Virtual Event, Singapore) (KDD ’21). 1150–1160.

Digital Library

[21]

Spyros Makridakis, Evangelos Spiliotis, and Vassilios Assimakopoulos. 2018. Statistical and Machine Learning Forecasting Methods: Concerns and Ways Forward. PloS one 13, 3 (2018).

[22]

Joelle Pineau, Philippe Vincent-Lamarre, Koustuv Sinha, Vincent Larivière, Alina Beygelzimer, Florence d’Alché Buc, Emily Fox, and Hugo Larochelle. 2021. Improving Reproducibility in Machine Learning Research (a Report from the NeurIPS 2019 Reproducibility Program). J. Mach. Learn. Res. 22, 1 (2021).

[23]

Steffen Rendle, Walid Krichene, Li Zhang, and John Anderson. 2020. Neural Collaborative Filtering vs. Matrix Factorization Revisited. In Proceedings of the 14th ACM Conference on Recommender Systems (RecSys ’20).

Digital Library

[24]

Steffen Rendle, Walid Krichene, Li Zhang, and Yehuda Koren. 2022. Revisiting the Performance of IALS on Item Recommendation Benchmarks. In Proceedings of the 16th ACM Conference on Recommender Systems(RecSys ’22). 427–435.

Digital Library

[25]

Steffen Rendle, Li Zhang, and Yehuda Koren. 2019. On the Difficulty of Evaluating Baselines: A Study on Recommender Systems. CoRR abs/1905.01395 (2019). http://arxiv.org/abs/1905.01395

[26]

Paul Resnick, Neophytos Iacovou, Mitesh Suchak, Peter Bergstrom, and John Riedl. 1994. GroupLens: An Open Architecture for Collaborative Filtering of Netnews. In Proceedings of the 1994 ACM Conference on Computer Supported Cooperative Work. 175–186.

Digital Library

[27]

J. Ben Schafer, Joseph Konstan, and John Riedl. 1999. Recommender Systems in E-Commerce. In Proceedings of the 1st ACM Conference on Electronic Commerce(EC ’99). 158–166.

Digital Library

[28]

Suvash Sedhain, Aditya Krishna Menon, Scott Sanner, and Lexing Xie. 2015. Autorec: Autoencoders Meet Collaborative Filtering. In Proceedings of the 24th International Conference on World Wide Web. 111–112.

Digital Library

[29]

Harald Steck. 2019. Embarrassingly Shallow Autoencoders for Sparse Data. In The World Wide Web Conference, WWW 2019. 3251–3257.

[30]

Harald Steck, Linas Baltrunas, Ehtsham Elahi, Dawen Liang, Yves Raimond, and Justin Basilico. 2021. Deep Learning for Recommender Systems: A Netflix Case Study. AI Magazine 42, 3 (2021), 7–18.

Digital Library

[31]

Xiang Wang, Xiangnan He, Meng Wang, Fuli Feng, and Tat-Seng Chua. 2019. Neural Graph Collaborative Filtering. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval. 165–174.

Digital Library

Cited By

Vrijenhoek SDaniil SSandel JHollink L(2024)Diversity of What? On the Different Conceptualizations of Diversity in Recommender SystemsProceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency10.1145/3630106.3658926(573-584)Online publication date: 3-Jun-2024
https://dl.acm.org/doi/10.1145/3630106.3658926
Pellini RFerrari Dacrema M(2024)Analyzing the effectiveness of quantum annealing with meta-learningQuantum Machine Intelligence10.1007/s42484-024-00179-86:2Online publication date: 25-Jul-2024
https://doi.org/10.1007/s42484-024-00179-8

Index Terms

Everyone’s a Winner! On Hyperparameter Tuning of Recommendation Models
1. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
      1. Recommender systems

Recommendations

Improving Micro-video Recommendation by Controlling Position Bias
Machine Learning and Knowledge Discovery in Databases
Abstract
As the micro-video apps become popular, the numbers of micro-videos and users increase rapidly, which highlights the importance of micro-video recommendation. Although the micro-video recommendation can be naturally treated as the sequential ...
Workshop on recommendation utility evaluation: beyond RMSE -- RUE 2012
RecSys '12: Proceedings of the sixth ACM conference on Recommender systems

Measuring the error in rating prediction has been by far the dominant evaluation methodology in the Recommender Systems literature. Yet there seems to be a general consensus that this criterion alone is far from being enough to assess the practical ...
On Unexpectedness in Recommender Systems: Or How to Better Expect the Unexpected
Special Sections on Diversity and Discovery in Recommender Systems, Online Advertising and Regular Papers

Although the broad social and business success of recommender systems has been achieved across several domains, there is still a long way to go in terms of user satisfaction. One of the key dimensions for significant improvement is the concept of ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

RecSys '23: Proceedings of the 17th ACM Conference on Recommender Systems

September 2023

1406 pages

ISBN:9798400702419

DOI:10.1145/3604915

Copyright © 2023 Owner/Author.

This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 September 2023

Check for updates

Author Tags

Qualifiers

Short-paper
Research
Refereed limited

Conference

RecSys '23

Sponsor:

RecSys '23: Seventeenth ACM Conference on Recommender Systems

September 18 - 22, 2023

Singapore, Singapore

Acceptance Rates

Overall Acceptance Rate 254 of 1,295 submissions, 20%

Upcoming Conference

RecSys '24

Sponsor:
sigchi

18th ACM Conference on Recommender Systems

October 14 - 18, 2024

Bari , Italy

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
759
Total Downloads

Downloads (Last 12 months)759
Downloads (Last 6 weeks)78

Reflects downloads up to 30 Aug 2024

Other Metrics

View Author Metrics

Citations

Cited By

Vrijenhoek SDaniil SSandel JHollink L(2024)Diversity of What? On the Different Conceptualizations of Diversity in Recommender SystemsProceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency10.1145/3630106.3658926(573-584)Online publication date: 3-Jun-2024
https://dl.acm.org/doi/10.1145/3630106.3658926
Pellini RFerrari Dacrema M(2024)Analyzing the effectiveness of quantum annealing with meta-learningQuantum Machine Intelligence10.1007/s42484-024-00179-86:2Online publication date: 25-Jul-2024
https://doi.org/10.1007/s42484-024-00179-8

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents