Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3485447.3512116acmconferencesArticle/Chapter ViewAbstractPublication PageswebconfConference Proceedingsconference-collections
research-article

Efficient Online Learning to Rank for Sequential Music Recommendation

Published: 25 April 2022 Publication History
  • Get Citation Alerts
  • Abstract

    Music streaming services heavily rely upon recommender systems to acquire, engage, and retain users. One notable component of these services are playlists, which can be dynamically generated in a sequential manner based on the user’s feedback during a listening session. Online learning to rank approaches have recently been shown effective at leveraging such feedback to learn users’ preferences in the space of song features. Nevertheless, these approaches can suffer from slow convergence as a result of their random exploration component and get stuck in local minima as a result of their session-agnostic exploitation component. To overcome these limitations, we propose a novel online learning to rank approach which efficiently explores the space of candidate recommendation models by restricting itself to the orthogonal complement of the subspace of previous underperforming exploration directions. Moreover, to help overcome local minima, we propose a session-aware exploitation component which adaptively leverages the current best model during model updates. Our thorough evaluation using simulated listening sessions from Last.fm demonstrates substantial improvements over state-of-the-art approaches regarding early-stage performance and overall long-term convergence.

    References

    [1]
    Claudio Baccigalupo and Enric Plaza. 2006. Case-based Sequential Ordering of Songs for Playlist Recommendation. In ECCBR. 286–300.
    [2]
    Geoffray Bonnin and Dietmar Jannach. 2014. Automated generation of music playlists: Survey and experiments. ACM Computing Surveys (CSUR) 47, 2 (2014), 1–35.
    [3]
    Klaas Bosteels and Etienne E. Kerre. 2009. A Fuzzy Framework for Defining Dynamic Playlist Generation Heuristics. Fuzzy Sets and Systems 160, 23 (2009).
    [4]
    Oscar Celma. 2010. Music Recommendation and Discovery: The Long Tail, Long Fail, and Long Play in the Digital Music Space(1st ed.).
    [5]
    Shuo Chen, Josh L Moore, Douglas Turnbull, and Thorsten Joachims. 2012. Playlist prediction via metric embedding. In Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining. 714–722.
    [6]
    Ricardo Dias, Daniel Gonçalves, and Manuel J Fonseca. 2017. From manual to assisted playlist creation: a survey. Multimedia Tools and Applications 76, 12 (2017), 14375–14403.
    [7]
    Bradley Efron. 1992. Bootstrap methods: another look at the jackknife. In Breakthroughs in statistics. Springer, 569–593.
    [8]
    Abraham D. Flaxman, Adam Tauman Kalai, and H. Brendan McMahan. 2005. Online convex optimization in the bandit setting: gradient descent without a gradient. In Proceedings of the 16th Annual ACM-SIAM Symposium on Discrete Algorithms. 385–394.
    [9]
    Don Gartner, Florian Kraft, and Thomas Schaaf. 2007. An adaptive distance measure for similarity based playlist generation. In 2007 IEEE International Conference on Acoustics, Speech and Signal Processing-ICASSP’07, Vol. 1. IEEE, I–229.
    [10]
    Claudio Gentile, Shuai Li, Purushottam Kar, Alexandros Karatzoglou, Giovanni Zappella, and Evans Etrue. 2017. On Context-Dependent Clustering of Bandits. In Proc. of ICML. 1253–1262.
    [11]
    Artem Grotov and Maarten de Rijke. 2016. Online learning to rank for information retrieval: Sigir 2016 tutorial. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval. 1215–1218.
    [12]
    Anja Nylund Hagen. 2015. The playlist experience: Personal playlists in music streaming services. Popular Music and Society 38, 5 (2015), 625–645.
    [13]
    Negar Hariri, Bamshad Mobasher, and Robin Burke. 2012. Context-aware music recommendation based on latenttopic sequential patterns. In Proceedings of the sixth ACM conference on Recommender systems. 131–138.
    [14]
    Negar Hariri, Bamshad Mobasher, and Robin Burke. 2015. Adapting to User Preference Changes in Interactive Recommendation. In Proc. of ICAI. 4268–4274.
    [15]
    David B Hauver and James C French. 2001. Flycasting: Using Collaborative Filtering to Generate a Playlist for Online Radio. In Proc. of IEEE Web Delivering of Music. 123–130.
    [16]
    Katja Hofmann 2013. Fast and reliable online learning to rank for information retrieval. In SIGIR Forum, Vol. 47. 140.
    [17]
    Katja Hofmann, Anne Schuth, Shimon Whiteson, and Maarten De Rijke. 2013. Reusing historical interaction data for faster online learning to rank for IR. In Proceedings of the sixth ACM international conference on Web search and data mining. 183–192.
    [18]
    Binbin Hu, Chuan Shi, and Jian Liu. 2017. Playlist recommendation based on reinforcement learning. In International Conference on Intelligence Science. Springer, 172–182.
    [19]
    Gawesh Jawaheer, Martin Szomszor, and Patty Kostkova. 2010. Comparison of implicit and explicit feedback from an online music recommendation service. In proceedings of the 1st international workshop on information heterogeneity and fusion in recommender systems. 47–51.
    [20]
    Iman Kamehkhosh, Geoffray Bonnin, and Dietmar Jannach. 2020. Effects of recommendations on the playlist creation behavior of users. User Modeling and User-Adapted Interaction 30, 2 (2020), 285–322.
    [21]
    James King and Vaiva Imbrasaitė. 2015. Generating Music Playlists With hierarchical Clustering and Q-Learning. In Advances in Information Retrieval. 315–326.
    [22]
    Lihong Li, Wei Chu, John Langford, and Robert E. Schapire. 2010. A Contextual-Bandit Approach to Personalized News Article Recommendation. In Proc. of WWW. 661–670.
    [23]
    Lihong Li, Wei Chu, John Langford, and Xuanhui Wang. 2011. Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms. In Proceedings of the fourth ACM international conference on Web search and data mining. 297–306.
    [24]
    Elad Liebman, Maytal Saar-Tsechansky, and Peter Stone. 2014. Dj-mc: A reinforcement-learning agent for music playlist recommendation. arXiv preprint arXiv:1401.1880(2014).
    [25]
    Beth Logan. 2002. Content-Based Playlist Generation: Exploratory Experiments. In ISMIR, Vol. 2. 295–296.
    [26]
    Joshua L Moore, Shuo Chen, Thorsten Joachims, and Douglas Turnbull. 2012. Learning to Embed Songs and Tags for Playlist Prediction. In ISMIR, Vol. 12. 349–354.
    [27]
    Harrie Oosterhuis and Maarten de Rijke. 2017. Balancing speed and quality in online learning to rank for information retrieval. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. 277–286.
    [28]
    Harrie Oosterhuis and Maarten de Rijke. 2018. Differentiable unbiased online learning to rank. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 1293–1302.
    [29]
    Antti Oulasvirta, Janne P Hukkinen, and Barry Schwartz. 2009. When more is less: the paradox of choice in search engine use. In Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval. 516–523.
    [30]
    Elias Pampalk, Tim Pohle, and Gerhard Widmer. 2005. Dynamic Playlist Generation Based on Skipping Behavior. In Proc. of ISMIR. 634–637.
    [31]
    Snickars Pelle. 2017. More of the Same–On Spotify Radio. Culture Unbound. Journal of Current Cultural Research 9, 2 (2017), 184–211.
    [32]
    Bruno L Pereira, Alberto Ueda, Gustavo Penha, Rodrygo LT Santos, and Nivio Ziviani. 2019. Online learning to rank for sequential music recommendation. In Proceedings of the 13th ACM Conference on Recommender Systems. 237–245.
    [33]
    Martin Pichl, Eva Zangerle, and Günther Specht. 2017. Understanding user-curated playlists on spotify: A machine learning approach. International Journal of Multimedia Data Engineering and Management (IJMDEM) 8, 4 (2017), 44–59.
    [34]
    Boris T Polyak. 1964. Some methods of speeding up the convergence of iteration methods. Ussr computational mathematics and mathematical physics 4, 5(1964), 1–17.
    [35]
    Massimo Quadrana, Paolo Cremonesi, and Dietmar Jannach. 2018. Sequence-aware recommender systems. ACM Computing Surveys (CSUR) 51, 4 (2018), 1–36.
    [36]
    Filip Radlinski, Madhu Kurup, and Thorsten Joachims. 2008. How does clickthrough data reflect retrieval quality?. In Proceedings of the 17th ACM Conference on Information and Knowledge Management. 43–52. https://doi.org/10.1145/1458082.1458092
    [37]
    Anne Schuth, Harrie Oosterhuis, Shimon Whiteson, and Maarten de Rijke. 2016. Multileave gradient descent for fast online learning to rank. In Proceedings of the Ninth ACM International Conference on Web Search and Data Mining. 457–466.
    [38]
    Anne Schuth, Floor Sietsma, Shimon Whiteson, Damien Lefortier, and Maarten de Rijke. 2014. Multileaved comparisons for fast online evaluation. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. 71–80.
    [39]
    Andreu Vall. 2015. Listener-inspired automated music playlist generation. In Proceedings of the 9th ACM Conference on Recommender Systems. 387–390.
    [40]
    Andreu Vall, Matthias Dorfer, Markus Schedl, and Gerhard Widmer. 2018. A hybrid approach to music playlist continuation based on playlist-song membership. In Proceedings of the 33rd Annual ACM Symposium on Applied Computing. 1374–1382.
    [41]
    Maksims Volkovs and Guang Wei Yu. 2015. Effective latent models for binary feedback in recommender systems. In Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval. 313–322.
    [42]
    Huazheng Wang, Sonwoo Kim, Eric McCord-Snook, Qingyun Wu, and Hongning Wang. 2019. Variance reduction in gradient exploration for online learning to rank. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval. 835–844.
    [43]
    Huazheng Wang, Ramsey Langley, Sonwoo Kim, Eric McCord-Snook, and Hongning Wang. 2018. Efficient exploration of gradient space for online learning to rank. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. 145–154.
    [44]
    Shoujin Wang, Longbing Cao, Yan Wang, Quan Z Sheng, Mehmet Orgun, and Defu Lian. 2019. A survey on session-based recommender systems. arXiv preprint arXiv:1902.04864(2019).
    [45]
    Xinxi Wang, Yi Wang, David Hsu, and Ye Wang. 2014. Exploration in interactive personalized music recommendation: a reinforcement learning approach. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 11, 1 (2014), 1–22.
    [46]
    Zhe Xing, Xinxi Wang, and Ye Wang. 2014. Enhancing Collaborative Filtering Music Recommendation by Balancing Exploration and Exploitation. In Proc. of ISMIR. 445–450.
    [47]
    Yisong Yue and Thorsten Joachims. 2009. Interactively optimizing information retrieval systems as a dueling bandits problem. In Proceedings of the 26th Annual International Conference on Machine Learning. 1201–1208.
    [48]
    Tong Zhao and Irwin King. 2016. Constructing reliable gradient exploration for online learning to rank. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. 1643–1652.

    Cited By

    View all

    Index Terms

    1. Efficient Online Learning to Rank for Sequential Music Recommendation
            Index terms have been assigned to the content through auto-classification.

            Recommendations

            Comments

            Information & Contributors

            Information

            Published In

            cover image ACM Conferences
            WWW '22: Proceedings of the ACM Web Conference 2022
            April 2022
            3764 pages
            ISBN:9781450390965
            DOI:10.1145/3485447
            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Sponsors

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            Published: 25 April 2022

            Permissions

            Request permissions for this article.

            Check for updates

            Author Tags

            1. Sequential music recommendation
            2. adaptive exploitation
            3. efficient exploration
            4. implicit feedback
            5. online learning to rank

            Qualifiers

            • Research-article
            • Research
            • Refereed limited

            Funding Sources

            Conference

            WWW '22
            Sponsor:
            WWW '22: The ACM Web Conference 2022
            April 25 - 29, 2022
            Virtual Event, Lyon, France

            Acceptance Rates

            Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

            Contributors

            Other Metrics

            Bibliometrics & Citations

            Bibliometrics

            Article Metrics

            • Downloads (Last 12 months)112
            • Downloads (Last 6 weeks)6
            Reflects downloads up to 27 Jul 2024

            Other Metrics

            Citations

            Cited By

            View all
            • (2024)Automatic Hypergraph Generation for Enhancing Recommendation With Sparse OptimizationIEEE Transactions on Multimedia10.1109/TMM.2023.333808326(5680-5693)Online publication date: 2024
            • (2024)Content-driven music recommendationComputer Science Review10.1016/j.cosrev.2024.10061851:COnline publication date: 25-Jun-2024
            • (2024)High-level preferences as positive examples in contrastive learning for multi-interest sequential recommendationWorld Wide Web10.1007/s11280-024-01263-627:2Online publication date: 14-Mar-2024
            • (2023)Efficient Exploration and Exploitation for Sequential Music RecommendationACM Transactions on Recommender Systems10.1145/3625827Online publication date: 27-Sep-2023
            • (2023)Knowledge Graph Convolutional Recommender Model with Collaborative Information and Common Neighbor Ranking Sampling2023 4th International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI)10.1109/ICHCI58871.2023.10278069(232-236)Online publication date: 4-Aug-2023

            View Options

            Get Access

            Login options

            View options

            PDF

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader

            HTML Format

            View this article in HTML Format.

            HTML Format

            Media

            Figures

            Other

            Tables

            Share

            Share

            Share this Publication link

            Share on social media