Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3097983.3098108acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Embedding-based News Recommendation for Millions of Users

Published: 13 August 2017 Publication History

Abstract

It is necessary to understand the content of articles and user preferences to make effective news recommendations. While ID-based methods, such as collaborative filtering and low-rank factorization, are well known for making recommendations, they are not suitable for news recommendations because candidate articles expire quickly and are replaced with new ones within short spans of time. Word-based methods, which are often used in information retrieval settings, are good candidates in terms of system performance but have issues such as their ability to cope with synonyms and orthographical variants and define "queries" from users' historical activities. This paper proposes an embedding-based method to use distributed representations in a three step end-to-end manner: (i) start with distributed representations of articles based on a variant of a denoising autoencoder, (ii) generate user representations by using a recurrent neural network (RNN) with browsing histories as input sequences, and (iii) match and list articles for users based on inner-product operations by taking system performance into consideration. The proposed method performed well in an experimental offline evaluation using past access data on Yahoo! JAPAN's homepage. We implemented it on our actual news distribution system based on these experimental results and compared its online performance with a method that was conventionally incorporated into the system. As a result, the click-through rate (CTR) improved by 23% and the total duration improved by 10%, compared with the conventionally incorporated method. Services that incorporated the method we propose are already open to all users and provide recommendations to over ten million individual users per day who make billions of accesses per month.

References

[1]
Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing.
[2]
Henriette Cramer. 2015. Effects of Ad Quality & Content-Relevance on Perceived Content Quality Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems.
[3]
Abhinandan S. Das, Mayur Datar, Ashutosh Garg, and Shyam Rajaram 2007. Google News Personalization: Scalable Online Collaborative Filtering Proceedings of the 16th International Conference on World Wide Web.
[4]
Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Xiang-Rui Wang, and Chih-Jen Lin 2008. LIBLINEAR: A library for large linear classification. The Journal of Machine Learning Research (2008).
[5]
Sepp Hochreiter. 1991. Untersuchungen zu dynamischen neuronalen Netzen. Diploma thesis. bibinfoschoolInstitut für Informatik, Lehrstuhl Prof. Brauer, Technische Universit"at München.
[6]
Sepp Hochreiter and Jürgen Schmidhuber 1997. Long short-term memory. Neural computation (1997).
[7]
Thorsten Joachims. 2002. Optimizing Search Engines Using Clickthrough Data. Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
[8]
Rafal Jozefowicz, Wojciech Zaremba, and Ilya Sutskever. 2015. An empirical exploration of recurrent network architectures Proceedings of the 32nd International Conference on Machine Learning.
[9]
Andrej Karpathy and Li Fei-Fei 2015. Deep visual-semantic alignments for generating image descriptions Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
[10]
Mounia Lalmas, Janette Lehmann, Guy Shaked, Fabrizio Silvestri, and Gabriele Tolomei 2015. Promoting Positive Post-Click Experience for In-Stream Yahoo Gemini Users Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
[11]
Quoc Le and Tomas Mikolov 2014. Distributed Representations of Sentences and Documents Proceedings of The 31st International Conference on Machine Learning.
[12]
Shumpei Okura, Yukihiro Tagami, and Akira Tajima. 2016. Article De-duplication Using Distributed Representations Proceedings of the 25th International Conference Companion on World Wide Web.
[13]
Razvan Pascanu, Tomas Mikolov, and Yoshua Bengio. 2014. On the difficulty of training recurrent neural networks Proceedings of The 30th International Conference on Machine Learning.
[14]
Jay Adams Paul Covington and Emre Sargin 2016. Deep Neural Networks for YouTube Recommendations. Proceedings of the 10th ACM Conference on Recommender Systems. New York, NY, USA.
[15]
Tara N Sainath, Oriol Vinyals, Andrew Senior, and Hasim Sak 2015. Convolutional, long short-term memory, fully connected deep neural networks Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on.
[16]
Tobias Schnabel, Igor Labutov, David Mimno, and Thorsten Joachims 2015. Evaluation methods for unsupervised word embeddings Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing.
[17]
Ilya Sutskever, Oriol Vinyals, and Quoc V Le. 2014. Sequence to Sequence Learning with Neural Networks Proceedings of Advances in Neural Information Processing Systems 27.
[18]
Yukihiro Tagami, Hayato Kobayashi, Shingo Ono, and Akira Tajima 2015. Modeling User Activities on the Web Using Paragraph Vector Proceedings of the 24th International Conference on World Wide Web.
[19]
Pascal Vincent, Hugo Larochelle, Yoshua Bengio, and Pierre-Antoine Manzagol 2008. Extracting and composing robust features with denoising autoencoders Proceedings of the 25th international conference on Machine learning.
[20]
Pascal Vincent, Hugo Larochelle, Isabelle Lajoie, Yoshua Bengio, and Pierre-Antoine Manzagol. 2010. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. The Journal of Machine Learning Research (2010).
[21]
Yuyu Zhang, Hanjun Dai, Chang Xu, Jun Feng, Taifeng Wang, Jiang Bian, Bin Wang, and Tie-Yan Liu. 2014. Sequential click prediction for sponsored search with recurrent neural networks Proceedings of the 28th AAAI Conference on Artificial Intelligence.
[22]
Erheng Zhong, Nathan Liu, Yue Shi, and Suju Rajan. 2015. Building Discriminative User Profiles for Large-scale Content Recommendation Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. endthebibliography

Cited By

View all
  • (2024)A News Recommendation Method for User Privacy ProtectionInternational Journal of Computer Science and Information Technology10.62051/ijcsit.v2n3.042:3(25-36)Online publication date: 28-May-2024
  • (2024)A Hotspot-Aware Personalized News Recommendation Mechanism Based on DistilBERT-TC-MAInternational Journal of Distributed Systems and Technologies10.4018/IJDST.33956515:1(1-19)Online publication date: 9-Apr-2024
  • (2024)A Hybrid News Recommendation Approach Based on Title–Content MatchingMathematics10.3390/math1213212512:13(2125)Online publication date: 6-Jul-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
August 2017
2240 pages
ISBN:9781450348874
DOI:10.1145/3097983
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 August 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. distributed representations
  2. large-scale services
  3. neural networks
  4. news recommendations

Qualifiers

  • Research-article

Conference

KDD '17
Sponsor:

Acceptance Rates

KDD '17 Paper Acceptance Rate 64 of 748 submissions, 9%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)219
  • Downloads (Last 6 weeks)27
Reflects downloads up to 12 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A News Recommendation Method for User Privacy ProtectionInternational Journal of Computer Science and Information Technology10.62051/ijcsit.v2n3.042:3(25-36)Online publication date: 28-May-2024
  • (2024)A Hotspot-Aware Personalized News Recommendation Mechanism Based on DistilBERT-TC-MAInternational Journal of Distributed Systems and Technologies10.4018/IJDST.33956515:1(1-19)Online publication date: 9-Apr-2024
  • (2024)A Hybrid News Recommendation Approach Based on Title–Content MatchingMathematics10.3390/math1213212512:13(2125)Online publication date: 6-Jul-2024
  • (2024)A Sampling-Based Method for Detecting Data Poisoning Attacks in Recommendation SystemsMathematics10.3390/math1202024712:2(247)Online publication date: 12-Jan-2024
  • (2024)News Recommendation Method Based on Candidate-Aware Long- and Short-Term Preference ModelingApplied Sciences10.3390/app1501030015:1(300)Online publication date: 31-Dec-2024
  • (2024)Personalized News Recommendation Method with Double-Layer Residual Connections and Double Multi-Head Self-Attention MechanismsApplied Sciences10.3390/app1413566714:13(5667)Online publication date: 28-Jun-2024
  • (2024)The Power of Linear Programming in Sponsored Listings Ranking: Evidence from Field ExperimentsSSRN Electronic Journal10.2139/ssrn.4767661Online publication date: 2024
  • (2024)Heterogeneous Graph Neural Network with Personalized and Adaptive Diversity for News RecommendationACM Transactions on the Web10.1145/364988618:3(1-33)Online publication date: 6-May-2024
  • (2024)EmbSum: Leveraging the Summarization Capabilities of Large Language Models for Content-Based RecommendationsProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688185(1010-1015)Online publication date: 8-Oct-2024
  • (2024)Starling: An I/O-Efficient Disk-Resident Graph Index Framework for High-Dimensional Vector Similarity Search on Data SegmentProceedings of the ACM on Management of Data10.1145/36392692:1(1-27)Online publication date: 26-Mar-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media