Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2124295.2124337acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Fair and balanced: learning to present news stories

Published: 08 February 2012 Publication History

Abstract

Relevance, diversity and personalization are key issues when presenting content which is apt to pique a user's interest. This is particularly true when presenting an engaging set of news stories. In this paper we propose an efficient algorithm for selecting a small subset of relevant articles from a streaming news corpus. It offers three key pieces of improvement over past work: 1) It is based on a detailed model of a user's viewing behavior which does not require explicit feedback. 2) We use the notion of submodularity to estimate the propensity of interacting with content. This improves over the classical context independent relevance ranking algorithms. Unlike existing methods, we learn the submodular function from the data. 3) We present an efficient online algorithm which can be adapted for personalization, story adaptation, and factorization models. Experiments show that our system yields a significant improvement over a retrieval system deployed in production.

References

[1]
R. Agrawal, S. Gollapudi, A. Halverson, and S. Ieong. Diversifying search results. In R.A. Baeza-Yates, P. Boldi, B.A. Ribeiro-Neto, and B.B. Cambazoglu, editors, WSDM, pages 5--14. ACM, 2009.
[2]
A. Ahmed, Q. Ho, C. H. Teo, J. Eisenstein, A. J. Smola and E. P. Xing. Online inference for the Infinite topic-cluster model: storylines from streaming text In AISTATS, 2011.
[3]
A. Bordes, L. Bottou, and P. Gallinari. SGD-QN: Careful quasi-Newton stochastic gradient descent. JMLR, 10:1737--1754, 2009.
[4]
O. Cappé and E. Moulines. Online expectation-maximization algorithm for latent data models. Journal of the Royal Statistics Society: Series B (Statistical Methodology), 71, 2009.
[5]
O. Chapelle and Y. Zhang. A dynamic bayesian network click model for web search ranking. In J. Quemada, G. León, Y.S. Maarek, and W. Nejdl, editors, WWW, pages 1--10. ACM, 2009.
[6]
N. Craswell, O. Zoeter, M. Taylor, and B. Ramsey. An experimental comparison of click position-bias models. In WSDM, pages 87--94, NY, USA, 2008. ACM.
[7]
G. Dupret and C. Liao. A model to estimate intrinsic document relevance from the clickthrough logs of a web search engine. In B. D. Davison, T. Suel, N. Craswell, and B. Liu, editors, WSDM, pages 181--1\90. ACM, 2010.
[8]
B. Edelman, M. Ostrovsky, and M. Schwarz. Internet advertising and the generalized second price auction: Selling billions of dollars worth of keywords. American Economic Review, 97(1):242--259, 2007.
[9]
K. El-Arini, G. Veda, D. Shahaf, and C. Guestrin. Turning down the noise in the blogosphere. In J. F. Elder IV, F. Fogelman-Soulié, P. A. Flach, and M. J. Zaki, editors, KDD, pages 289--298. ACM, 2009.
[10]
J. Kivinen and M. K. Warmuth. Exponentiated gradient versus gradient descent for linear predictors. Information and Computation, 132(1):1--64, 1997.
[11]
Y. Koren, R.M. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. IEEE Computer, 42(8):30--37, 2009.
[12]
A. Krause and C. Guestrin. A note on the budgeted maximization of submodular functions. Carnegie Mellon University-CALD 05--103, Carnegie Mellon University, 2005.
[13]
D. Lagun and E. Agichtein. Viewser: Enabling large-scale remote user studies of web search examination and interaction. In ACM SIGIR, 2011.
[14]
J. Leskovec, A. Krause, C. Guestrin, C. Faloutsos, J.M. VanBriesen, and N.S. Glance. Cost-effective outbreak detection in networks. In KDD, pages 420--429. ACM, 2007.
[15]
H. Lin and J. Bilmes. Multi-document summarization via budgeted maximization of submodular functions. In NAACL, pages 912--920. ACL, 2010.
[16]
G. Nemhauser, L. Wolsey, and M. Fisher. An analysis of the approximations for maximizing submodular set functions. Math.\ Programming, 14:265--294, 1978.
[17]
M. Richardson, A. Prakash, and E. Brill. Beyond pagerank: machine learning for static ranking. In L. Carr, D. De Roure, A. Iyengar, C.A. Goble, and M. Dahlin, editors, WWW, pages 707--715. 2006.
[18]
G. Salton. Automatic Text Processing. Addison-Wesley, Massachusetts, 1989.
[19]
A. Schrijver. Combinatorial Optimization, volume 24 of Algorithms and Combinatorics. Springer, 2003.

Cited By

View all
  • (2020)Personalization in text information retrievalJournal of the Association for Information Science and Technology10.1002/asi.2423471:3(349-369)Online publication date: 28-Jan-2020
  • (2019)In Search of a Stochastic Model for the E-News ReaderACM Transactions on Knowledge Discovery from Data10.1145/336269513:6(1-27)Online publication date: 13-Nov-2019
  • (2019)Phrase-guided attention web article recommendation for next clicks and viewsProceedings of the 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining10.1145/3341161.3342869(315-324)Online publication date: 27-Aug-2019
  • Show More Cited By

Index Terms

  1. Fair and balanced: learning to present news stories

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WSDM '12: Proceedings of the fifth ACM international conference on Web search and data mining
    February 2012
    792 pages
    ISBN:9781450307475
    DOI:10.1145/2124295
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 08 February 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. graphical models
    2. online learning
    3. personalization
    4. submodularity

    Qualifiers

    • Research-article

    Conference

    Acceptance Rates

    Overall Acceptance Rate 498 of 2,863 submissions, 17%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)14
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 30 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2020)Personalization in text information retrievalJournal of the Association for Information Science and Technology10.1002/asi.2423471:3(349-369)Online publication date: 28-Jan-2020
    • (2019)In Search of a Stochastic Model for the E-News ReaderACM Transactions on Knowledge Discovery from Data10.1145/336269513:6(1-27)Online publication date: 13-Nov-2019
    • (2019)Phrase-guided attention web article recommendation for next clicks and viewsProceedings of the 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining10.1145/3341161.3342869(315-324)Online publication date: 27-Aug-2019
    • (2019)Whole Page Optimization with Global ConstraintsProceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining10.1145/3292500.3330675(3153-3161)Online publication date: 25-Jul-2019
    • (2019)A Deep Temporal Collaborative Filtering Recommendation Framework via Joint Learning from Long and Short-Term Effects2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom)10.1109/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00139(959-966)Online publication date: Dec-2019
    • (2019)Stochastic Models to Improve E-News Recommender SystemsAdvances in Conceptual Modeling10.1007/978-3-030-34146-6_24(255-262)Online publication date: 27-Oct-2019
    • (2018)Fast greedy MAP inference for determinantal point process to improve recommendation diversityProceedings of the 32nd International Conference on Neural Information Processing Systems10.5555/3327345.3327465(5627-5638)Online publication date: 3-Dec-2018
    • (2018)Adaptive collaborative topic modeling for online recommendationProceedings of the 12th ACM Conference on Recommender Systems10.1145/3240323.3240363(338-346)Online publication date: 27-Sep-2018
    • (2018)A Deep Joint Network for Session-based News Recommendations with Contextual AugmentationProceedings of the 29th on Hypertext and Social Media10.1145/3209542.3209557(201-209)Online publication date: 3-Jul-2018
    • (2017)Recommending Personalized News in Short User SessionsProceedings of the Eleventh ACM Conference on Recommender Systems10.1145/3109859.3109894(121-129)Online publication date: 27-Aug-2017
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media