Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

In Search of a Stochastic Model for the E-News Reader

Published: 13 November 2019 Publication History

Abstract

E-news readers have increasingly at their disposal a broad set of news articles to read. Online newspaper sites use recommender systems to predict and to offer relevant articles to their users. Typically, these recommender systems do not leverage users’ reading behavior. If we know how the topics-reads change in a reading session, we may lead to fine-tuned recommendations, for example, after reading a certain number of sports items, it may be counter-productive to keep recommending other sports news. The motivation for this article is the assumption that understanding user behavior when reading successive online news articles can help in developing better recommender systems. We propose five categories of stochastic models to describe this behavior depending on how the previous reading history affects the future choices of topics. We instantiated these five classes with many different stochastic processes covering short-term memory, revealed-preference, cumulative advantage, and geometric sojourn models. Our empirical study is based on large datasets of E-news from two online newspapers. We collected data from more than 13 million users who generated more than 23 million reading sessions, each one composed by the successive clicks of the users on the posted news. We reduce each user session to the sequence of reading news topics. The models were fitted and compared using the Akaike Information Criterion and the Brier Score. We found that the best models are those in which the user moves through topics influenced only by their most recent readings. Our models were also better to predict the next reading than the recommender systems currently used in these journals showing that our models can improve user satisfaction.

References

[1]
Eugene Agichtein, Eric Brill, and Susan Dumais. 2006. Improving web search ranking by incorporating user behavior information. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’06). ACM, Seattle, Washington, 19--26.
[2]
Amr Ahmed, Choon Hui Teo, S. V. N. Vishwanathan, and Alex Smola. 2012. Fair and balanced: Learning to present news stories. In Proceedings of the 5th ACM International Conference on Web Search and Data Mining (WSDM’12). ACM, Seattle, Washington, 333--342.
[3]
Hirotugu Akaike. 1974. A new look at the statistical model identification. IEEE Transactions on Automatic Control 19, 6 (December 1974), 716--723.
[4]
Xiao Bai, B. Barla Cambazoglu, Francesco Gullo, Amin Mantrach, and Fabrizio Silvestri. 2017. Exploiting search history of users for news personalization. Information Sciences 385, C (2017), 125--137.
[5]
Trapit Bansal, Mrinal Das, and Chiranjib Bhattacharyya. 2015. Content driven user profiling for comment-worthy recommendations of news and blog articles. In Proceedings of the 9th ACM Conference on Recommender Systems (RecSys’15). ACM, Vienna, Austria, 195--202.
[6]
Alexander Belenky. 2007. The Editor as Curator. Retrieved from https://www.theguardian.com/commentisfree/2007/dec/28/theeditorascurator.
[7]
Mikhail Bilenko and Ryen W. White. 2008. Mining the search trails of surfing crowds: Identifying relevant websites from user activity. In Proceedings of the 17th International Conference on World Wide Web (WWW’08). ACM, Beijing, China, 51--60.
[8]
Toine Bogers and Antal van den Bosch. 2007. Comparing and evaluating information retrieval algorithms for news recommendation. In Proceedings of the 2007 ACM Conference on Recommender Systems (RecSys’07). ACM, Minneapolis, MN, 141--144.
[9]
Glenn W. Brier. 1950. Verification of forecasts expressed in terms of probability. Monthly Weather Review 78, 1 (1950), 1--3.
[10]
Kenneth P. Burnham and David R. Anderson. 2003. Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach. Springer, New York, NY.
[11]
Pedro G. Campos, Fernando Díez, and Iván Cantador. 2014. Time-aware recommender systems: A comprehensive survey and analysis of existing evaluation protocols. User Modeling and User-Adapted Interaction 24, 1 (01 February 2014), 67--119.
[12]
Baojiang Chen, Grace Y. Yi, and Richard J. Cook. 2009. Likelihood analysis of joint marginal and conditional models for longitudinal categorical data. Canadian Journal of Statistics 37, 2 (2009), 182--205.
[13]
Xi Chen, Kaoru Irie, David Banks, Robert Haslinger, Jewell Thomas, and Mike West. 2018. Scalable Bayesian modeling, monitoring, and analysis of dynamic network flow data. Journal of the American Statistical Association 113, 522 (2018), 519--533.
[14]
Mark Claypool, Tim Miranda, Anuja Gokhale, Tim Mir, Pavel Murnikov, Dmitry Netes, and Matthew Sartin. 1999. Combining content-based and collaborative filters in an online newspaper. In Proceedings of ACM SIGIR Workshop on Recommender Systems: Implementarion and Evaluation. ACM, New York.
[15]
Abhinandan S. Das, Mayur Datar, Ashutosh Garg, and Shyam Rajaram. 2007. Google news personalization: Scalable online collaborative filtering. In Proceedings of the 16th International Conference on World Wide Web (WWW’07). ACM, Banff, Alberta, Canada, 271--280.
[16]
Gianmarco De Francisci Morales, Aristides Gionis, and Claudio Lucchese. 2012. From chatter to headlines: Harnessing the real-time web for personalized news recommendation. In Proceedings of the 5th ACM International Conference on Web Search and Data Mining (WSDM’12). ACM, Seattle, Washington, 153--162.
[17]
Elena Viorica Epure, Benjamin Kille, Jon Espen Ingvaldsen, Rebecca Deneckere, Camille Salinesi, and Sahin Albayrak. 2017. Modeling the dynamics of online news reading interests. In Proceedings of the 25th Conference on User Modeling, Adaptation and Personalization (UMAP’17). ACM, Bratislava, Slovakia, 363--364.
[18]
Elena Viorica Epure, Benjamin Kille, Jon Espen Ingvaldsen, Rebecca Deneckere, Camille Salinesi, and Sahin Albayrak. 2017. Recommending personalized news in short user sessions. In Proceedings of the 11th ACM Conference on Recommender Systems (RecSys’17). ACM, Como, Italy, 121--129.
[19]
Cagdas Esiyok, Benjamin Kille, Brijnesh-Johannes Jain, Frank Hopfgartner, and Sahin Albayrak. 2014. Users’ reading habits in online news portals. In Proceedings of the 5th Information Interaction in Context Symposium (IIiX’14). ACM, Regensburg, Bavaria, Germany, 263--266.
[20]
Sylvia Frühwirth-Schnatter, Stefan Pittner, Andrea Weber, and Rudolf Winter-Ebmer. 2018. Analysing plant closure effects using time-varying mixture-of-experts Markov chain clustering. The Annals of Applied Statistics 12, 3 (09 2018), 1796--1830. https://doi.org/10.1214/17-AOAS1132
[21]
Peter Guttorp and Vladimir N. Minin. 2018. Stochastic Modeling of Scientific Data. CRC Press.
[22]
José Hernández-Orallo, Peter Flach, and Cèsar Ferri. 2011. Brier curves: A new cost-based visualisation of classifier performance. In Proceedings of the 28th International Conference on Machine Learning (ICML’11). Lise Getoor and Tobias Scheffer (Eds.), ACM, Bellevue, Washington, 585--592.
[23]
José Hernández-Orallo, Peter Flach, and Cèsar Ferri. 2012. A unified view of performance metrics: Translating threshold choice into expected classification loss. Journal of Machine Learning Research 13, Oct (2012), 2813--2869.
[24]
Cheng-Kang Hsieh, Longqi Yang, Honghao Wei, Mor Naaman, and Deborah Estrin. 2016. Immersive recommendation: News and event recommendations using personal digital traces. In Proceedings of the 25th International Conference on World Wide Web (WWW’16). International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, 51--62.
[25]
Yifan Hu, Yehuda Koren, and Chris Volinsky. 2008. Collaborative filtering for implicit feedback datasets. In Proceedings of the 8th IEEE International Conference on Data Mining (ICDM’08). IEEE, 263--272.
[26]
Yajie Hu and Mitsunori Ogihara. 2011. NextOne player: A music recommendation system based on user behavior. In Proceedings of the 12th International Society for Music Information Retrieval Conference, Vol. 11. University of Miami, Miami, Florida, 103--108.
[27]
Yaser Keneshloo, Shuguang Wang, Eui-Hong Han, and Naren Ramakrishnan. 2016. Predicting the popularity of news articles. In Proceedings of the 2016 SIAM International Conference on Data Mining. SIAM, Miami, Florida, 441--449.
[28]
Dmitry Lagun and Mounia Lalmas. 2016. Understanding user attention and engagement in online news reading. In Proceedings of the 9th ACM International Conference on Web Search and Data Mining (WSDM’16). ACM, San Francisco, CA, 113--122.
[29]
Vadim Lavrusik. 2009. 12 Things Newspapers Should Do to Survive. Retrieved from http://mashable.com/2009/08/14/newspaper-survival/#LKdF2UTVJuqy.
[30]
Lihong Li, Wei Chu, John Langford, and Robert E. Schapire. 2010. A contextual-bandit approach to personalized news article recommendation. In Proceedings of the 19th International Conference on World Wide Web (WWW’10). ACM, Raleigh, North Carolina, 661--670.
[31]
Lei Li, Li Zheng, Fan Yang, and Tao Li. 2014. Modeling and broadening temporal user interest in personalized news recommendation. Expert Systems with Applications 41, 7 (2014), 3168--3177.
[32]
Yiqun Liu, Rongwei Cen, Min Zhang, Shaoping Ma, and Liyun Ru. 2008. Identifying web spam with user behavior analysis. In Proceedings of the 4th International Workshop on Adversarial Information Retrieval on the Web (AIRWeb’08). ACM, Madrid, Spain, 9--16.
[33]
Ziming Liu. 2005. Reading behavior in the digital environment: Changes in reading behavior over the past ten years. Journal of Documentation 61, 6 (2005), 700--712.
[34]
Ramon Lopes, Renato Assunção, and Rodrygo L. T. Santos. 2016. Efficient Bayesian methods for graph-based recommendation. In Proceedings of the 10th ACM Conference on Recommender Systems. ACM, 333--340.
[35]
Allan H. Murphy. 1973. A new vector partition of the probability score. Journal of Applied Meteorology 12, 4 (1973), 595--600.
[36]
Scott E. Page. 2018. The Model Thinker. Hachette, UK.
[37]
Giovanni Parmigiani and Lurdes Inoue. 2009. Decision Theory: Principles and Approaches, Vol. 812. John Wiley 8 Sons.
[38]
Owen Phelan, Kevin McCarthy, Mike Bennett, and Barry Smyth. 2011. Terms of a feather: Content-based news recommendation and discovery using twitter. In Proceedings of the European Conference on Advances in Information Retrieval (ECIR’11). Springer, Berlin, Germany, 448--459.
[39]
Mark Pinsky and Samuel Karlin. 2010. An Introduction to Stochastic Modeling. Academic press.
[40]
Julio Reis, Fabricio Benevenuto, P. Vaz de Melo, Raquel Prates, Haewoon Kwak, and Jisun An. 2015. Breaking the news: First impressions matter on online news. In Proceedings of the 9th International AAAI Conference on Web-Blogs and Social Media. Association for the Advancement of Artificial Intelligence, Oxford, UK, 357--366.
[41]
Mozhgan Tavakolifard, Jon Atle Gulla, Kevin C. Almeroth, Frank Hopfgartner, Benjamin Kille, Till Plumbaum, Andreas Lommatzsch, Torben Brodt, Arthur Bucko, and Tobias Heintz. 2013. Workshop and challenge on news recommender systems. In Proceedings of the 7th ACM Conference on Recommender Systems (RecSys’13). ACM, Hong Kong, China, 481--482.
[42]
William Trouleau, Azin Ashkan, Weicong Ding, and Brian Eriksson. 2016. Just one more: Modeling binge watching behavior. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’16). ACM, San Francisco, California, 1215--1224.

Cited By

View all
  • (2023)Realtime News Analysis using Natural Language Processing2023 4th International Conference for Emerging Technology (INCET)10.1109/INCET57972.2023.10170350(1-6)Online publication date: 26-May-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Knowledge Discovery from Data
ACM Transactions on Knowledge Discovery from Data  Volume 13, Issue 6
December 2019
282 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/3366748
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 November 2019
Accepted: 01 September 2019
Revised: 01 July 2019
Received: 01 December 2018
Published in TKDD Volume 13, Issue 6

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Modeling user behavior
  2. online newspapers
  3. stochastic models

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES)
  • Fundação de Amparo à Pesquisa do Estado de Minas Gerais (FAPEMIG)
  • Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
  • Universidade Federal de Ouro Preto

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 31 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Realtime News Analysis using Natural Language Processing2023 4th International Conference for Emerging Technology (INCET)10.1109/INCET57972.2023.10170350(1-6)Online publication date: 26-May-2023

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media