Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2645710.2645754acmconferencesArticle/Chapter ViewAbstractPublication PagesrecsysConference Proceedingsconference-collections
research-article

Bayesian binomial mixture model for collaborative prediction with non-random missing data

Published: 06 October 2014 Publication History

Abstract

Collaborative prediction involves filling in missing entries of a user-item matrix to predict preferences of users based on their observed preferences. Most of existing models assume that the data is missing at random (MAR), which is often violated in recommender systems in practice. Incorrect assumption on missing data ignores the missing data mechanism, leading to biased inferences and prediction. In this paper we present a Bayesian binomial mixture model for collaborative prediction, where the generative process for data and missing data mechanism are jointly modeled to handle non-random missing data. Missing data mechanism is modeled by three factors, each of which is related to users, items, and rating values. Each factor is modeled by Bernoulli random variable, and the observation of rating value is determined by the Boolean OR operation of three binary variables. We develop computationally-efficient variational inference algorithms, where variational parameters have closed-form update rules and the computational complexity depends on the number of observed ratings, instead of the size of the rating data matrix. We also discuss implementation issues on hyperparameter tuning and estimation based on empirical Bayes. Experiments on Yahoo! Music and MovieLens datasets confirm the useful behavior of our model by demonstrating that: (1) it outperforms state-of-the-art methods in yielding higher predictive performance; (2) it finds meaningful solutions instead of undesirable boundary solutions; (3) it provides rating trend analysis on why ratings are observed.

Supplementary Material

JPG File (p201-sidebyside.jpg)
MP4 File (p201-sidebyside.mp4)

References

[1]
E. Candès and T. Tao. The power of convex relaxation: Near-optimal matrix completion. IEEE Transactions on Information Theory, 56(5):2053--2080, 2009.
[2]
G. Dror, N. Koenigstein, Y. Koren, and M. Weimer. The yahoo! music dataset and KDD-Cup'11. In Proceedings of KDD Cup and Workshop, 2011.
[3]
Y.-D. Kim and S. Choi. Variational Bayesian view of weighted trace norm regularization for matrix factorization. IEEE Signal Processing Letters, 20(3):261--264, 2013.
[4]
Y.-D. Kim and S. Choi. Scalable variational Bayesian matrix factorization with side information. In Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS), Reykjavik, Iceland, 2014.
[5]
Y. J. Lim and Y. W. Teh. Variational Bayesian approach to movie rating prediction. In Proceedings of KDD Cup and Workshop, San Jose, CA, 2007.
[6]
G. Ling, H. Yang, M. R. Lyu, and I. King. Response aware model-based collaborative filtering. In Proceedings of the Annual Conference on Uncertainty in Artificial Intelligence (UAI), Catalina Island, California, USA, 2012.
[7]
R. J. A. Little and D. B. Rubin. Statistical Analysis with Missing Data. John Willey & Sons, Inc., 1987.
[8]
B. M. Marlin. Modeling user rating profiles for collaborative filtering. In Advances in Neural Information Processing Systems (NIPS), volume 16, 2004.
[9]
B. M. Marlin. Missing Data Problems in Machine Learning. PhD thesis, University of Toronto, 2008.
[10]
B. M. Marlin and R. S. Zemel. Collaborative prediction and ranking with non-random missing data. In Proceedings of the ACM International Conference on Recommender Systems (RecSys), New York, New York, USA, 2009.
[11]
B. M. Marlin, R. S. Zemel, S. T. Roweis, and M. Slaney. Collaborative filtering and the missing at random assumption. In Proceedings of the Annual Conference on Uncertainty in Artificial Intelligence (UAI), Vancouver, Canada, 2007.
[12]
T. Minka. Estimating a Dirichlet distribution. Technical report, Microsoft Research, 2000.
[13]
U. Paquet and N. Koenigstein. One-class collaborative filtering with random graphs. In Proceedings of the International Conference on World Wide Web (WWW), Rio de Janeiro, Brazil, 2013.
[14]
S. Park, Y.-D. Kim, and S. Choi. Hierarchical Bayesian matrix factorization with side information. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), Beijing, China, 2013.
[15]
T. Raiko, A. Ilin, and J. Karhunen. Principal component analysis for large scale problems with lots of missing values. In Proceedings of the European Conference on Machine Learning (ECML), pages 691--698, Warsaw, Poland, 2007.
[16]
R. Salakhutdinov and A. Mnih. Bayesian probablistic matrix factorization using MCMC. In Proceedings of the International Conference on Machine Learning (ICML), Helsinki, Finland, 2008.
[17]
R. Salakhutdinov, A. Mnih, and G. Hinton. Restricted Boltzmann machines for collaborative filtering. In Proceedings of the International Conference on Machine Learning (ICML), Corvallis, OR, USA, 2007.
[18]
R. Salakhutdinov and N.Srebro. Collaborative filtering in a non-uniform world: Learning with weighted trace norm. In Advances in Neural Information Processing Systems (NIPS), volume 23. MIT Press, 2010.
[19]
H. Steck. Training and testing of recommender systems on data missing not at random. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), Washington, DC, USA, 2010.
[20]
J. Wu. Binomial matrix factorization for discrete collaborative filtering. In Proceedings of the IEEE International Conference on Data Mining (ICDM), Miami, Florida, USA, 2009.
[21]
J. Yoo and S. Choi. Bayesian matrix co-factorization: Variational algorithm and Cramér-Rao bound. In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD), Athens, Greece, 2011.

Cited By

View all
  • (2023)A novel neighbor selection scheme based on dynamic evaluation towards recommender systemsScience Progress10.1177/00368504231180090106:2Online publication date: 8-Jun-2023
  • (2023)ST-MoE: Spatio-Temporal Mixture-of-Experts for Debiasing in Traffic PredictionProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615068(1208-1217)Online publication date: 21-Oct-2023
  • (2023)CDR: Conservative Doubly Robust Learning for Debiased RecommendationProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614805(2321-2330)Online publication date: 21-Oct-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
RecSys '14: Proceedings of the 8th ACM Conference on Recommender systems
October 2014
458 pages
ISBN:9781450326681
DOI:10.1145/2645710
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 October 2014

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. collaborative filtering
  2. non-random missing data
  3. probabilistic models
  4. recommender systems
  5. variational bayesian inference

Qualifiers

  • Research-article

Funding Sources

Conference

RecSys'14
Sponsor:
RecSys'14: Eighth ACM Conference on Recommender Systems
October 6 - 10, 2014
California, Foster City, Silicon Valley, USA

Acceptance Rates

RecSys '14 Paper Acceptance Rate 35 of 234 submissions, 15%;
Overall Acceptance Rate 254 of 1,295 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)21
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)A novel neighbor selection scheme based on dynamic evaluation towards recommender systemsScience Progress10.1177/00368504231180090106:2Online publication date: 8-Jun-2023
  • (2023)ST-MoE: Spatio-Temporal Mixture-of-Experts for Debiasing in Traffic PredictionProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615068(1208-1217)Online publication date: 21-Oct-2023
  • (2023)CDR: Conservative Doubly Robust Learning for Debiased RecommendationProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614805(2321-2330)Online publication date: 21-Oct-2023
  • (2023)Bias and Debias in Recommender System: A Survey and Future DirectionsACM Transactions on Information Systems10.1145/356428441:3(1-39)Online publication date: 7-Feb-2023
  • (2023)A Transformer-based Patient Clustering Method in Continuing Care Retirement Communities2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)10.1109/BIBM58861.2023.10385837(684-689)Online publication date: 5-Dec-2023
  • (2022)Recommender systems, ground truth, and preference pollutionAI Magazine10.1002/aaai.1205543:2(177-189)Online publication date: 23-Jun-2022
  • (2021)Spiral of Silence and Its Application in Recommender SystemsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.3013973(1-1)Online publication date: 2021
  • (2021)Learning from missing data with the binary latent block modelStatistics and Computing10.1007/s11222-021-10058-y32:1Online publication date: 20-Dec-2021
  • (2021)A sampling approach to Debiasing the offline evaluation of recommender systemsJournal of Intelligent Information Systems10.1007/s10844-021-00651-y58:2(311-336)Online publication date: 10-Jul-2021
  • (2020)Debiased offline evaluation of recommender systemsProceedings of the 35th Annual ACM Symposium on Applied Computing10.1145/3341105.3375759(1435-1442)Online publication date: 30-Mar-2020
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media