Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2396761.2396849acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

The efficient imputation method for neighborhood-based collaborative filtering

Published: 29 October 2012 Publication History

Abstract

As each user tends to rate a small proportion of available items, the resulted Data Sparsity issue brings significant challenges to the research of recommender systems. This issue becomes even more severe for neighborhood-based collaborative filtering methods, as there are even lower numbers of ratings available in the neighborhood of the query item. In this paper, we aim to address the Data Sparsity issue in the context of the neighborhood-based collaborative filtering. Given the (user, item) query, a set of key ratings are identified, and an auto-adaptive imputation method is proposed to fill the missing values in the set of key ratings. The proposed method can be used with any similarity metrics, such as the Pearson Correlation Coefficient and Cosine-based similarity, and it is theoretically guaranteed to outperform the neighborhood-based collaborative filtering approaches. Results from experiments prove that the proposed method could significantly improve the accuracy of recommendations for neighborhood-based Collaborative Filtering algorithms.

References

[1]
G. Adomavicius and A. Tuzhilin. Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE transactions on knowledge and data engineering, 17(6):734--749, 2005.
[2]
N. S. Altman. An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression. The American Statistician, 46(3):175, Aug. 1992.
[3]
R. M. Bell and Y. Koren. Lessons from the Netflix Prize Challenge. SIGKDD Explorations, 9(2):75--79, 2007.
[4]
D. Billsus and M. Pazzani. Learning collaborative information filters. In Proceedings of the Fifteenth International Conference on Machine Learning, volume 54, page 48, 1998.
[5]
J. Breese, D. Heckerman, C. Kadie, and Others. Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the 14th conference on Uncertainty in Artificial Intelligence, pages 43--52, 1998.
[6]
L. M. D. Campos, J. M. Fernandez-luna, J. F. Huete, and M. A. Rueda-morales. Measuring Predictive Capability in Collaborative Filtering. In Proceedings of the 3rd ACM Conference on Recommender Systems, pages 313--316, 2009.
[7]
S. Chee, J. Han, and K. Wang. Rectree: An efficient collaborative filtering method. Data Warehousing and Knowledge Discovery, pages 141--151, 2001.
[8]
T. Cover. Estimation by the nearest neighbor rule. IEEE Transactions on Information Theory, pages 50--55, 1968.
[9]
C. Desrosiers and G. Karypis. A Novel Approach to Compute Similarities and Its Application to Item Recommendation. In Proceeding of PRICAI 2010, pages 39--51, 2010.
[10]
C. Desrosiers and G. Karypis. Chapter 4 A comprehensive Survey of Neighborhood-based Recommendation Methods, chapter 4, pages 107--144. Springer US, Boston, MA, 2011.
[11]
S. a. Goldman and M. K. Warmuth. Learning binary relations using weighted majority voting. Machine Learning, 20(3):245--271, Sept. 1995.
[12]
Y. Koren. Factorization meets the neighborhood: a multifaceted collaborative filtering model. In SIGKDD 2008, pages 426--434. ACM, 2008.
[13]
M. Larson and A. Hanjalic. Exploiting User Similarity based on Rated-Item Pools for Improved User-based Collaborative Filtering. In Proceedings of the 3rd ACM Conference on Recommender Systems, pages 125--132, 2009.
[14]
D. Lemire and A. Maclachlan. Slope one predictors for online rating-based collaborative filtering. In In SIAM Data Mining (SDM'05), volume 05, 2005.
[15]
R. Little. Missing-data adjustments in large surveys. Journal of Business & Economic Statistics, 6(3):287--296, 1988.
[16]
H. Ma, I. King, and M. R. Lyu. Effective missing data prediction for collaborative filtering. In SIGIR 2007, pages 39--46, New York, New York, USA, 2007. ACM Press.
[17]
M. R. Mclaughlin and J. L. Herlocker. A Collaborative Filtering Algorithm and Evaluation Metric that Accurately Model the User Experience. Proceeding of SIGIR 2004, 2004.
[18]
Y. Ren, G. Li, and W. Zhou. Learning Rating Patterns for Top-N Recommendations. In Proceeding of The IEEE/ACM International Conference on Social Networks Analysis and Mining (ASONAM 2012), 2012.
[19]
P. Resnick, N. Iacovou, M. Suchak, P. Bergstrom, and J. Riedl. GroupLens : An Open Architecture for Collaborative Filtering of Netnews. In ACM Conference on Computer Supported Cooperative Work, pages 175--186, 1994.
[20]
G. K. B. Richard A. Johnsom. Statistics: Principles and Methods. John Wiley and Sons, 6 edition, 2009.
[21]
D. Rubin. Multiple imputation for nonresponse in surveys. New York, USA: John Willey & Sons, 1987.
[22]
B. Sarwar, G. Karypis, J. Konstan, and J. Reidl. Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10th international conference on World Wide Web, pages 285--295. ACM, 2001.
[23]
X. Su and R. Greiner. A Mixture Imputation-Boosted Collaborative Filter. Artificial Intelligence, pages 312--317, 2008.
[24]
X. Su and T. Khoshgoftaar. Collaborative Filtering for Multi-class Data Using Belief Nets Algorithms. In 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06), pages 497--504. IEEE, Nov. 2006.
[25]
X. Su, T. Khoshgoftaar, and R. Greiner. Imputed Neighborhood Based Collaborative Filtering. In Web intelligence 2008, volume 1, pages 633--639. IEEE, 2008.
[26]
X. Su, T. M. Khoshgoftaar, X. Zhu, and R. Greiner.Imputation-boosted collaborative filtering using machine learning classifiers. Proceedings of the 2008 ACM symposium on Applied computing - SAC'08, (2):949, 2008.
[27]
J. Wang, A. P. de Vries, and M. J. T. Reinders. Unifying user-based and item-based collaborative filtering approaches by similarity fusion. In SIGIR 2006, pages 501--208, New York, New York, USA, 2006. ACM Press.
[28]
G.-R. Xue, C. Lin, Q. Yang, W. Xi, H.-J. Zeng, Y. Yu, and Z. Chen. Scalable collaborative filtering using cluster-based smoothing. In SIGIR 2005, pages 114--121, New York, New York, USA, 2005. ACM Press.
[29]
J. Zhang, Y. Xiang, Y. Wang, W. Zhou, Y. Xiang, and Y. Guan. Network Traffic Classification Using Correlation information. IEEE Transaction on Parallel and Distributed Systems, 2012.

Cited By

View all
  • (2024)Explaining Recommendation Fairness from a User/Item PerspectiveACM Transactions on Information Systems10.1145/369887743:1(1-30)Online publication date: 5-Oct-2024
  • (2024)Data augmentation and refinement for recommender system: A semi-supervised approach using maximum margin matrix factorizationExpert Systems with Applications10.1016/j.eswa.2023.121967238(121967)Online publication date: Mar-2024
  • (2023)SetRank: A Setwise Bayesian Approach for Collaborative Ranking in Recommender SystemACM Transactions on Information Systems10.1145/362619442:2(1-32)Online publication date: 3-Oct-2023
  • Show More Cited By

Index Terms

  1. The efficient imputation method for neighborhood-based collaborative filtering

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management
        October 2012
        2840 pages
        ISBN:9781450311564
        DOI:10.1145/2396761
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 29 October 2012

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. collaborative filtering
        2. imputation
        3. recommender systems

        Qualifiers

        • Research-article

        Conference

        CIKM'12
        Sponsor:

        Acceptance Rates

        Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

        Upcoming Conference

        CIKM '25

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)29
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 09 Jan 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)Explaining Recommendation Fairness from a User/Item PerspectiveACM Transactions on Information Systems10.1145/369887743:1(1-30)Online publication date: 5-Oct-2024
        • (2024)Data augmentation and refinement for recommender system: A semi-supervised approach using maximum margin matrix factorizationExpert Systems with Applications10.1016/j.eswa.2023.121967238(121967)Online publication date: Mar-2024
        • (2023)SetRank: A Setwise Bayesian Approach for Collaborative Ranking in Recommender SystemACM Transactions on Information Systems10.1145/362619442:2(1-32)Online publication date: 3-Oct-2023
        • (2023)Co-Training-Teaching: A Robust Semi-Supervised Framework for Review-Aware Rating RegressionACM Transactions on Knowledge Discovery from Data10.1145/362539118:2(1-16)Online publication date: 26-Sep-2023
        • (2023)Bootstrapped Personalized Popularity for Cold Start Recommender SystemsProceedings of the 17th ACM Conference on Recommender Systems10.1145/3604915.3608820(715-722)Online publication date: 14-Sep-2023
        • (2022)An Uncertainty-Aware Imputation Framework for Alleviating the Sparsity Problem in Collaborative FilteringProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557236(802-811)Online publication date: 17-Oct-2022
        • (2022)Research on Different Weights of Single-Valued Neutrosophic Sets in Recommendation System2022 5th International Conference on Data Science and Information Technology (DSIT)10.1109/DSIT55514.2022.9943972(01-06)Online publication date: 22-Jul-2022
        • (2021)The Footprint of Factorization Models and Their Applications in Collaborative FilteringACM Transactions on Information Systems10.1145/349047540:4(1-32)Online publication date: 29-Nov-2021
        • (2021)“How to get consensus with neighbors?”Knowledge-Based Systems10.1016/j.knosys.2021.107549234:COnline publication date: 25-Dec-2021
        • (2021)Semi-supervised collaborative filtering ensembleWorld Wide Web10.1007/s11280-021-00866-7Online publication date: 12-Mar-2021
        • Show More Cited By

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media