Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Predicting who rated what in large-scale datasets

Published: 01 December 2007 Publication History

Abstract

KDD Cup 2007 focuses on movie rating behaviors. The goal of the task "Who Rated What" is to predict whether "existing" users will review "existing" movies in the future. We cast the task as a link prediction problem and address it via a simple classification approach. Compared with other applications for link prediction, there are two major challenges in our task: (1) the huge size of the Netflix data; (2) the prediction target is complicated by many factors, such as a general decrease of interest in old movies and more tendency to review more movies by Netflix users due to the success of the internet DVD rental industries. We address the first challenge by "selective" subsampling and the second by combining information from the review scores, movie contents and graph topology effectively.

References

[1]
M. Berry, S. Dumais, and T. Letsche. Computational methods for intelligent access. In Proc. of Supercomputing'95, 1995.
[2]
G. Golub and C. V. Loan. Matrix Computations. Johns-Hopkins, Baltimore, second ed., 1989.
[3]
M. Hasan, V. Chaoji, S. Salem, and M. J. Zaki. Link prediction using supervised learning. In Proc. of SDM 06 workshop on Link Analysis, Counterrorism and Security, 2006.
[4]
T. Hastie, R. Tibshirani, and J. Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer; 1st ed., 2001.
[5]
Z. Huang. Link prediction based on graph topology: The predictive value of generalized clustering coefficient. In Proc. of KDD 06 workshop on Link Analysis: Dynamics and Static of Large Networks (LinkKDD2006), 2006.
[6]
H. Kashima and N. Abe. A parameterized probabilistic model of network evolution for supervised link prediction. In Proc. of the 2006 IEEE International Conference on Data Mining (ICDM 2006), 2006.
[7]
D. Liben-Nowell and J. Kleinberg. The link prediction problem for social networks. In Proc. of 12th International Conference on Information and Knowledge Management (CIKM), 2003.
[8]
B. Taskar, M.-F. Wong, P. Abbeel, and D. Koller. Link prediction in relational data. In Proc. of Advances in Neural Information Processing Systems (NIPS 2003), 2003.

Cited By

View all
  • (2024)Analysis and prediction of the Horizon 2020 R&D&I collaboration networkExpert Systems with Applications: An International Journal10.1016/j.eswa.2024.124417255:PBOnline publication date: 18-Oct-2024
  • (2023)Artificial Intelligence, Dynamic Capabilities, and Business OptimizationStreamlining Organizational Processes Through AI, IoT, Blockchain, and Virtual Environments10.4018/978-1-6684-8639-9.ch001(1-20)Online publication date: 30-Jun-2023
  • (2022)Graph regularized nonnegative matrix factorization for link prediction in directed temporal networks using PageRank centralityChaos, Solitons & Fractals10.1016/j.chaos.2022.112107159(112107)Online publication date: Jun-2022
  • Show More Cited By

Index Terms

  1. Predicting who rated what in large-scale datasets

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM SIGKDD Explorations Newsletter
    ACM SIGKDD Explorations Newsletter  Volume 9, Issue 2
    Special issue on visual analytics
    December 2007
    105 pages
    ISSN:1931-0145
    EISSN:1931-0153
    DOI:10.1145/1345448
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 December 2007
    Published in SIGKDD Volume 9, Issue 2

    Check for updates

    Author Tags

    1. KDD Cup
    2. Netflix
    3. link prediction

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 02 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Analysis and prediction of the Horizon 2020 R&D&I collaboration networkExpert Systems with Applications: An International Journal10.1016/j.eswa.2024.124417255:PBOnline publication date: 18-Oct-2024
    • (2023)Artificial Intelligence, Dynamic Capabilities, and Business OptimizationStreamlining Organizational Processes Through AI, IoT, Blockchain, and Virtual Environments10.4018/978-1-6684-8639-9.ch001(1-20)Online publication date: 30-Jun-2023
    • (2022)Graph regularized nonnegative matrix factorization for link prediction in directed temporal networks using PageRank centralityChaos, Solitons & Fractals10.1016/j.chaos.2022.112107159(112107)Online publication date: Jun-2022
    • (2022)A probabilistic perspective on nearest neighbor for implicit recommendationInternational Journal of Data Science and Analytics10.1007/s41060-022-00367-416:2(217-235)Online publication date: 29-Oct-2022
    • (2020)Exploiting implicit social relationships via dimension reduction to improve recommendation system performancePLOS ONE10.1371/journal.pone.023145715:4(e0231457)Online publication date: 22-Apr-2020
    • (2020)A Comparative Study of Classification Algorithms for Link Prediction2020 2nd International Conference on Innovative Mechanisms for Industry Applications (ICIMIA)10.1109/ICIMIA48430.2020.9074840(479-483)Online publication date: Mar-2020
    • (2020)Mining consuming Behaviors with Temporal Evolution for Personalized Recommendation in Mobile Marketing AppsMobile Networks and Applications10.1007/s11036-020-01535-1Online publication date: 6-Jun-2020
    • (2019)Dual network embedding for representing research interests in the link prediction problem on co-authorship networksPeerJ Computer Science10.7717/peerj-cs.1725(e172)Online publication date: 21-Jan-2019
    • (2019)Predicting Collaborations in Co-authorship Network2019 14th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP)10.1109/SMAP.2019.8864887(1-6)Online publication date: Jun-2019
    • (2019)Link Prediction Regression for Weighted Co-authorship NetworksAdvances in Computational Intelligence10.1007/978-3-030-20518-8_55(667-677)Online publication date: 16-May-2019
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media