Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2661829.2662059acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Collaborative Filtering Incorporating Review Text and Co-clusters of Hidden User Communities and Item Groups

Published: 03 November 2014 Publication History
  • Get Citation Alerts
  • Abstract

    Most collaborative filtering (CF) algorithms only make use of the rating scores given by users for items. However, it is often the case that each rating score is associated with a piece of review text. Such review texts, which are capable of providing us valuable information to reveal the reasons why users give a certain rating, have not been exploited and they are usually ignored by most CF algorithms. Moreover, the underlying relationship buried in users and items has not been fully exploited. Items we would recommend can often be characterized into hidden groups (e.g. comedy, horror movie and action movie), and users can also be organized as hidden communities. We propose a new generative model to predict user's ratings on previously unrated items by considering review texts as well as hidden user communities and item groups relationship. Regarding the rating scores, traditional algorithms would not perform well on uncovering the community and group information of each user and each item since the user-item rating matrix is dyadic involving the mutual interactions between users and items. Instead, co-clustering, which is capable of conducting simultaneous clustering of two variables, is able to take advantage of such user-item relationships to better predict the rating scores. Additionally, co-clustering would be more effective for modeling the generation of review texts since different user communities would discuss different topics and vary their own wordings or expression patterns when dealing with different item groups. Besides, by modeling as a mixed membership over community and group respectively, each user or item can belong to multiple communities or groups with varying degrees. We have conducted extensive experiments to predict the missing rating scores on 22 real word datasets. The experimental results demonstrate the superior performance of our proposed model comparing with the state-of-the-art methods.

    References

    [1]
    A.Beutel, K.Murray, C.Faloutsos, and A.J. Smola. Cobafi: collaborative bayesian filtering. In WWW, pages 97--108, 2014.
    [2]
    A.Paterek. Improving regularized singular value decomposition for collaborative filtering. In Proceedings of KDD cup and workshop, volume 2007, pages 5--8, 2007.
    [3]
    B.Sarwar, G.Karypis, J.Konstan, and J.Riedl. Incremental singular value decomposition algorithms for highly scalable recommender systems. In ICIS, pages 27--28, 2002.
    [4]
    C.Tan, E.Chi, D.Huffaker, G.Kossinets, and A.Smola. Instant foodie: predicting expert ratings from grassroots. In CIKM, pages 1127--1136, 2013.
    [5]
    C.Wang and D.M. Blei. Collaborative topic modeling for recommending scientific articles. In SIGKDD, pages 448--456, 2011.
    [6]
    D.Agarwal and B.C.Chen. flda: matrix factorization through latent dirichlet allocation. In WSDM, pages 91--100, 2010.s
    [7]
    D.Agarwal and S.Merugu. Predictive discrete latent factor models for large scale dyadic data. In SIGKDD, pages 26--35, 2007.
    [8]
    D.DeCoste. Collaborative prediction using ensembles of maximum margin matrix factorizations. In ICML, pages 249--256, 2006.
    [9]
    A.Ng D.M.Blei and M.I.Jordan. Latent dirichlet allocation. JMLR, 3:993--1022, 2003.
    [10]
    G.Ganu, N.Elhadad, and A.Marian. Beyond the stars: Improving rating predictions using review text content. In WebDB, 2009.
    [11]
    H.Shan and A.Banerjee. Bayesian co-clustering. In ICDM, pages 530--539, 2008.
    [12]
    I.Dhillon, S.Mallela, and D.Modha. Information-theoretic co-clustering. In SIGKDD, pages 89--98, 2003.
    [13]
    I.Titov and R.McDonald. A joint model of text and aspect ratings for sentiment summarization. In ACL, pages 308--316, 2008.
    [14]
    I.Titov and R.McDonald. Modeling online reviews with multi-grain topic models. In WWW, pages 111--120, 2008.
    [15]
    J.Bobadilla, F.Ortega, A.Hernando, and A.Gutiérrez. Recommender systems survey. Knowledge-Based Systems, 46:109--132, 2013.
    [16]
    J.Hartigan. Direct clustering of a data matrix. Journal of the american statistical association, 67(337):123--129, 1972.
    [17]
    J.McAuley and J.Leskovec. From amateurs to connoisseurs: modeling the evolution of user expertise through online reviews. In WWW, pages 897--908, 2013.
    [18]
    J.McAuley and J.Leskovec. Hidden factors and hidden topics: understanding rating dimensions with review text. In RecSys, pages 165--172, 2013.
    [19]
    J.McAuley, J.Leskovec, and D.Jurafsky. Learning attitudes and attributes from multi-aspect reviews. In ICDM, pages 1020--1025, 2012.
    [20]
    J.Rennie and N.Srebro. Fast maximum margin matrix factorization for collaborative prediction. In ICML, pages 713--719, 2005.
    [21]
    J.Tenenbaum, V.Silva, and J.Langford. A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500):2319--2323, 2000.
    [22]
    J.Weston, S.Bengio, and N.Usunier. Wsabie: Scaling up to large vocabulary image annotation. In IJCAI, pages 2764--2770, 2011.
    [23]
    J.Lafferty K.Yu, S.Zhu and Y.Gong. Fast nonparametric matrix factorization for large-scale collaborative filtering. In SIGIR, pages 211--218, 2009.
    [24]
    M.Weimer, A.Karatzoglou, Q.V.Le, and A.Smola. Maximum margin matrix factorization for collaborative ranking. NIPS, 2007.
    [25]
    A.Karatzoglou M.Weimer and A.Smola. Improving maximum margin matrix factorization. Machine Learning, 72(3):263--276, 2008.
    [26]
    N.Aizenberg, Y.Koren, and O.Somekh. Build your own music recommender by modeling internet radio streams. In WWW, pages 1--10, 2012.
    [27]
    N.Srebro, J.D.Rennie, and T.Jaakkola. Maximum-margin matrix factorization. In NIPS, volume 17, pages 1329--1336, 2004.
    [28]
    N.Srebro, N.Alon, and T.Jaakkola. Generalization error bounds for collaborative prediction with low-rank matrices. In NIPS, 2004.
    [29]
    N.Srebro and T.Jaakkola. Weighted low-rank approximations. In ICML, volume 3, pages 720--727, 2003.
    [30]
    R.Bell, Y.Koren, and C.Volinsky. Modeling relationships at multiple scales to improve accuracy of large recommender systems. In SIGKDD, pages 95--104, 2007.
    [31]
    R.Salakhutdinov and M.Andriy. Probabilistic matrix factorization. In NIPS, volume 1, pages 2--1, 2007.
    [32]
    R.Salakhutdinov and M.Andriy. Bayesian probabilistic matrix factorization using markov chain monte carlo. In ICML, pages 880--887, 2008.
    [33]
    S.Geman and D.Geman. Stochastic relaxation, gibbs distributions, and the bayesian restoration of images. Pattern Analysis and Machine Intelligence, IEEE Transactions on, (6):721--741, 1984.
    [34]
    S.Lacoste-Julien, F.Sha, and M.Jordan. Disclda: Discriminative learning for dimensionality reduction and classification. In NIPS, volume 83, page 85, 2008.
    [35]
    S.Yang, B.Long, A.Smola, H.Zha, and Z.Zheng. Collaborative competitive filtering: learning recommender using context of user choice. In SIGIR, pages 295--304, 2011.
    [36]
    T.George and S.Merugu. A scalable collaborative filtering framework based on co-clustering. In ICDM, pages 4--pp, 2005.
    [37]
    T.L.Griffiths and M.Steyvers. Finding scientific topics. Proceedings of the National academy of Sciences of the United States of America, 101:5228--5235, 2004.
    [38]
    X.Wang and A.McCallum. Topics over time: a non-markov continuous-time model of topical trends. In SIGKDD, pages 424--433, 2006.
    [39]
    Y.Cheng and G.Church. Biclustering of expression data. In Ismb, volume 8, pages 93--103, 2000.
    [40]
    Y.Koren. Factorization meets the neighborhood: a multifaceted collaborative filtering model. In SIGKDD, pages 426--434, 2008.
    [41]
    Y.Koren, R.Bell, and C.Volinsky. Matrix factorization techniques for recommender systems. Computer, 42(8):30--37, 2009.

    Cited By

    View all
    • (2024)A type-2 fuzzy review topic-based model for personalized recommendationElectronic Commerce Research10.1007/s10660-024-09829-2Online publication date: 9-Apr-2024
    • (2021)Product Recommendation Algorithm for Score Prediction Based on Joint Feature Vector Extraction2021 4th International Conference on Data Science and Information Technology10.1145/3478905.3478916(55-61)Online publication date: 23-Jul-2021
    • (2020)Ratings meet reviews in the monitoring of online products and servicesJournal of Quality Technology10.1080/00224065.2020.182921654:2(197-214)Online publication date: 22-Oct-2020
    • Show More Cited By

    Index Terms

    1. Collaborative Filtering Incorporating Review Text and Co-clusters of Hidden User Communities and Item Groups

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management
          November 2014
          2152 pages
          ISBN:9781450325981
          DOI:10.1145/2661829
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 03 November 2014

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. co-clustering
          2. collaborative filtering
          3. item group
          4. topic model
          5. user community

          Qualifiers

          • Research-article

          Funding Sources

          Conference

          CIKM '14
          Sponsor:

          Acceptance Rates

          CIKM '14 Paper Acceptance Rate 175 of 838 submissions, 21%;
          Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

          Upcoming Conference

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)7
          • Downloads (Last 6 weeks)2
          Reflects downloads up to 26 Jul 2024

          Other Metrics

          Citations

          Cited By

          View all
          • (2024)A type-2 fuzzy review topic-based model for personalized recommendationElectronic Commerce Research10.1007/s10660-024-09829-2Online publication date: 9-Apr-2024
          • (2021)Product Recommendation Algorithm for Score Prediction Based on Joint Feature Vector Extraction2021 4th International Conference on Data Science and Information Technology10.1145/3478905.3478916(55-61)Online publication date: 23-Jul-2021
          • (2020)Ratings meet reviews in the monitoring of online products and servicesJournal of Quality Technology10.1080/00224065.2020.182921654:2(197-214)Online publication date: 22-Oct-2020
          • (2020)AGTR: Adversarial Generation of Target Review for Rating PredictionData Science and Engineering10.1007/s41019-020-00141-15:4(346-359)Online publication date: 17-Sep-2020
          • (2020)Adversarial Generation of Target Review for Rating PredictionDatabase Systems for Advanced Applications10.1007/978-3-030-59416-9_5(73-89)Online publication date: 22-Sep-2020
          • (2019)Recommendation system in social networks with topical attention and probabilistic matrix factorizationPLOS ONE10.1371/journal.pone.022396714:10(e0223967)Online publication date: 31-Oct-2019
          • (2019)Attentive Aspect Modeling for Review-Aware RecommendationACM Transactions on Information Systems10.1145/330954637:3(1-27)Online publication date: 27-Mar-2019
          • (2019)Recommendation Based on Review Texts and Social Communities: A Hybrid ModelIEEE Access10.1109/ACCESS.2019.28975867(40416-40427)Online publication date: 2019
          • (2019)MMM: Multi-source Multi-net Micro-video Recommendation with Clustered Hidden Item Representation LearningData Science and Engineering10.1007/s41019-019-00101-44:3(240-253)Online publication date: 6-Sep-2019
          • (2019)HHMF: hidden hierarchical matrix factorization for recommender systemsData Mining and Knowledge Discovery10.1007/s10618-019-00632-4Online publication date: 27-May-2019
          • Show More Cited By

          View Options

          Get Access

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media