Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2736277.2741091acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

ACCAMS: Additive Co-Clustering to Approximate Matrices Succinctly

Published: 18 May 2015 Publication History

Abstract

Matrix completion and approximation are popular tools to capture a user's preferences for recommendation and to approximate missing data. Instead of using low-rank factorization we take a drastically different approach, based on the simple insight that an additive model of co-clusterings allows one to approximate matrices efficiently. This allows us to build a concise model that, per bit of model learned, significantly beats all factorization approaches in matrix completion. Even more surprisingly, we find that summing over small co-clusterings is more effective in modeling matrices than classic co-clustering, which uses just one large partitioning of the matrix. Following Occam's razor principle, the fact that our model is more concise and yet just as accurate as more complex models suggests that it better captures the latent preferences and decision making processes present in the real world. We provide an iterative minimization algorithm, a collapsed Gibbs sampler, theoretical guarantees for matrix approximation, and excellent empirical evidence for the efficacy of our approach. We achieve state-of-the-art results for matrix completion on Netflix at a fraction of the model complexity.

References

[1]
E. M. Airoldi, D. M. Blei, S. E. Fienberg, and E. P. Xing. Mixed-membership stochastic blockmodels. Journal of Machine Learning Research, 9:1981--2014, 2008.
[2]
A. Banerjee, I. Dhillon, J. Ghosh, S. Merugu, and D. S. Modha. A generalized maximum entropy approach to bregman co-clustering and matrix approximation. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 509--514. ACM, 2004.
[3]
A. Beutel, K. Murray, C. Faloutsos, and A. J. Smola. CoBaFi: Collaborative Bayesian Filtering. In World Wide Web Conference, pages 97--108, 2014.
[4]
A. Gittens and M. W. Mahoney. Revisiting the nystrom method for improved large-scale machine learning. In ICML (3), pages 567--575, 2013.
[5]
D. Gorur, F. Jakel, and C. E. Rasmussen. A choice model with infinitely many latent features. In Proceedings of the 23rd international conference on Machine learning, pages 361--368. ACM, 2006.
[6]
T. Griffiths and Z. Ghahramani. The indian buffet process: An introduction and review. Journal of Machine Learning Research, 12:1185--1224, 2011.
[7]
N. Halko, P. Martinsson, and J. A. Tropp. Finding structure with randomness: Stochastic algorithms for constructing approximate matrix decompositions, 2009. oai:arXiv.org:0909.4061.
[8]
J. A. Hartigan. Direct clustering of a data matrix. Journal of the american statistical association, 67(337):123--129, 1972.
[9]
Y. Koren. Factorization meets the neighborhood: a multifaceted collaborative filtering model. In Knowledge discovery and data mining KDD, pages 426--434, 2008.
[10]
Y. Koren, R. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. IEEE Computer, 42(8):30--37, 2009.
[11]
D. Koutra, U. Kang, J. Vreeken, and C. Faloutsos. VOG: Summarizing and Understanding Large Graphs, chapter 11, pages 91--99. SIAM, 2014.
[12]
A. Kyrola, G. Blelloch, and C. Guestrin. GraphChi: Large-scale graph computation on just a pc. In OSDI, Hollywood, CA, 2012.
[13]
J. Lee, S. Kim, G. Lebanon, and Y. Singer. Local low-rank matrix approximation. In Proceedings of The 30th International Conference on Machine Learning, pages 82--90, 2013.
[14]
M. Leeuwen, J. Vreeken, and A. Siebes. Identifying the components. Data Mining and Knowledge Discovery, 19(2):176--193, 2009.
[15]
M. Li, G. L. Miller, and R. Peng. Iterative row sampling. In FOCS 2013, pages 127--136, 2013.
[16]
L. W. Mackey, M. I. Jordan, and A. Talwalkar. Divide-and-conquer matrix factorization. In Advances in Neural Information Processing Systems, pages 1134--1142, 2011.
[17]
K. Palla, D. Knowles, and Z. Ghahramani. An infinite latent attribute model for network data. In International Conference on Machine Learning, 2012.
[18]
E. E. Papalexakis, N. D. Sidiropoulos, and R. Bro. From k-means to higher-way co-clustering: multilinear decomposition with sparse latent factors. Signal Processing, IEEE Transactions on, 61(2):493--506, 2013.
[19]
I. Porteous, E. Bart, and M. Welling. Multi-HDP: A non parametric bayesian model for tensor factorization. In D. Fox and C. Gomes, editors, Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, pages 1487--1490. AAAI Press, 2008.
[20]
J. Rissanen. Modeling by shortest data description. Automatica, 14(5):465--471, 1978.
[21]
H. Shan and A. Banerjee. Bayesian co-clustering. In Data Mining, 2008. ICDM'08. Eighth IEEE International Conference on, pages 530--539. IEEE, 2008.
[22]
M. Turk and A. Pentland. Face recognition using eigenfaces. In Proceedings CVPR, pages 586--591, Hawaii, June 1991.
[23]
J. Vreeken, M. Leeuwen, and A. Siebes. Krimp: mining itemsets that compress. Data Mining and Knowledge Discovery, 23(1):169--214, 2011.
[24]
P. Wang, K. B. Laskey, C. Domeniconi, and M. I. Jordan. Nonparametric bayesian co-clustering ensembles. In SDM, pages 331--342. SIAM, 2011.

Cited By

View all
  • (2023)A Generative Adversarial Approach with Social Relationship for Recommender SystemsProceedings of the 2023 International Conference on Power, Communication, Computing and Networking Technologies10.1145/3630138.3630424(1-5)Online publication date: 24-Sep-2023
  • (2023)NeuKron: Constant-Size Lossy Compression of Sparse Reorderable Matrices and TensorsProceedings of the ACM Web Conference 202310.1145/3543507.3583226(71-81)Online publication date: 30-Apr-2023
  • (2022)Destructure-and-restructure matrix approximationInformation Sciences: an International Journal10.1016/j.ins.2019.11.025514:C(434-448)Online publication date: 21-Apr-2022
  • Show More Cited By

Index Terms

  1. ACCAMS: Additive Co-Clustering to Approximate Matrices Succinctly

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        WWW '15: Proceedings of the 24th International Conference on World Wide Web
        May 2015
        1460 pages
        ISBN:9781450334693

        Sponsors

        • IW3C2: International World Wide Web Conference Committee

        In-Cooperation

        Publisher

        International World Wide Web Conferences Steering Committee

        Republic and Canton of Geneva, Switzerland

        Publication History

        Published: 18 May 2015

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. clustering
        2. collaborative filtering
        3. recommender systems

        Qualifiers

        • Research-article

        Funding Sources

        • National Science Foundation
        • Google Inc.
        • Facebook Inc.

        Conference

        WWW '15
        Sponsor:
        • IW3C2

        Acceptance Rates

        WWW '15 Paper Acceptance Rate 131 of 929 submissions, 14%;
        Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)9
        • Downloads (Last 6 weeks)1
        Reflects downloads up to 30 Aug 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2023)A Generative Adversarial Approach with Social Relationship for Recommender SystemsProceedings of the 2023 International Conference on Power, Communication, Computing and Networking Technologies10.1145/3630138.3630424(1-5)Online publication date: 24-Sep-2023
        • (2023)NeuKron: Constant-Size Lossy Compression of Sparse Reorderable Matrices and TensorsProceedings of the ACM Web Conference 202310.1145/3543507.3583226(71-81)Online publication date: 30-Apr-2023
        • (2022)Destructure-and-restructure matrix approximationInformation Sciences: an International Journal10.1016/j.ins.2019.11.025514:C(434-448)Online publication date: 21-Apr-2022
        • (2021)Bayesian Additive Matrix Approximation for Social RecommendationACM Transactions on Knowledge Discovery from Data10.1145/345139116:1(1-34)Online publication date: 20-Jul-2021
        • (2021)Mixture Matrix Approximation for Collaborative FilteringIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2019.295510033:6(2640-2653)Online publication date: 1-Jun-2021
        • (2021)A novel Collaborative Filtering recommendation approach based on Soft Co-ClusteringPhysica A: Statistical Mechanics and its Applications10.1016/j.physa.2020.125140561(125140)Online publication date: Jan-2021
        • (2020)AutoConjunction: Adaptive Model-based Feature Conjunction for CTR Prediction2020 21st IEEE International Conference on Mobile Data Management (MDM)10.1109/MDM48529.2020.00043(202-209)Online publication date: Jun-2020
        • (2020)Financial product recommendation system based on transformer2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC)10.1109/ITNEC48623.2020.9084812(2547-2551)Online publication date: Jun-2020
        • (2020)Research Progress of Trust Evaluation2020 13th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)10.1109/CISP-BMEI51763.2020.9263498(1081-1086)Online publication date: 17-Oct-2020
        • (2020)Local matrix approximation via automatic anchor selection and asymmetric neighbor inclusion based on feature divergence measureNeurocomputing10.1016/j.neucom.2019.12.025383:C(368-379)Online publication date: 28-Mar-2020
        • Show More Cited By

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media