Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3627673.3679871acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

Accurate Embedding-based Log Determinant Optimization

Published: 21 October 2024 Publication History

Abstract

Many tangible and intangible objects are represented as itemsets; i.e., composition of individual items. In this paper, we address the problem of finding the embedding of such items so as to use those embeddings in tasks like missing item prediction. We approach this problem by means of determinantal point process (DPP) in order to reflect the diversity within each set. Doing so requires an optimization of a log determinant of a symmetric positive definite (SPD) matrix. The standard practice to achieve this is to perform a low-rank decomposition of the matrix and derive update rules for the low rank matrix. In this work, we propose to approach this problem by means of item embedding. That is, we will learn the SPD matrix by trying to find the right vector representations for the given data for a fixed kernel function. To this end, we propose a novel algorithm to accurately compute the gradients of the log determinant with respect to the embedding vectors. We also show that our approach outperforms Autodiff-based learning in terms of gradient direction and running time, and that other general log determinant optimization problems can be addressed.

References

[1]
Idan Achituve, Gal Chechik, and Ethan Fetaya. 2023. Guided deep kernel learning. In Proceedings of UAI.
[2]
Akshay Agrawal, Brandon Amos, Shane Barratt, Stephen Boyd, Steven Diamond, and Zico Kolter. 2019. Differentiable convex optimization layers. In Proceedings of NeurIPS.
[3]
Shaojie Bai, J. Zico Kolter, and Vladlen Koltun. 2019. Deep equilibrium models. In Proceedings of NeurIPS.
[4]
H. M. Bücker and G. F. Corliss. 2005. A bibliography on automatic differentiation. In Automatic Differentiation: Applications, Theory, and Implementations. Lecture Notes in Computational Science and Engineering. Vol. 50. H. M. Bücker, G. F. Corliss, P. D. Hovland, U. Naumann, and B. Norris, (Eds.) Springer, New York, NY, 321--322.
[5]
Anoop Cherian, Panagiotis Stanitsas, Jue Wang, Mehrtash Harandi, Vassilios Morellas, and Nikolaos Papanikolopoulos. 2022. Learning log-determinant divergences for positive definite matrices. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, 9.
[6]
Jacob R Gardner, Geoff Pleiss, David Bindel, Kilian Q Weinberger, and Andrew Gordon Wilson. 2018. GPyTorch: blackbox matrix-matrix gaussian process inference with GPU acceleration. In Proceedings of NeurIPS.
[7]
Mike Gartell, Ulrich Paquet, and Noam Koenigstein. 2017. Low-rank factorization of determinantal point processes. In Proceedings of AAAI.
[8]
Mike Gartrell, Ulrich Paquet, and Noam Koenigstein. 2016. Low-rank factorization of determinantal point processes for recommendation. (2016). arXiv: 1602.05436 [stat.ML].
[9]
J. Gillenwater, A. Kulesza, and B. Taskar. 2012. Near-optimal MAP inference for determinantal point processes. In Proceedings of NIPS.
[10]
Michael Gutmann and Aapo Hyvärinen. 2010. Noise-contrastive estimation: a new estimation principle for unnormalized statistical models. In Proceedings of AISTATS.
[11]
Xintong Han, Zuxuan Wu, Yu-Gang Jiang, and Larry S Davis. 2017. Learning fashion compatibility with bidirectional LSTMs. In Proceedings of ACM MM.
[12]
Zhiwu Huang, Ruiping Wang, Shiguang Shan, Xianqiu Li, and Xilin Chen. 2015. Log-Euclidean metric learning on symmetric positive definite manifold with application to image set classification. In Proceedings of ICML.
[13]
Markelle Kelly, Rachel Longjohn, and Kolby Nottingham. 1993. UCI machine learning repository. https://archive.ics.uci.edu. (1993).
[14]
A. Kulesza and B. Taskar. 2011. Learning determinantal point processes. In Proceedings of UAI.
[15]
Po-ling Loh and Martin J. Wainwright. 2012. Structure estimation for discrete graphical models: generalized covariance matrices and their inverses. In Proceedings of NIPS.
[16]
Zelda Mariet, Mike Gartrell, and Suvrit Sra. 2019. Learning determinantal point processes by corrective negative sampling. In Proceedings of AISTATS.
[17]
Zelda Mariet, Yaniv Ovadia, and Jasper Snoek. 2019. DppNet: Approximating Determinantal Point Processes with Deep Networks. (2019). eprint: arXiv:1901.02051.
[18]
Takayuki Osogami and Rudy Raymond. 2019. Determinantal reinforcement learning. In Proceedings of AAAI.
[19]
Adam Paszke et al. 2019. Pytorch: an imperative style, high-performance deep learning library. In Proceedings of NeurIPS.
[20]
Hao Peng, Sam Thomson, and Noah A. Smith. 2018. Backpropagating through structured argmax using a SPIGOT. In Proceedings of ACL.
[21]
K. B. Petersen and M. S. Pedersen. 2012. The matrix cookbook. Version 20121115. (2012). http://www2.compute.dtu.dk/pubdb/pubs/3274-full.html.
[22]
Carl Edward Rasmussen and Christopher K. I. Williams. 2005. Gaussian Processes for Machine Learning. MIT Press. isbn: 9780262256834.
[23]
Mohit Sharma, F. Maxwell Harper, and George Karypis. 2019. Learning from sets of items in recommender systems. ACM Transactions of Interactive Intelligent Systems, 1, 1.
[24]
Andrew Gordon Wilson, Zhiting Hu, Ruslan Salakhutdinov, and Eric P. Xing. 2016. Deep kernel learning. In Proceedings of AISTATS.
[25]
Andrew Gordon Wilson and Hannes Nickisch. 2015. Kernel interpolation for scalable structured Gaussian processes (KISS-GP). In Proceedings of ICML.
[26]
Yaodong Yang, Ying Wen, Liheng Chen, Jun Wang, Kun Shao, David Mguni, and Weinan Zhang. 2020. Multi-agent determinantal Q-learning. In Proceedings of ICML.
[27]
Qi Zhao, Yongfeng Zhang, Yi Zhang, and Daniel Friedman. 2017. Multi-product utility maximization for economic recommendation. In Proceedings of WSDM.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '24: Proceedings of the 33rd ACM International Conference on Information and Knowledge Management
October 2024
5705 pages
ISBN:9798400704369
DOI:10.1145/3627673
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 October 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. embedding
  2. kernel
  3. log determinant

Qualifiers

  • Short-paper

Funding Sources

  • Institute of Information & Communications Technology Planning & Evaluation

Conference

CIKM '24
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 35
    Total Downloads
  • Downloads (Last 12 months)35
  • Downloads (Last 6 weeks)1
Reflects downloads up to 28 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media