short-paper

Accurate Embedding-based Log Determinant Optimization

Authors:

Byungkon KangAuthors Info & Claims

CIKM '24: Proceedings of the 33rd ACM International Conference on Information and Knowledge Management

Pages 3747 - 3751

https://doi.org/10.1145/3627673.3679871

Published: 21 October 2024 Publication History

Abstract

Many tangible and intangible objects are represented as itemsets; i.e., composition of individual items. In this paper, we address the problem of finding the embedding of such items so as to use those embeddings in tasks like missing item prediction. We approach this problem by means of determinantal point process (DPP) in order to reflect the diversity within each set. Doing so requires an optimization of a log determinant of a symmetric positive definite (SPD) matrix. The standard practice to achieve this is to perform a low-rank decomposition of the matrix and derive update rules for the low rank matrix. In this work, we propose to approach this problem by means of item embedding. That is, we will learn the SPD matrix by trying to find the right vector representations for the given data for a fixed kernel function. To this end, we propose a novel algorithm to accurately compute the gradients of the log determinant with respect to the embedding vectors. We also show that our approach outperforms Autodiff-based learning in terms of gradient direction and running time, and that other general log determinant optimization problems can be addressed.

References

[1]

Idan Achituve, Gal Chechik, and Ethan Fetaya. 2023. Guided deep kernel learning. In Proceedings of UAI.

[2]

Akshay Agrawal, Brandon Amos, Shane Barratt, Stephen Boyd, Steven Diamond, and Zico Kolter. 2019. Differentiable convex optimization layers. In Proceedings of NeurIPS.

[3]

Shaojie Bai, J. Zico Kolter, and Vladlen Koltun. 2019. Deep equilibrium models. In Proceedings of NeurIPS.

[4]

H. M. Bücker and G. F. Corliss. 2005. A bibliography on automatic differentiation. In Automatic Differentiation: Applications, Theory, and Implementations. Lecture Notes in Computational Science and Engineering. Vol. 50. H. M. Bücker, G. F. Corliss, P. D. Hovland, U. Naumann, and B. Norris, (Eds.) Springer, New York, NY, 321--322.

[5]

Anoop Cherian, Panagiotis Stanitsas, Jue Wang, Mehrtash Harandi, Vassilios Morellas, and Nikolaos Papanikolopoulos. 2022. Learning log-determinant divergences for positive definite matrices. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, 9.

[6]

Jacob R Gardner, Geoff Pleiss, David Bindel, Kilian Q Weinberger, and Andrew Gordon Wilson. 2018. GPyTorch: blackbox matrix-matrix gaussian process inference with GPU acceleration. In Proceedings of NeurIPS.

[7]

Mike Gartell, Ulrich Paquet, and Noam Koenigstein. 2017. Low-rank factorization of determinantal point processes. In Proceedings of AAAI.

[8]

Mike Gartrell, Ulrich Paquet, and Noam Koenigstein. 2016. Low-rank factorization of determinantal point processes for recommendation. (2016). arXiv: 1602.05436 [stat.ML].

[9]

J. Gillenwater, A. Kulesza, and B. Taskar. 2012. Near-optimal MAP inference for determinantal point processes. In Proceedings of NIPS.

[10]

Michael Gutmann and Aapo Hyvärinen. 2010. Noise-contrastive estimation: a new estimation principle for unnormalized statistical models. In Proceedings of AISTATS.

[11]

Xintong Han, Zuxuan Wu, Yu-Gang Jiang, and Larry S Davis. 2017. Learning fashion compatibility with bidirectional LSTMs. In Proceedings of ACM MM.

Digital Library

[12]

Zhiwu Huang, Ruiping Wang, Shiguang Shan, Xianqiu Li, and Xilin Chen. 2015. Log-Euclidean metric learning on symmetric positive definite manifold with application to image set classification. In Proceedings of ICML.

[13]

Markelle Kelly, Rachel Longjohn, and Kolby Nottingham. 1993. UCI machine learning repository. https://archive.ics.uci.edu. (1993).

[14]

A. Kulesza and B. Taskar. 2011. Learning determinantal point processes. In Proceedings of UAI.

[15]

Po-ling Loh and Martin J. Wainwright. 2012. Structure estimation for discrete graphical models: generalized covariance matrices and their inverses. In Proceedings of NIPS.

[16]

Zelda Mariet, Mike Gartrell, and Suvrit Sra. 2019. Learning determinantal point processes by corrective negative sampling. In Proceedings of AISTATS.

[17]

Zelda Mariet, Yaniv Ovadia, and Jasper Snoek. 2019. DppNet: Approximating Determinantal Point Processes with Deep Networks. (2019). eprint: arXiv:1901.02051.

[18]

Takayuki Osogami and Rudy Raymond. 2019. Determinantal reinforcement learning. In Proceedings of AAAI.

Digital Library

[19]

Adam Paszke et al. 2019. Pytorch: an imperative style, high-performance deep learning library. In Proceedings of NeurIPS.

[20]

Hao Peng, Sam Thomson, and Noah A. Smith. 2018. Backpropagating through structured argmax using a SPIGOT. In Proceedings of ACL.

[21]

K. B. Petersen and M. S. Pedersen. 2012. The matrix cookbook. Version 20121115. (2012). http://www2.compute.dtu.dk/pubdb/pubs/3274-full.html.

[22]

Carl Edward Rasmussen and Christopher K. I. Williams. 2005. Gaussian Processes for Machine Learning. MIT Press. isbn: 9780262256834.

Digital Library

[23]

Mohit Sharma, F. Maxwell Harper, and George Karypis. 2019. Learning from sets of items in recommender systems. ACM Transactions of Interactive Intelligent Systems, 1, 1.

Digital Library

[24]

Andrew Gordon Wilson, Zhiting Hu, Ruslan Salakhutdinov, and Eric P. Xing. 2016. Deep kernel learning. In Proceedings of AISTATS.

[25]

Andrew Gordon Wilson and Hannes Nickisch. 2015. Kernel interpolation for scalable structured Gaussian processes (KISS-GP). In Proceedings of ICML.

[26]

Yaodong Yang, Ying Wen, Liheng Chen, Jun Wang, Kun Shao, David Mguni, and Weinan Zhang. 2020. Multi-agent determinantal Q-learning. In Proceedings of ICML.

[27]

Qi Zhao, Yongfeng Zhang, Yi Zhang, and Daniel Friedman. 2017. Multi-product utility maximization for economic recommendation. In Proceedings of WSDM.

Digital Library

Index Terms

Accurate Embedding-based Log Determinant Optimization
1. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Kernel methods
      2. Learning latent representations
2. Mathematics of computing
  1. Mathematical analysis
    1. Mathematical optimization
      1. Continuous optimization

Recommendations

Matrix Classes That Generate All Matrices with Positive Determinant

New factorization results dealing mainly with P-matrices and M-matrices are presented. It is proved that any matrix in $M_n (\mathbb{R})$ with positive determinant can be written as the product of three P-matrices (compared with the classical result ...
Accurate Eigenvalues and SVDs of Totally Nonnegative Matrices

We consider the class of totally nonnegative (TN) matrices---matrices all of whose minors are nonnegative. Any nonsingular TN matrix factors as a product of nonnegative bidiagonal matrices. The entries of the bidiagonal factors parameterize the set of ...
Parallel Sparse Approximate Inverse Preconditioning on Graphic Processing Units

Accelerating numerical algorithms for solving sparse linear systems on parallel architectures has attracted the attention of many researchers due to their applicability to many engineering and scientific problems. The solution of sparse systems often ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

CIKM '24: Proceedings of the 33rd ACM International Conference on Information and Knowledge Management

October 2024

5705 pages

ISBN:9798400704369

DOI:10.1145/3627673

General Chairs:
Edoardo Serra
Boise State University, USA
,
Francesca Spezzano
Boise State University, USA

Copyright © 2024 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 October 2024

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Funding Sources

Institute of Information & Communications Technology Planning & Evaluation

Conference

CIKM '24

Sponsor:

SIGIR

CIKM '24: The 33rd ACM International Conference on Information and Knowledge Management

October 21 - 25, 2024

ID, Boise, USA

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Sponsor:
sigir
sigir

The 34th ACM International Conference on Information and Knowledge Management

November 10 - 14, 2025

Seoul , Republic of Korea

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
35
Total Downloads

Downloads (Last 12 months)35
Downloads (Last 6 weeks)1

Reflects downloads up to 28 Jan 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten