Doubly Non-Central Beta Matrix Factorization for Stable Dimensionality Reduction of Bounded Support Matrix Data

Albert, Anjali N.; Flaherty, Patrick; Schein, Aaron

Computer Science > Machine Learning

arXiv:2410.18425 (cs)

[Submitted on 24 Oct 2024]

Title:Doubly Non-Central Beta Matrix Factorization for Stable Dimensionality Reduction of Bounded Support Matrix Data

Authors:Anjali N. Albert, Patrick Flaherty, Aaron Schein

View PDF HTML (experimental)

Abstract:We consider the problem of developing interpretable and computationally efficient matrix decomposition methods for matrices whose entries have bounded support. Such matrices are found in large-scale DNA methylation studies and many other settings. Our approach decomposes the data matrix into a Tucker representation wherein the number of columns in the constituent factor matrices is not constrained. We derive a computationally efficient sampling algorithm to solve for the Tucker decomposition. We evaluate the performance of our method using three criteria: predictability, computability, and stability. Empirical results show that our method has similar performance as other state-of-the-art approaches in terms of held-out prediction and computational complexity, but has significantly better performance in terms of stability to changes in hyper-parameters. The improved stability results in higher confidence in the results in applications where the constituent factors are used to generate and test scientific hypotheses such as DNA methylation analysis of cancer samples.

Comments:	33 pages, 18 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2410.18425 [cs.LG]
	(or arXiv:2410.18425v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.18425

Submission history

From: Patrick Flaherty [view email]
[v1] Thu, 24 Oct 2024 04:24:47 UTC (2,543 KB)

Computer Science > Machine Learning

Title:Doubly Non-Central Beta Matrix Factorization for Stable Dimensionality Reduction of Bounded Support Matrix Data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Doubly Non-Central Beta Matrix Factorization for Stable Dimensionality Reduction of Bounded Support Matrix Data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators