Stochastic EM for Shuffled Linear Regression

Abid, Abubakar; Zou, James

Statistics > Machine Learning

arXiv:1804.00681 (stat)

[Submitted on 2 Apr 2018]

Title:Stochastic EM for Shuffled Linear Regression

Authors:Abubakar Abid, James Zou

View PDF

Abstract:We consider the problem of inference in a linear regression model in which the relative ordering of the input features and output labels is not known. Such datasets naturally arise from experiments in which the samples are shuffled or permuted during the protocol. In this work, we propose a framework that treats the unknown permutation as a latent variable. We maximize the likelihood of observations using a stochastic expectation-maximization (EM) approach. We compare this to the dominant approach in the literature, which corresponds to hard EM in our framework. We show on synthetic data that the stochastic EM algorithm we develop has several advantages, including lower parameter error, less sensitivity to the choice of initialization, and significantly better performance on datasets that are only partially shuffled. We conclude by performing two experiments on real datasets that have been partially shuffled, in which we show that the stochastic EM algorithm can recover the weights with modest error.

Comments:	11 pages, 5 figures
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1804.00681 [stat.ML]
	(or arXiv:1804.00681v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1804.00681

Submission history

From: Abubakar Abid [view email]
[v1] Mon, 2 Apr 2018 18:13:49 UTC (1,248 KB)

Statistics > Machine Learning

Title:Stochastic EM for Shuffled Linear Regression

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Stochastic EM for Shuffled Linear Regression

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators