Wasserstein Discriminant Analysis

Flamary, Rémi; Cuturi, Marco; Courty, Nicolas; Rakotomamonjy, Alain

doi:10.1007/s10994-018-5717-1

Statistics > Machine Learning

arXiv:1608.08063 (stat)

[Submitted on 29 Aug 2016 (v1), last revised 23 May 2018 (this version, v2)]

Title:Wasserstein Discriminant Analysis

Authors:Rémi Flamary, Marco Cuturi, Nicolas Courty, Alain Rakotomamonjy

View PDF

Abstract:Wasserstein Discriminant Analysis (WDA) is a new supervised method that can improve classification of high-dimensional data by computing a suitable linear map onto a lower dimensional subspace. Following the blueprint of classical Linear Discriminant Analysis (LDA), WDA selects the projection matrix that maximizes the ratio of two quantities: the dispersion of projected points coming from different classes, divided by the dispersion of projected points coming from the same class. To quantify dispersion, WDA uses regularized Wasserstein distances, rather than cross-variance measures which have been usually considered, notably in LDA. Thanks to the the underlying principles of optimal transport, WDA is able to capture both global (at distribution scale) and local (at samples scale) interactions between classes. Regularized Wasserstein distances can be computed using the Sinkhorn matrix scaling algorithm; We show that the optimization of WDA can be tackled using automatic differentiation of Sinkhorn iterations. Numerical experiments show promising results both in terms of prediction and visualization on toy examples and real life datasets such as MNIST and on deep features obtained from a subset of the Caltech dataset.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1608.08063 [stat.ML]
	(or arXiv:1608.08063v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1608.08063
Related DOI:	https://doi.org/10.1007/s10994-018-5717-1

Submission history

From: Remi Flamary [view email]
[v1] Mon, 29 Aug 2016 14:18:40 UTC (608 KB)
[v2] Wed, 23 May 2018 08:42:15 UTC (3,120 KB)

Statistics > Machine Learning

Title:Wasserstein Discriminant Analysis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Wasserstein Discriminant Analysis

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators