Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels

Zhang, Zhilu; Sabuncu, Mert R.

Computer Science > Machine Learning

arXiv:1805.07836 (cs)

[Submitted on 20 May 2018 (v1), last revised 29 Nov 2018 (this version, v4)]

Title:Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels

Authors:Zhilu Zhang, Mert R. Sabuncu

View PDF

Abstract:Deep neural networks (DNNs) have achieved tremendous success in a variety of applications across many disciplines. Yet, their superior performance comes with the expensive cost of requiring correctly annotated large-scale datasets. Moreover, due to DNNs' rich capacity, errors in training labels can hamper performance. To combat this problem, mean absolute error (MAE) has recently been proposed as a noise-robust alternative to the commonly-used categorical cross entropy (CCE) loss. However, as we show in this paper, MAE can perform poorly with DNNs and challenging datasets. Here, we present a theoretically grounded set of noise-robust loss functions that can be seen as a generalization of MAE and CCE. Proposed loss functions can be readily applied with any existing DNN architecture and algorithm, while yielding good performance in a wide range of noisy label scenarios. We report results from experiments conducted with CIFAR-10, CIFAR-100 and FASHION-MNIST datasets and synthetically generated noisy labels.

Comments:	32nd Conference on Neural Information Processing Systems (NeurIPS 2018)
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:1805.07836 [cs.LG]
	(or arXiv:1805.07836v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1805.07836

Submission history

From: Zhilu Zhang [view email]
[v1] Sun, 20 May 2018 23:01:49 UTC (239 KB)
[v2] Tue, 19 Jun 2018 01:37:13 UTC (239 KB)
[v3] Fri, 26 Oct 2018 14:44:20 UTC (242 KB)
[v4] Thu, 29 Nov 2018 22:41:40 UTC (242 KB)

Computer Science > Machine Learning

Title:Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators