Optimizing F-measure: A Tale of Two Approaches

Nan, Ye; Chai, Kian Ming; Lee, Wee Sun; Chieu, Hai Leong

arXiv:1206.4625 (cs)

[Submitted on 18 Jun 2012]

Authors: Ye Nan (NUS), Kian Ming Chai (DSO National Laboratories), Wee Sun Lee (NUS), Hai Leong Chieu (DSO National Laboratories)

Abstract: F-measures are popular performance metrics, particularly for tasks with imbalanced data sets. Algorithms for learning to maximize F-measures follow two approaches: the empirical utility maximization (EUM) approach learns a classifier having optimal performance on training data, while the decision-theoretic approach learns a probabilistic model and then predicts labels with maximum expected F-measure. In this paper, we investigate the theoretical justifications and connections for these two approaches, and we study the conditions under which one approach is preferable to the other using synthetic and real datasets. Our results suggest that, given accurate models and large training and test sets, the two approaches are asymptotically equivalent. Nevertheless, empirically, the EUM approach appears to be more robust against model misspecification, while, given a good model, the decision-theoretic approach appears to be better for handling rare classes and a common domain adaptation scenario.
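
To make the contrast between the two approaches concrete, the following is a minimal sketch and not the authors' algorithms. It assumes binary labels, the F1 instance of the F-measure, labels that are independent given their predicted probabilities, and the convention that F1 equals 1 when both the true and predicted label vectors are all-negative. The function names (f1, eum_threshold, expected_f1, dta_predict) are hypothetical, and the decision-theoretic step uses brute-force enumeration that is only feasible for a handful of instances.

```python
from itertools import product


def f1(y_true, y_pred):
    """F1 = 2*TP / (#true positives labels + #predicted positives).

    Assumed convention: F1 is 1.0 when both vectors are all-negative.
    """
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    denom = sum(y_true) + sum(y_pred)
    return 1.0 if denom == 0 else 2.0 * tp / denom


def eum_threshold(scores, y_train):
    """EUM-style sketch: pick the score threshold with the best F1 on training data."""
    best_t, best_f = None, -1.0
    for t in sorted(set(scores)):
        pred = [1 if s >= t else 0 for s in scores]
        f = f1(y_train, pred)
        if f > best_f:
            best_t, best_f = t, f
    return best_t, best_f


def expected_f1(probs, y_pred):
    """Exact expected F1 of y_pred under independent Bernoulli labels (small n only)."""
    total = 0.0
    for y in product([0, 1], repeat=len(probs)):
        weight = 1.0
        for p, yi in zip(probs, y):
            weight *= p if yi == 1 else 1.0 - p
        total += weight * f1(y, y_pred)
    return total


def dta_predict(probs):
    """Decision-theoretic sketch: return the label vector with maximum expected F1."""
    candidates = product([0, 1], repeat=len(probs))
    best = max(candidates, key=lambda y_pred: expected_f1(probs, y_pred))
    return best, expected_f1(probs, best)
```

With the toy probabilities [0.4, 0.3, 0.1], dta_predict labels the first two instances positive even though every probability is below 0.5, which illustrates why predicting for maximum expected F-measure can differ from thresholding probabilities at 0.5, particularly when positives are rare.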

Comments:	ICML2012
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1206.4625 [cs.LG]
	(or arXiv:1206.4625v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1206.4625

Submission history

From: Ye Nan [via ICML2012 proxy]
[v1] Mon, 18 Jun 2012 15:07:04 UTC (378 KB)
