Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1081870.1081959acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Regression error characteristic surfaces

Published: 21 August 2005 Publication History

Abstract

This paper presents a generalization of Regression Error Characteristic (REC) curves. REC curves describe the cumulative distribution function of the prediction error of models and can be seen as a generalization of ROC curves to regression problems. REC curves provide useful information for analyzing the performance of models, particularly when compared to error statistics like for instance the Mean Squared Error. In this paper we present Regression Error Characteristic (REC) surfaces that introduce a further degree of detail by plotting the cumulative distribution function of the errors across the distribution of the target variable, i.e. the joint cumulative distribution function of the errors and the target variable. This provides a more detailed analysis of the performance of models when compared to REC curves. This extra detail is particularly relevant in applications with non-uniform error costs, where it is important to study the performance of models for specific ranges of the target variable. In this paper we present the notion of REC surfaces, describe how to use them to compare the performance of models, and illustrate their use with an important practical class of applications: the prediction of rare extreme values.

References

[1]
J. Bi and K. P. Bennett. Regression error characteristic curves. In Proceedings of the 20th International Conference on Machine Learning, 2003.
[2]
J. P. Egan. Signal Detection Theory and ROC Analysis. Series in Cognition and Perception. Academic Press, 1975.
[3]
T. Fawcett. Roc graphs: Notes and practical considerations for data mining researchers. Technical Report HPL-2003-4, Hewlett Packard, 2003.
[4]
F. Provost, T. Fawcett, and R. Kohavi. The case against accuracy estimation for comparing induction algorithms. In Proc. 15th International Conf. on Machine Learning, pages 445--453. Morgan Kaufmann, San Francisco, CA, 1998.
[5]
R Development Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, 2004. ISBN 3-900051-07-0.
[6]
R. Ribeiro and L. Torgo. Predicting harmful algae blooms. In F. M. Pires and S. Abreu, editors, Proceedings of Portuguese AI Conference (EPIA'03), number 2902 in LNAI, pages 308--312. Springer, 2003.
[7]
L. Torgo and R. Ribeiro. Predicting outliers. In N. Lavrac, D. Gamberger, L. Todorovski, and H. Blockeel, editors, Proceedings of Principles of Data Mining and Knowledge Discovery (PKDD'03), number 2838 in LNAI, pages 447--458. Springer, 2003.

Cited By

View all
  • (2023)MAEPI and CIR: New Metrics for Robust Evaluation of the Prediction Performance of AI-Based IOL FormulasTranslational Vision Science & Technology10.1167/tvst.12.3.2912:3(29)Online publication date: 28-Mar-2023
  • (2023)ASER: Adapted squared error relevance for rare cases prediction in imbalanced regressionJournal of Chemometrics10.1002/cem.351537:11Online publication date: 8-Sep-2023
  • (2022)Subgroup mining for performance analysis of regression modelsExpert Systems10.1111/exsy.1311840:1Online publication date: 9-Aug-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '05: Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
August 2005
844 pages
ISBN:159593135X
DOI:10.1145/1081870
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 August 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. evaluation metrics
  2. model comparisons
  3. regression problems

Qualifiers

  • Article

Conference

KDD05

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '24

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)1
Reflects downloads up to 18 Aug 2024

Other Metrics

Citations

Cited By

View all
  • (2023)MAEPI and CIR: New Metrics for Robust Evaluation of the Prediction Performance of AI-Based IOL FormulasTranslational Vision Science & Technology10.1167/tvst.12.3.2912:3(29)Online publication date: 28-Mar-2023
  • (2023)ASER: Adapted squared error relevance for rare cases prediction in imbalanced regressionJournal of Chemometrics10.1002/cem.351537:11Online publication date: 8-Sep-2023
  • (2022)Subgroup mining for performance analysis of regression modelsExpert Systems10.1111/exsy.1311840:1Online publication date: 9-Aug-2022
  • (2021)A novel cost‐sensitive algorithm and new evaluation strategies for regression in imbalanced domainsExpert Systems10.1111/exsy.1268038:4Online publication date: 28-Feb-2021
  • (2020)Visual interpretation of regression errorExpert Systems10.1111/exsy.1262137:6Online publication date: 13-Aug-2020
  • (2020)Imbalanced regression and extreme value predictionMachine Learning10.1007/s10994-020-05900-9Online publication date: 4-Sep-2020
  • (2019)Software Cost EstimationInternational Journal of Service Science, Management, Engineering, and Technology10.4018/IJSSMET.201907010210:3(14-31)Online publication date: Jul-2019
  • (2019)Explaining the Performance of Black Box Regression Models2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA)10.1109/DSAA.2019.00025(110-118)Online publication date: Oct-2019
  • (2019)Visual Interpretation of Regression ErrorProgress in Artificial Intelligence10.1007/978-3-030-30244-3_39(473-485)Online publication date: 30-Aug-2019
  • (2017)Selecting cash management models from a multiobjective perspectiveAnnals of Operations Research10.1007/s10479-017-2634-9261:1-2(275-288)Online publication date: 6-Sep-2017
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media