Article

Regression error characteristic surfaces

Author:

Luís TorgoAuthors Info & Claims

KDD '05: Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining

Pages 697 - 702

https://doi.org/10.1145/1081870.1081959

Published: 21 August 2005 Publication History

Get Access

Abstract

This paper presents a generalization of Regression Error Characteristic (REC) curves. REC curves describe the cumulative distribution function of the prediction error of models and can be seen as a generalization of ROC curves to regression problems. REC curves provide useful information for analyzing the performance of models, particularly when compared to error statistics like for instance the Mean Squared Error. In this paper we present Regression Error Characteristic (REC) surfaces that introduce a further degree of detail by plotting the cumulative distribution function of the errors across the distribution of the target variable, i.e. the joint cumulative distribution function of the errors and the target variable. This provides a more detailed analysis of the performance of models when compared to REC curves. This extra detail is particularly relevant in applications with non-uniform error costs, where it is important to study the performance of models for specific ranges of the target variable. In this paper we present the notion of REC surfaces, describe how to use them to compare the performance of models, and illustrate their use with an important practical class of applications: the prediction of rare extreme values.

References

[1]

J. Bi and K. P. Bennett. Regression error characteristic curves. In Proceedings of the 20th International Conference on Machine Learning, 2003.

Google Scholar

[2]

J. P. Egan. Signal Detection Theory and ROC Analysis. Series in Cognition and Perception. Academic Press, 1975.

Google Scholar

[3]

T. Fawcett. Roc graphs: Notes and practical considerations for data mining researchers. Technical Report HPL-2003-4, Hewlett Packard, 2003.

Google Scholar

[4]

F. Provost, T. Fawcett, and R. Kohavi. The case against accuracy estimation for comparing induction algorithms. In Proc. 15th International Conf. on Machine Learning, pages 445--453. Morgan Kaufmann, San Francisco, CA, 1998.

Digital Library

Google Scholar

[5]

R Development Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, 2004. ISBN 3-900051-07-0.

Google Scholar

[6]

R. Ribeiro and L. Torgo. Predicting harmful algae blooms. In F. M. Pires and S. Abreu, editors, Proceedings of Portuguese AI Conference (EPIA'03), number 2902 in LNAI, pages 308--312. Springer, 2003.

Google Scholar

[7]

L. Torgo and R. Ribeiro. Predicting outliers. In N. Lavrac, D. Gamberger, L. Todorovski, and H. Blockeel, editors, Proceedings of Principles of Data Mining and Knowledge Discovery (PKDD'03), number 2838 in LNAI, pages 447--458. Springer, 2003.

Google Scholar

Cited By

View all

Li TStein JNallasamy N(2023)MAEPI and CIR: New Metrics for Robust Evaluation of the Prediction Performance of AI-Based IOL FormulasTranslational Vision Science & Technology10.1167/tvst.12.3.2912:3(29)Online publication date: 28-Mar-2023
https://doi.org/10.1167/tvst.12.3.29
Kou YFu G(2023)ASER: Adapted squared error relevance for rare cases prediction in imbalanced regressionJournal of Chemometrics10.1002/cem.351537:11Online publication date: 8-Sep-2023
https://doi.org/10.1002/cem.3515
Pimentel JAzevedo PTorgo L(2022)Subgroup mining for performance analysis of regression modelsExpert Systems10.1111/exsy.1311840:1Online publication date: 9-Aug-2022
https://doi.org/10.1111/exsy.13118
Show More Cited By

Index Terms

Regression error characteristic surfaces

Recommendations

Regression error characteristic curves
ICML'03: Proceedings of the Twentieth International Conference on International Conference on Machine Learning

Receiver Operating Characteristic (ROC) curves provide a powerful tool for visualizing and comparing classification results. Regression Error Characteristic (REC) curves generalize ROC curves to regression. REC curves plot the error tolerance on the x-...
Error measures for fuzzy linear regression

HighlightsThe study covers different error measures that have not previously calculated for Monte Carlo study in fuzzy linear regression models.We obtain the most useful and the worst error measures to estimate fuzzy regression parameters without using ...
Parametric Bernstein/Bezier Curves and Tensor Product Surfaces

Comments

Information & Contributors

Information

Published In

KDD '05: Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining

August 2005

844 pages

ISBN:159593135X

DOI:10.1145/1081870

General Chair:
Robert Grossman
University of Illinois at Chicago & Open Data Partners, USA
,
Program Chairs:
Roberto Bayardo
IBM Almaden Research, USA
,
Kristin Bennett
RPI, USA

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 August 2005

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Article

Conference

KDD05

Sponsor:

KDD05: The Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 21 - 24, 2005

Illinois, Chicago, USA

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '24

Sponsor:
sigkdd
sigkdd

The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 25 - 29, 2024

Barcelona , Spain

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

20
Total Citations
View Citations
398
Total Downloads

Downloads (Last 12 months)5
Downloads (Last 6 weeks)1

Reflects downloads up to 18 Aug 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Li TStein JNallasamy N(2023)MAEPI and CIR: New Metrics for Robust Evaluation of the Prediction Performance of AI-Based IOL FormulasTranslational Vision Science & Technology10.1167/tvst.12.3.2912:3(29)Online publication date: 28-Mar-2023
https://doi.org/10.1167/tvst.12.3.29
Kou YFu G(2023)ASER: Adapted squared error relevance for rare cases prediction in imbalanced regressionJournal of Chemometrics10.1002/cem.351537:11Online publication date: 8-Sep-2023
https://doi.org/10.1002/cem.3515
Pimentel JAzevedo PTorgo L(2022)Subgroup mining for performance analysis of regression modelsExpert Systems10.1111/exsy.1311840:1Online publication date: 9-Aug-2022
https://doi.org/10.1111/exsy.13118
Sadouk LGadi TEssoufi E(2021)A novel cost‐sensitive algorithm and new evaluation strategies for regression in imbalanced domainsExpert Systems10.1111/exsy.1268038:4Online publication date: 28-Feb-2021
https://doi.org/10.1111/exsy.12680
Areosa ITorgo L(2020)Visual interpretation of regression errorExpert Systems10.1111/exsy.1262137:6Online publication date: 13-Aug-2020
https://doi.org/10.1111/exsy.12621
Ribeiro RMoniz N(2020)Imbalanced regression and extreme value predictionMachine Learning10.1007/s10994-020-05900-9Online publication date: 4-Sep-2020
https://doi.org/10.1007/s10994-020-05900-9
Chatzipetrou P(2019)Software Cost EstimationInternational Journal of Service Science, Management, Engineering, and Technology10.4018/IJSSMET.201907010210:3(14-31)Online publication date: Jul-2019
https://doi.org/10.4018/IJSSMET.2019070102
Areosa ITorgo L(2019)Explaining the Performance of Black Box Regression Models2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA)10.1109/DSAA.2019.00025(110-118)Online publication date: Oct-2019
https://doi.org/10.1109/DSAA.2019.00025
Areosa ITorgo L(2019)Visual Interpretation of Regression ErrorProgress in Artificial Intelligence10.1007/978-3-030-30244-3_39(473-485)Online publication date: 30-Aug-2019
https://doi.org/10.1007/978-3-030-30244-3_39
Salas-Molina FRodriguez-Aguilar JDíaz-García P(2017)Selecting cash management models from a multiobjective perspectiveAnnals of Operations Research10.1007/s10479-017-2634-9261:1-2(275-288)Online publication date: 6-Sep-2017
https://doi.org/10.1007/s10479-017-2634-9
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Cited By

Index Terms

Recommendations

Regression error characteristic curves

Error measures for fuzzy linear regression

Parametric Bernstein/Bezier Curves and Tensor Product Surfaces