Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1007/978-3-030-91608-4_14guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

A Complexity Measure for Binary Classification Problems Based on Lost Points

Published: 25 November 2021 Publication History

Abstract

Complexity measures are focused on exploring and capturing the complexity of a data set. In this paper, the Lost points (LP) complexity measure is proposed. It is obtained by applying k-means in a recursive and hierarchical way and it provides both the data set and the instance perspective. On the instance level, the LP measure gives a probability value for each point informing about the dominance of its class in its neighborhood. On the data set level, it estimates the proportion of lost points, referring to those points that are expected to be misclassified since they lie in areas where its class is not dominant. The proposed measure shows easily interpretable results competitive with measures from state-of-art. In addition, it provides probabilistic information useful to highlight the boundary decision on classification problems.

References

[1]
Algar, M.J., et al.: A quality of experience management framework for mobile users. Wirel. Commun. Mob. Comput. 2019, 11 (2019). Article ID 2352941
[2]
Arruda JLM, Prudêncio RBC, and Lorena AC Cerri R and Prati RC Measuring instance hardness using data complexity measures Intelligent Systems 2020 Cham Springer 483-497
[3]
Dua, D., Graff, C.: UCI machine learning repository (2017). http://archive.ics.uci.edu/ml
[4]
Garcia, L., Lorena, A.: ECoL: Complexity Measures for Supervised Problems (2019). https://CRAN.R-project.org/package=ECoL, r package version 0.3.0
[5]
Ho TK and Basu M Complexity measures of supervised classification problems IEEE Trans. Pattern Anal. Mach. Intell. 2002 24 3 289-300
[6]
Lorena AC, Garcia LP, Lehmann J, Souto MC, and Ho TK How complex is your classification problem? A survey on measuring classification complexity ACM Comput. Surveys (CSUR) 2019 52 5 1-34
[7]
Oh S A new dataset evaluation method based on category overlap Comput. Biol. Med. 2011 41 2 115-122
[8]
Singh S Prism-a novel framework for pattern recognition Patt. Anal. Appl. 2003 6 2 134-149
[9]
Smith MR, Martinez T, and Giraud-Carrier C An instance level analysis of data complexity Mach. Learn. 2013 95 2 225-256
[10]
Wan S, Zhao Y, Wang T, Gu Z, Abbasi QH, and Choo KKR Multi-dimensional data indexing and range query processing via Voronoi diagram for internet of things Futur. Gener. Comput. Syst. 2019 91 382-391
[11]
Weitzman, M.S.: Measures of overlap of income distributions of white and Negro families in the United States, vol. 22. US Bureau of the Census (1970)

Cited By

View all
  • (2024)Assessing the Effect of Data Complexity and Instance Overlap Issues on Imbalanced LearningProceedings of the 2024 7th International Conference on Big Data and Education10.1145/3704289.3704292(49-56)Online publication date: 24-Sep-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
Intelligent Data Engineering and Automated Learning – IDEAL 2021: 22nd International Conference, IDEAL 2021, Manchester, UK, November 25–27, 2021, Proceedings
Nov 2021
662 pages
ISBN:978-3-030-91607-7
DOI:10.1007/978-3-030-91608-4

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 25 November 2021

Author Tags

  1. Complexity measures
  2. Neighborhood measures
  3. Binary classification
  4. Supervised machine learning

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Assessing the Effect of Data Complexity and Instance Overlap Issues on Imbalanced LearningProceedings of the 2024 7th International Conference on Big Data and Education10.1145/3704289.3704292(49-56)Online publication date: 24-Sep-2024

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media