Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1830483.1830674acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

In search of targeted-complexity problems

Published: 07 July 2010 Publication History

Abstract

Currently available real-world problems do not cover the whole complexity space and, therefore, do not allow us to thoroughly test learner behavior on the border of its domain of competence. Thus, the necessity of developing a more suitable testing scenario arises. With this in mind, data complexity analysis has shown promise in characterizing difficulty of classification problems through a set of complexity descriptors which used in artificial data sets generation could supply the required framework to refine and design learners. This paper, then, proposes the use of instance selection based on an evolutionary multiobjective technique to generate data sets that meet specific characteristics established by such complexity descriptors. These artificial targeted-complexity problems, which capture the essence of real-world structures, may help to define a set of benchmarks that contributes to test the properties of learners and to improve them.

References

[1]
A. Asuncion and D. J. Newman. UCI Machine Learning Repository. {http://www.ics.uci.edu/~mlearn/MLRepository.html}. University of California, 2007.
[2]
M. Basu and T. K. Ho, editors. Data Complexity in Pattern Recognition. Springer-Verlag, 2006.
[3]
E. Bernadó-Mansilla, T. K. Ho, and A. Orriols-Puig. Data complexity and evolutionary learning: Classifier's behavior and domain of competence, pages 115--134. Springer, 2006.
[4]
C. A. Coello, G. B. Lamont, and D. A. V. Veldhuizen. Evolutionary algorithms for solving multi-objective problems. Springer, New York, 2nd edition, 2007.
[5]
K. Deb, A. Pratap, S. Agarwal, and T. Meyarivan. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE TEC, 6(2):182--197, 2002.
[6]
D. E. Goldberg. The design of innovation: Lessons from and for competent genetic algorithms. Kluwer Academic Publishers, 1 edition, 2002.
[7]
T. K. Ho. Data complexity analysis for classifier combination. In Proceedings of the 2nd Int. Workshop on MCS, pages 53--67. Springer-Verlag, 2001.
[8]
T. K. Ho and M. Basu. Measuring the complexity of classification problems. In Proceedings of the 15th Int. Conference on PR, pages 43--47, 2000.
[9]
T. K. Ho and M. Basu. Complexity measures of supervised classification problems. IEEE Transactions on PAMI, 24(3):289--300, 2002.
[10]
E. J. Hughes. Evolutionary many-objective optimisation: many once or one many? In Congress on Evolutionary Computation, pages 222--227, 2005.
[11]
H. Ishibuchi, N. Tsukamoto, and Y. Nojima. Evolutionary many-objective optimization: A short review. In IEEE CEC, pages 2419--2426, 2008.
[12]
N. Macià, A. Orriols-Puig, and E. Bernadó-Mansilla. EMO shines a light on the holes of complexity space. In GECCO, pages 1907--1908, 2009.
[13]
I. H. Witten and E. Frank. Data mining: Practical machine learning tools and techniques. Morgan Kaufmann, San Francisco, 2nd edition, 2005.

Cited By

View all
  • (2022)A Many-Objective Optimization Approach to Generate Synthetic Datasets based on Real-World Classification Problems2022 IEEE Latin American Conference on Computational Intelligence (LA-CCI)10.1109/LA-CCI54402.2022.9981848(1-6)Online publication date: 23-Nov-2022
  • (2022)Dynamic selection of classifiers based on complexity measures2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI)10.1109/ICTAI56018.2022.00021(82-89)Online publication date: Oct-2022
  • (2020)A Many-Objective optimization Approach for Complexity-based Data set Generation2020 IEEE Congress on Evolutionary Computation (CEC)10.1109/CEC48606.2020.9185543(1-8)Online publication date: Jul-2020
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
GECCO '10: Proceedings of the 12th annual conference on Genetic and evolutionary computation
July 2010
1520 pages
ISBN:9781450300728
DOI:10.1145/1830483
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 July 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. artificial data sets
  2. data complexity
  3. evolutionary multiobjective optimization

Qualifiers

  • Research-article

Conference

GECCO '10
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)1
Reflects downloads up to 17 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2022)A Many-Objective Optimization Approach to Generate Synthetic Datasets based on Real-World Classification Problems2022 IEEE Latin American Conference on Computational Intelligence (LA-CCI)10.1109/LA-CCI54402.2022.9981848(1-6)Online publication date: 23-Nov-2022
  • (2022)Dynamic selection of classifiers based on complexity measures2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI)10.1109/ICTAI56018.2022.00021(82-89)Online publication date: Oct-2022
  • (2020)A Many-Objective optimization Approach for Complexity-based Data set Generation2020 IEEE Congress on Evolutionary Computation (CEC)10.1109/CEC48606.2020.9185543(1-8)Online publication date: Jul-2020
  • (2019)How Complex Is Your Classification Problem?ACM Computing Surveys10.1145/334771152:5(1-34)Online publication date: 13-Sep-2019
  • (2019)Evolving controllably difficult datasets for clusteringProceedings of the Genetic and Evolutionary Computation Conference10.1145/3321707.3321761(463-471)Online publication date: 13-Jul-2019
  • (2018)Instance spaces for machine learning classificationMachine Language10.1007/s10994-017-5629-5107:1(109-147)Online publication date: 1-Jan-2018
  • (2013)Learner excellence biased by data set selectionPattern Recognition10.1016/j.patcog.2012.09.02246:3(1054-1066)Online publication date: 1-Mar-2013
  • (2011)Toward quantitative definition of explanation ability of fuzzy rule-based classifiers2011 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2011)10.1109/FUZZY.2011.6007738(549-556)Online publication date: Jun-2011
  • (2010)The landscape contest at ICPR 2010Proceedings of the 20th International conference on Recognizing patterns in signals, speech, images, and videos10.5555/1939170.1939177(29-45)Online publication date: 23-Aug-2010
  • (2010)The Landscape Contest at ICPR 2010Recognizing Patterns in Signals, Speech, Images and Videos10.1007/978-3-642-17711-8_4(29-45)Online publication date: 2010

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media