Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3638530.3654208acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
poster

Runtime phylogenetic analysis enables extreme subsampling for test-based problems

Published: 01 August 2024 Publication History

Abstract

A phylogeny describes a population's evolutionary history. Evolutionary search algorithms can perfectly track the ancestry of candidate solutions, illuminating a population's trajectory through the search space. We introduce phylogeny-informed subsampling, a new class of subsampling methods that exploit runtime phylogenetic analyses for solving test-based problems. Specifically, we assess two phylogeny-informed subsampling methods---individualized random subsampling and ancestor-based subsampling---on ten genetic programming (GP) problems from program synthesis benchmark suites. Overall, we find that phylogeny-informed subsampling methods enable problem-solving success at extreme subsampling levels where other subsampling methods fail. For example, phylogeny-informed subsampling methods more reliably solved program synthesis problems when evaluating just one training case per-individual, per-generation. However, at moderate subsampling levels, phylogeny-informed subsampling generally performed no better than random subsampling on GP problems. Continued refinements of phylogeny-informed subsampling techniques offer a promising new direction for scaling up evolutionary systems to handle problems with many expensive-to-evaluate fitness criteria.

References

[1]
Ryan Boldi, Martin Briesch, Dominik Sobania, Alexander Lalejini, Thomas Helmuth, Franz Rothlauf, Charles Ofria, and Lee Spector. 2024. Informed Down-Sampled Lexicase Selection: Identifying Productive Training Cases for Efficient Problem Solving. Evolutionary Computation (March 2024), 1--31.
[2]
Austin J. Ferguson, Jose Guadalupe Hernandez, Daniel Junghans, Alexander Lalejini, Emily Dolson, and Charles Ofria. 2020. Characterizing the Effects of Random Subsampling on Lexicase Selection. In Genetic Programming Theory and Practice XVII, Wolfgang Banzhaf, Erik Goodman, Leigh Sheneman, Leonardo Trujillo, and Bill Worzel (Eds.). Springer International Publishing, 1--23.
[3]
Thomas Helmuth and Peter Kelly. 2021. PSB2: the second program synthesis benchmark suite. In Proceedings of the Genetic and Evolutionary Computation Conference. ACM, Lille France, 785--794.
[4]
Thomas Helmuth, Edward Pantridge, and Lee Spector. 2019. Lexicase selection of specialists. In Proceedings of the Genetic and Evolutionary Computation Conference on - GECCO '19. ACM Press, Prague, Czech Republic, 1030--1038.
[5]
Thomas Helmuth and Lee Spector. 2015. General Program Synthesis Benchmark Suite. In Proceedings of the 2015 on Genetic and Evolutionary Computation Conference - GECCO '15. ACM Press, Madrid, Spain, 1039--1046.
[6]
Thomas Helmuth and Lee Spector. 2022. Problem-Solving Benefits of Down-Sampled Lexicase Selection. Artificial Life 27, 3 (2022), 183--203.
[7]
Shouyong Jiang and Shengxiang Yang. 2017. Evolutionary Dynamic Multiobjective Optimization: Benchmarks and Algorithm Comparisons. IEEE Transactions on Cybernetics 47, 1 (Jan. 2017), 198--211.
[8]
Alexander Lalejini, Emily Dolson, and Matthew Andres Moreno. 2024. (Supplemental material) Archived GitHub Repository. https://github.com/amlalejini/GECCO-2024-phylogeny-informed-subsampling.
[9]
Alexander Lalejini, Matthew Andres Moreno, Jose Guadalupe Hernandez, and Emily Dolson. 2024. Phylogeny-Informed Fitness Estimation for Test-Based Parent Selection. In Genetic Programming Theory and Practice XX, Stephan Winkler, Leonardo Trujillo, Charles Ofria, and Ting Hu (Eds.). Springer Nature Singapore, Singapore, 241--261. Series Title: Genetic and Evolutionary Computation.
[10]
Alexander Lalejini and Charles Ofria. 2018. Evolving event-driven programs with SignalGP. In Proceedings of the Genetic and Evolutionary Computation Conference on - GECCO '18. ACM Press, Kyoto, Japan, 1135--1142.
[11]
Alexander Lalejini, Marcos Sanson, Jack Garbus, Matthew Andres Moreno, and Emily Dolson. 2024. Runtime phylogenetic analysis enables extreme subsampling for test-based problems. (2024).
[12]
Patryk Orzechowski, William La Cava, and Jason H. Moore. 2018. Where are we now? a large benchmark study of recent symbolic regression methods. In Proceedings of the Genetic and Evolutionary Computation Conference. ACM, Kyoto Japan, 1183--1190.
[13]
Christian Pilato, Daniele Loiacono, Antonino Tumeo, Fabrizio Ferrandi, Pier Luca Lanzi, and Donatella Sciuto. 2010. Speeding-Up Expensive Evaluations in HighLevel Synthesis Using Solution Modeling and Fitness Inheritance. In Computational Intelligence in Expensive Optimization Problems, Lim Meng Hiot, Yew Soon Ong, Yoel Tenne, and Chi-Keong Goh (Eds.). Vol. 2. Springer Berlin Heidelberg, Berlin, Heidelberg, 701--723.
[14]
Shakiba Shahbandegan, Jose Guadalupe Hernandez, Alexander Lalejini, and Emily Dolson. 2022. Untangling phylogenetic diversity's role in evolutionary computation using a suite of diagnostic fitness landscapes. In Proceedings of the Genetic and Evolutionary Computation Conference Companion (New York, NY, USA, 2022-07-19) (GECCO '22). Association for Computing Machinery, 2322--2325.
[15]
Lee Spector. 2012. Assessment of problem modality by differential performance of lexicase selection in genetic programming: a preliminary report. In Proceedings of the 14th annual conference companion on Genetic and evolutionary computation (2012). ACM, 401--408. http://dl.acm.org/citation.cfm?id=2330846

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
GECCO '24 Companion: Proceedings of the Genetic and Evolutionary Computation Conference Companion
July 2024
2187 pages
ISBN:9798400704956
DOI:10.1145/3638530
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 August 2024

Check for updates

Author Tags

  1. genetic programming
  2. subsampling
  3. selection schemes
  4. lexicase selection
  5. phylogenetic analysis
  6. test-based problems

Qualifiers

  • Poster

Conference

GECCO '24 Companion
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 28
    Total Downloads
  • Downloads (Last 12 months)28
  • Downloads (Last 6 weeks)3
Reflects downloads up to 18 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media