Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3622748.3622750acmotherconferencesArticle/Chapter ViewAbstractPublication PagessbcarsConference Proceedingsconference-collections
research-article

Feature selection in an interactive search-based PLA design approach

Published: 25 September 2023 Publication History
  • Get Citation Alerts
  • Abstract

    The Product Line Architecture (PLA) is one of the most important artifacts of a Software Product Line (SPL). PLA design can be formulated as an interactive optimization problem with many conflicting factors. Incorporating Decision Makers’ (DM) preferences during the search process may help the algorithms find more adequate solutions for their profiles. Interactive approaches allow the DM to evaluate solutions, guiding the optimization according to their preferences. However, this brings up human fatigue problems caused by the excessive amount of interactions and solutions to evaluate. A common strategy to prevent this problem is limiting the number of interactions and solutions evaluated by the DM. Machine Learning (ML) models were also used to learn how to evaluate solutions according to the DM profile and replace them after some interactions. Feature selection performs an essential task as non-relevant and/or redundant features used to train the ML model can reduce the accuracy and comprehensibility of the hypotheses induced by ML algorithms. This work aims to select features of an ML model used to prevent human fatigue in an interactive search-based PLA design approach. We applied four selectors and through results we were able to reduce 30% of features, obtaining an accuracy of 99%.

    References

    [1]
    Boukhdhir Amal, Marouane Kessentini, Slim Bechikh, Josselin Dea, and Lamjed Ben Said. 2014. On the use of machine learning and search-based software engineering for ill-defined fitness function: a case study on software refactoring. In International Symposium on Search-based Software Engineering (SSBSE). 31–45.
    [2]
    Caio Arasaki, Lucas Wolshick, Willian Marques Freire, and Aline Maria Malachini Miotto Amaral. 2023. Feature selection in an interactive search-based PLA design approach. https://doi.org/10.5281/zenodo.7930627
    [3]
    Carlos Vinicius Bindewald, Willian M Freire, Aline M M Miotto Amaral, and Thelma Elita Colanzi. 2019. Towards the support of user preferences in search-based product line architecture design: an exploratory study. In Proceedings of the XXXIII Brazilian Symposium on Software Engineering (SBES). 387–396.
    [4]
    Carlos Vinicius Bindewld, Willian Marques Freitas, Aline Maria Malachini Miotto Anaral, and Thelma Elita Colanzi. 2020. Supporting user preferences in search-based product line architecture design using Machine Learning. In XIV Brazilian Symposium on Software Components, Architectures, and Reuse. SBC.
    [5]
    Christopher M Bishop. 2006. Pattern recognition and machine learning. Springer.
    [6]
    Grady Booch, James Rumbaugh, and Ivar Jacobson. 1998. The Unified Modeling Language User Guide. Addison-Wesley Professional (1998).
    [7]
    John G Cleary and Leonard E Trigg. 1995. K*: An Instance-Based Learner Using an Entropic Distance Measure. Machine Learning 21, 1-2 (1995), 61–81.
    [8]
    Thelma Elita Colanzi, Silvia Regina Vergilio, Itana M. S. Gimenes, and Willian Nalepa Oizumi. 2014. A Search-Based Approach for Software Product Line Design. In 18th International Software Product Line Conference. 237–241.
    [9]
    Antonio C Contieri, Guilherme G Correia, Thelma E Colanzi, Itana MS Gimenes, Edson A OliveiraJr, Sandra Ferrari, Paulo C Masiero, and Alessandro F Garcia. 2011. Extending UML components to develop software product-line architectures: Lessons learned. In European Conference on Software Architecture. 130–138.
    [10]
    Thiago Nascimento Ferreira, Silvia Regina Vergilio, and Jerffeson T. de Souza. 2017. Incorporating user preferences in search-based software engineering: A systematic mapping study. Information and Software Technology 90 (2017), 55–69.
    [11]
    Ferreira FN, Allysson Allex Araújo, Altino Dantas Basílio Neto, and Jerffeson Teixeira de Souza. 2016. Incorporating user preferences in ant colony optimization for the next release problem. Applied Software Computing 49 (2016), 1283–1296.
    [12]
    Willian Freire, Cláudia Rosa, Aline Amaral, and Thelma Colanzi. 2022. Validating an Interactive Ranking Operator for NSGA-II to Support the Optimization of Software Engineering Problems. In Proceedings of the XXXVI Brazilian Symposium on Software Engineering (SBES). 337–346.
    [13]
    Willian Marques Freire, Carlos Vinícius Bindewald, Aline MM Miotto Amaral, and Thelma Elita Colanzi. 2019. Supporting decision makers in search-based product line architecture design using clustering. In 2019 IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC), Vol. 1. 139–148.
    [14]
    Willian Marques Freire, Mamoru Massago, Arthur Cattaneo Zavadski, Aline M M Miotto Amaral, and Thelma Elita Colanzi. 2020. OPLA-Tool v2.0: a Tool for Product Line Architecture Design Optimization. In 34th Brazilian Symposium on Software Engineering (SBES).
    [15]
    Gnanambal S, Thangaraj M, Meenatchi V.T, and Gayathri V. 2018. Classification Algorithms with Attribute Selection: an evaluation study using Weka. International Journal of Advanced Networking and Applications 09(06) (2018), 3640–3644.
    [16]
    M. Hall and Guetlein M.2022. BestFirst — Weka. https://weka.sourceforge.io/doc.dev/weka/attributeSelection/BestFirst.html. Accessed in April 2023.
    [17]
    M. A. Hall. 1998. Correlation-based Feature Subset Selection for Machine Learning. Ph. D. Dissertation. University of Waikato, Hamilton, New Zealand.
    [18]
    Mark Harman and Bryan F. Jones. 2001. Search-based software engineering. Information and Soft. Technology 43, 14 (2001), 833–839.
    [19]
    Kyo C Kang, Sholom G Cohen, James A Hess, William E Novak, and A Spencer Peterson. 1990. Feature-oriented domain analysis (FODA) feasibility study. Technical Report. DTIC Document.
    [20]
    K.Sutha and Dr.J. Jebamalar Tamilselvi. 2015. A Review of Feature Selection Algorithms for Data Mining Techniques. International Journal on Computer Science and Engineering 7(6) (2015).
    [21]
    Fernando H Kuviatkovski, Willian M Freire, Aline MMM Amaral, Thelma E Colanzi, and Valéria D Feltrim. 2022. Evaluating Machine Learning Algorithms in Representing Decision Makers in search-based PLA. In 2022 IEEE 19th International Conference on Software Architecture Companion (ICSA-C). IEEE, 68–75.
    [22]
    Rensis Likert. 1932. A technique for the measurement of attitudes.Archives of psychology (1932).
    [23]
    Huan Liu and Hiroshi Motoda. 2012. Feature selection for knowledge discovery and data mining. Vol. 454. Springer Science & Business Media.
    [24]
    M. Ramaswami and R. Bhaskaran. 2009. A Study on Feature Selection Techniques in Educational Data Mining. JOURNAL OF COMPUTING 1(1) (2009).
    [25]
    Eric Mjolsness and Dennis Decoste. 2001. Machine Learning for Science: State of the Art and Future Prospects. Science (New York, N.Y.) 293 (10 2001), 2051–5.
    [26]
    Mohamed W Mkaouer, Marouane Kessentini, Bechikh Slim, and Daniel R Tauritz. 2013. Preference-based multi-objective software modelling. In Proceedings of the 1st International Workshop on Combining Modelling and Search-Based Software Engineering. 61–66.
    [27]
    E. OliveiraJr, I. M. S. Gimenes, and J. C. Maldonado. 2010. Systematic Management of Variability in UML-based Software Product Lines. Journal of Universal Computer Science 16 (2010), 2374–2393.
    [28]
    Klaus Pohl, Günter Böckle, and Frank J van Der Linden. 2005. Software product line engineering: foundations, principles and techniques. Springer.
    [29]
    J. Ross Quinlan. 2014. C4.5: programs for machine learning. Elsevier. 22 pages.
    [30]
    Aurora Ramirez, Jose Raul Romero, and Christopher Simons. 2018. A systematic review of interaction in search-based software engineering. IEEE Transactions on Software Engineering (2018).
    [31]
    Cláudia Tupan Rosa, Willian M Freire, Aline M M Miotto Amaral, and Thelma Elita Colanzi. 2022. Towards an interactive ranking operator for NSGA-II. In Proceedings of the Genetic and Evolutionary Computation Conference Companion (GECCO). 794–797.
    [32]
    Stuart Russell, Peter Norvig, and Ernest Davis. 2016. Artificial Intelligence: A Modern Approach. Pearson.
    [33]
    SEI. 2009. Software Engineering Institute - The Arcade Game Maker Pedagogical Product Line. https://resources.sei.cmu.edu/library/asset-view.cfm?assetID=485941. Accessed in 2018 August.
    [34]
    Chris Simons, Jeremy Singer, and David R. White. 2015. Search-Based Refactoring: Metrics Are Not Enough. In Search-Based Software Engineering, Márcio Barros and Yvan Labiche (Eds.). 47–61.
    [35]
    Sunita Beniwal and Jitender Arora. 2012. Classification and Feature Selection Techniques in Data Mining. International Journal of Engineering Research & Technology 1(6) (2012).
    [36]
    Yenisei Delgado Verdecia, Thelma Elita Colanzi, Silvia Regina Vergilio, and Marcelo C. Santos. 2017. An Enhanced Evaluation Model for Search-based Product Line Architecture Design. In 20th Iberoamerican Conference on Software Engineering (CIbSE). CIbSE, San Jose, Costa Rica, 155–168.
    [37]
    P. A. Vikhar. 2016. Evolutionary algorithms: A critical review and its future prospects. International Conference on Global Trends in Signal Processing, Information Computing and Communication (ICGTSPICC) (2016), 261–265.
    [38]
    Ian H. Witten and Eibe Frank. 2005. Data Mining: Practical Machine Learning Tools and Techniques (2nd ed.). Morgan Kaufmann, San Francisco.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    SBCARS '23: Proceedings of the 17th Brazilian Symposium on Software Components, Architectures, and Reuse
    September 2023
    81 pages
    ISBN:9798400709524
    DOI:10.1145/3622748
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 25 September 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Feature Selection
    2. Interactive search-based Software Engineering
    3. Machine Learning

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    SBCARS 2023

    Acceptance Rates

    Overall Acceptance Rate 23 of 79 submissions, 29%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 17
      Total Downloads
    • Downloads (Last 12 months)17
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 11 Aug 2024

    Other Metrics

    Citations

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media