Improved Software Fault Prediction Model Based on Optimal Features Set and Threshold Values Using Metaheuristic Approach

Singh, Manpreet; Chhabra, Jitender Kumar

doi:10.1007/s42979-023-02217-x

Improved Software Fault Prediction Model Based on Optimal Features Set and Threshold Values Using Metaheuristic Approach

Original Research
Published: 06 October 2023

Volume 4, article number 770, (2023)
Cite this article

SN Computer Science Aims and scope Submit manuscript

Manpreet Singh¹ &
Jitender Kumar Chhabra¹

74 Accesses
2 Citations
Explore all metrics

Abstract

Software fault prediction models are very important to prioritize software classes for effective testing and efficient use of resources so that the testing process’s time, effort, and cost can be reduced. Fault prediction models can be based on either metrics’ threshold values or machine learning. Code metrics’ threshold-based models are easy to automate and faster than machine learning-based models, which can save significant time in the testing process. ROC, Alves ranking, and VARL are famous threshold value calculation techniques. Out of which ROC is the best threshold calculation technique. This research article proposes a new threshold values calculation technique based on metaheuristics. A genetic algorithm and particle swarm optimizer are used to calculate the threshold values, and the proposed technique is tested on ten open-source object-oriented software datasets and four open-source procedural software datasets. Results show that the metaheuristic-based thresholds give better results than ROC-based thresholds.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Software fault prediction using lion optimization algorithm

Article 21 September 2021

Software fault prediction using evolving populations with mathematical diversification

Article 10 September 2022

A hybrid approach to software fault prediction using genetic programming and ensemble learning methods

Article 04 January 2022

Find the latest articles, discoveries, and news in related topics.

Artificial Intelligence

Data Availability

Data will be made available on reasonable request to corresponding author.

References

Boucher A, Badri M. Software metrics thresholds calculation techniques to predict fault proneness: an empirical comparison. Inf Softw Technol. 2018;96:38–67.
Article Google Scholar
Chidamber SR, Kemerer CF. A metrics suite for object oriented design. IEEE Trans Softw Eng. 1994;20(6):476–93.
Article Google Scholar
Shatnawi R, Li W, Swain J, Newman T. Finding software metrics threshold values using ROC curves. J Softw Maint Evol. 2010;22(1):1–16.
Article Google Scholar
Shatnawi R. A quantitative investigation of the acceptable risk levels of object oriented metrics in open-source systems. IEEE Trans Softw Eng. 2010;36(2):216–25.
Article Google Scholar
Gyimothy T, Ferenc R, Siket I. Empirical validation of object-oriented metrics on open source software for fault prediction. IEEE Trans Softw Eng. 2005;31(10):897–910.
Article Google Scholar
Malhotra R, Jain A. Fault prediction using statistical and machine learning methods for improving software quality. J Inf Process Syst. 2012;8(2):241–62.
Article Google Scholar
Jureczko M, Madeyski L. Towards identifying software project clusters with regard to defect prediction. In: Proceedings of the 6th International Conference on Predictive Models in Software Engineering - PROMISE ’10, 2010. p. 1.
Kaur A, Kaur K. Performance analysis of ensemble learning for predicting defects in open source software. In: 2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI), 2014. pp. 219–225.
Yu L. Using negative binomial regression analysis to predict software faults: a study of Apache ANT. Int J Inf Technol Comput Sci. 2012;4(8):63–70.
Google Scholar
Dejaeger K, Verbraken T, Baesens B. Toward comprehensible software fault prediction models using Bayesian network classifiers. IEEE Trans Softw Eng. 2013;39(2):237–57.
Article Google Scholar
Catal C, Sevim U, Diri B. Clustering and metrics thresholds based software fault prediction of unlabeled program modules. In: ITNG 2009 - 6th International conference on information technology: new generations, 2009. pp. 199–204.
Abaei G, Selamat A, Fujita H. An empirical study based on semi-supervised hybrid self-organizing map for software fault prediction. Knowl Based Syst. 2014;74:28–39.
Article Google Scholar
Shatnawi R. Improving software fault-prediction for imbalanced data. In: 2012 International Conference on Innovations in Information Technology, IIT 2012, 2012. pp. 54–59.
Henderson-Sellers B. Object-oriented metrics: measures of complexity. Prentice-Hall, Inc; 1995.
Google Scholar
Daly J, Brooks A, Miller J, Roper M, Wood M. Evaluating inheritance depth on the maintainability of object-oriented software. J Empir Softw Eng. 1996;1(2):109–32.
Article Google Scholar
Cartwright M. An empirical view of inheritance. Inf Softw Technol. 1998;40(4):795–9.
Article Google Scholar
Emam K, Benlarbi S, Goel N, Rai S. The confounding effect of class size on the validity of object-oriented metrics. IEEE Trans Softw Eng. 2001;27(7):630–48.
Article Google Scholar
El Emam K, Benlarbi S, Goel N, Melo W, Lounis H, Rai S. The optimal class size for object-oriented software. IEEE Trans Softw Eng. 2002;28(5):494–509.
Article Google Scholar
Erni K, Lewerentz C. Applying design-metrics to object-oriented frameworks. In: Proceedings of the third international symposium on software metrics: from measurement to empirical results, 1996; 64–74.
Bender R. Quantitative risk assessment in epidemiological studies investigating threshold effects. Biom J. 1999;41(3):305–19.
Article MATH Google Scholar
Alves TL, Ypma C, Visser J. Deriving metric thresholds from benchmark data. In: 2010 IEEE International Conference on Software Maintenance, 2010. pp. 1–10.
McCabe T. A complexity measure. IEEE Trans Softw Eng. 1976;SE-2(4):308–20.
Article MathSciNet MATH Google Scholar
Rosenberg LH (1998) Applying and interpreting object oriented metrics. In: Software Technology Conference.
Singh S, Kahlon KS. Object oriented software metrics threshold values at quantitative acceptable risk level. Csit. 2014;2(3):191–205.
Article Google Scholar
Benlarbi S, El Emam K, Goel N, Rai S. Thresholds for object-oriented measures. In: Proceedings 11th International Symposium on Software Reliability Engineering. ISSRE 2000, IEEE Comput. Soc, 2000. pp. 24–38
Catal C, Alan O, Balkan K. Class noise detection based on software metrics and ROC curves. Inf Sci. 2011;181(21):4867–77.
Article Google Scholar
Boetticher G. The PROMISE repository of empirical software engineering data, 2007. https://cir.nii.ac.jp/all?q=http://promisedata.org/repository
Canbek G, Sagiroglu S, Temizel TT, Baykal N. Binary classification performance measures/metrics: A comprehensive visualized roadmap to gain new insights. In: 2017 International Conference on Computer Science and Engineering (UBMK), IEEE, 2017. pp. 821–826.
Kennedy J, Eberhart R. Particle swarm optimization. In: Proceedings of ICNN'95-international conference on neural networks, vol. 4. IEEE, 1995. pp. 1942–1948.
Rathi SC, Misra S, Colomo-Palacios R, Adarsh R, Neti LBM, Kumar L. Empirical evaluation of the performance of data sampling and feature selection techniques for software fault prediction. Expert Syst Appl. 2023;223: 119806.
Article Google Scholar
Sharma U, Sadam R. How far does the predictive decision impact the software project? The cost, service time, and failure analysis from a cross-project defect prediction model. J Syst Softw. 2023;195: 111522.
Article Google Scholar
Feng S, Keung J, Zhang P, Xiao Y, Zhang M. The impact of the distance metric and measure on SMOTE-based techniques in software defect prediction. Inf Softw Technol. 2022;142: 106742.
Article Google Scholar
Arar ÖF, Ayan K. Deriving thresholds of software metrics to predict faults on open source software: replicated case studies. Expert Syst Appl. 2016;61:106–21.
Article Google Scholar
Nevendra M, Singh P. Empirical investigation of hyperparameter optimization for software defect count prediction. Expert Syst Appl. 2022;191: 116217.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Computer Engineering Department, National Institute of Technology, Kurukshetra, India
Manpreet Singh & Jitender Kumar Chhabra

Authors

Manpreet Singh
View author publications
You can also search for this author in PubMed Google Scholar
Jitender Kumar Chhabra
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Manpreet Singh.

Ethics declarations

Conflict of Interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the topical collection “Research Trends in Computational Intelligence” guest edited by Anshul Verma, Pradeepika Verma, Vivek Kumar Singh and S. Karthikeyan.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Singh, M., Chhabra, J.K. Improved Software Fault Prediction Model Based on Optimal Features Set and Threshold Values Using Metaheuristic Approach. SN COMPUT. SCI. 4, 770 (2023). https://doi.org/10.1007/s42979-023-02217-x

Download citation

Received: 28 February 2023
Accepted: 02 August 2023
Published: 06 October 2023
DOI: https://doi.org/10.1007/s42979-023-02217-x

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Improved Software Fault Prediction Model Based on Optimal Features Set and Threshold Values Using Metaheuristic Approach

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Software fault prediction using lion optimization algorithm

Software fault prediction using evolving populations with mathematical diversification

A hybrid approach to software fault prediction using genetic programming and ensemble learning methods

Data Availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of Interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Improved Software Fault Prediction Model Based on Optimal Features Set and Threshold Values Using Metaheuristic Approach

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Software fault prediction using lion optimization algorithm

Software fault prediction using evolving populations with mathematical diversification

A hybrid approach to software fault prediction using genetic programming and ensemble learning methods

Explore related subjects

Data Availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of Interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation