Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Advertisement

Improved Software Fault Prediction Model Based on Optimal Features Set and Threshold Values Using Metaheuristic Approach

  • Original Research
  • Published:
SN Computer Science Aims and scope Submit manuscript

Abstract

Software fault prediction models are very important to prioritize software classes for effective testing and efficient use of resources so that the testing process’s time, effort, and cost can be reduced. Fault prediction models can be based on either metrics’ threshold values or machine learning. Code metrics’ threshold-based models are easy to automate and faster than machine learning-based models, which can save significant time in the testing process. ROC, Alves ranking, and VARL are famous threshold value calculation techniques. Out of which ROC is the best threshold calculation technique. This research article proposes a new threshold values calculation technique based on metaheuristics. A genetic algorithm and particle swarm optimizer are used to calculate the threshold values, and the proposed technique is tested on ten open-source object-oriented software datasets and four open-source procedural software datasets. Results show that the metaheuristic-based thresholds give better results than ROC-based thresholds.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Explore related subjects

Find the latest articles, discoveries, and news in related topics.

Data Availability

Data will be made available on reasonable request to corresponding author.

References

  1. Boucher A, Badri M. Software metrics thresholds calculation techniques to predict fault proneness: an empirical comparison. Inf Softw Technol. 2018;96:38–67.

    Article  Google Scholar 

  2. Chidamber SR, Kemerer CF. A metrics suite for object oriented design. IEEE Trans Softw Eng. 1994;20(6):476–93.

    Article  Google Scholar 

  3. Shatnawi R, Li W, Swain J, Newman T. Finding software metrics threshold values using ROC curves. J Softw Maint Evol. 2010;22(1):1–16.

    Article  Google Scholar 

  4. Shatnawi R. A quantitative investigation of the acceptable risk levels of object oriented metrics in open-source systems. IEEE Trans Softw Eng. 2010;36(2):216–25.

    Article  Google Scholar 

  5. Gyimothy T, Ferenc R, Siket I. Empirical validation of object-oriented metrics on open source software for fault prediction. IEEE Trans Softw Eng. 2005;31(10):897–910.

    Article  Google Scholar 

  6. Malhotra R, Jain A. Fault prediction using statistical and machine learning methods for improving software quality. J Inf Process Syst. 2012;8(2):241–62.

    Article  Google Scholar 

  7. Jureczko M, Madeyski L. Towards identifying software project clusters with regard to defect prediction. In: Proceedings of the 6th International Conference on Predictive Models in Software Engineering - PROMISE ’10, 2010. p. 1.

  8. Kaur A, Kaur K. Performance analysis of ensemble learning for predicting defects in open source software. In: 2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI), 2014. pp. 219–225.

  9. Yu L. Using negative binomial regression analysis to predict software faults: a study of Apache ANT. Int J Inf Technol Comput Sci. 2012;4(8):63–70.

    Google Scholar 

  10. Dejaeger K, Verbraken T, Baesens B. Toward comprehensible software fault prediction models using Bayesian network classifiers. IEEE Trans Softw Eng. 2013;39(2):237–57.

    Article  Google Scholar 

  11. Catal C, Sevim U, Diri B. Clustering and metrics thresholds based software fault prediction of unlabeled program modules. In: ITNG 2009 - 6th International conference on information technology: new generations, 2009. pp. 199–204.

  12. Abaei G, Selamat A, Fujita H. An empirical study based on semi-supervised hybrid self-organizing map for software fault prediction. Knowl Based Syst. 2014;74:28–39.

    Article  Google Scholar 

  13. Shatnawi R. Improving software fault-prediction for imbalanced data. In: 2012 International Conference on Innovations in Information Technology, IIT 2012, 2012. pp. 54–59.

  14. Henderson-Sellers B. Object-oriented metrics: measures of complexity. Prentice-Hall, Inc; 1995.

    Google Scholar 

  15. Daly J, Brooks A, Miller J, Roper M, Wood M. Evaluating inheritance depth on the maintainability of object-oriented software. J Empir Softw Eng. 1996;1(2):109–32.

    Article  Google Scholar 

  16. Cartwright M. An empirical view of inheritance. Inf Softw Technol. 1998;40(4):795–9.

    Article  Google Scholar 

  17. Emam K, Benlarbi S, Goel N, Rai S. The confounding effect of class size on the validity of object-oriented metrics. IEEE Trans Softw Eng. 2001;27(7):630–48.

    Article  Google Scholar 

  18. El Emam K, Benlarbi S, Goel N, Melo W, Lounis H, Rai S. The optimal class size for object-oriented software. IEEE Trans Softw Eng. 2002;28(5):494–509.

    Article  Google Scholar 

  19. Erni K, Lewerentz C. Applying design-metrics to object-oriented frameworks. In: Proceedings of the third international symposium on software metrics: from measurement to empirical results, 1996; 64–74.

  20. Bender R. Quantitative risk assessment in epidemiological studies investigating threshold effects. Biom J. 1999;41(3):305–19.

    Article  MATH  Google Scholar 

  21. Alves TL, Ypma C, Visser J. Deriving metric thresholds from benchmark data. In: 2010 IEEE International Conference on Software Maintenance, 2010. pp. 1–10.

  22. McCabe T. A complexity measure. IEEE Trans Softw Eng. 1976;SE-2(4):308–20.

    Article  MathSciNet  MATH  Google Scholar 

  23. Rosenberg LH (1998) Applying and interpreting object oriented metrics. In: Software Technology Conference.

  24. Singh S, Kahlon KS. Object oriented software metrics threshold values at quantitative acceptable risk level. Csit. 2014;2(3):191–205.

    Article  Google Scholar 

  25. Benlarbi S, El Emam K, Goel N, Rai S. Thresholds for object-oriented measures. In: Proceedings 11th International Symposium on Software Reliability Engineering. ISSRE 2000, IEEE Comput. Soc, 2000. pp. 24–38

  26. Catal C, Alan O, Balkan K. Class noise detection based on software metrics and ROC curves. Inf Sci. 2011;181(21):4867–77.

    Article  Google Scholar 

  27. Boetticher G. The PROMISE repository of empirical software engineering data, 2007. https://cir.nii.ac.jp/all?q=http://promisedata.org/repository

  28. Canbek G, Sagiroglu S, Temizel TT, Baykal N. Binary classification performance measures/metrics: A comprehensive visualized roadmap to gain new insights. In: 2017 International Conference on Computer Science and Engineering (UBMK), IEEE, 2017. pp. 821–826.

  29. Kennedy J, Eberhart R. Particle swarm optimization. In: Proceedings of ICNN'95-international conference on neural networks, vol. 4. IEEE, 1995. pp. 1942–1948.

  30. Rathi SC, Misra S, Colomo-Palacios R, Adarsh R, Neti LBM, Kumar L. Empirical evaluation of the performance of data sampling and feature selection techniques for software fault prediction. Expert Syst Appl. 2023;223: 119806.

    Article  Google Scholar 

  31. Sharma U, Sadam R. How far does the predictive decision impact the software project? The cost, service time, and failure analysis from a cross-project defect prediction model. J Syst Softw. 2023;195: 111522.

    Article  Google Scholar 

  32. Feng S, Keung J, Zhang P, Xiao Y, Zhang M. The impact of the distance metric and measure on SMOTE-based techniques in software defect prediction. Inf Softw Technol. 2022;142: 106742.

    Article  Google Scholar 

  33. Arar ÖF, Ayan K. Deriving thresholds of software metrics to predict faults on open source software: replicated case studies. Expert Syst Appl. 2016;61:106–21.

    Article  Google Scholar 

  34. Nevendra M, Singh P. Empirical investigation of hyperparameter optimization for software defect count prediction. Expert Syst Appl. 2022;191: 116217.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Manpreet Singh.

Ethics declarations

Conflict of Interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the topical collection “Research Trends in Computational Intelligence” guest edited by Anshul Verma, Pradeepika Verma, Vivek Kumar Singh and S. Karthikeyan.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Singh, M., Chhabra, J.K. Improved Software Fault Prediction Model Based on Optimal Features Set and Threshold Values Using Metaheuristic Approach. SN COMPUT. SCI. 4, 770 (2023). https://doi.org/10.1007/s42979-023-02217-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42979-023-02217-x

Keywords