Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

An Optimized Feature Selection Method Using Ensemble Classifiers in Software Defect Prediction for Healthcare Systems

Published: 01 January 2022 Publication History

Abstract

The healthcare systems are extensively being used with increased focus on safety of patients. Software engineering for healthcare applications is an emerging research area. Detecting defects is a critical step of software development process of healthcare applications. The performance of the Software Defect Prediction model (SDP) depends on the features of healthcare system; irrelevant features decrease the performance of the model. An optimized feature selection technique is needed to recognize and remove the irrelevant features. In this study, a new optimized feature selection technique, i.e., multiobjective Harris Hawk Optimization (HHO), is proposed for binary classification problem with Adaptive Synthetic Sampling (ADASYN) Technique. Multiobjective HHO is proposed with two main objectives, one to reduce the total amount of selected features and the other to maximize the performance of the proposed model. The multiobjective feature selection technique helps to find the optimal solution to achieve the desired objectives and increase the classification performance in terms of accuracy, AUC, precision, recall, and F1-measure. The study conducts an experiment on a healthcare dataset. Six different search techniques (RF, SVM, bagging, adaptive boosting, voting, and stacking) are implemented on the dataset. The proposed model helps to predict the software defects with a significant classification accuracy of 0.990 and AUC score of 0.992.

References

[1]
N. Lalband and D. Kavitha, “Software engineering for smart healthcare applications,” International Journal of Innovative Technology and Exploring Engineering, vol. 8, no. 6S4, pp. 325–331, 2019.
[2]
Y. Zhang and G. Koru, “Understanding and detecting defects in healthcare administration data: toward higher data quality to better support healthcare operations and decisions,” Journal of the American Medical Informatics Association, vol. 27, no. 3, pp. 386–395, 2020.
[3]
R. Jetley and B. Chelf, “Diagnosing medical device software defects using static analysis,” in Coverity, Published in MD&DI, 2009.
[4]
M. Rizwan, A. Shabbir, A. R. Javed, G. Srivastava, T. R. Gadekallu, M. Shabir, and M. A. Hassan, “Risk monitoring strategy for confidentiality of healthcare information,” Computers and Electrical Engineering, vol. 100, article 107833, 2022.
[5]
S. M. Akhtar, M. Nazir, K. Saleem, R. Z. Ahmad, A. R. Javed, S. S. Band, and A. Mosavi, “A multi-agent formalism based on contextual defeasible logic for healthcare systems,” Frontiers in Public Health, vol. 10, 2022.
[6]
A. Mubashar, K. Asghar, A. R. Javed, M. Rizwan, G. Srivastava, T. R. Gadekallu, D. Wang, and M. Shabbir, “Storage and proximity management for centralized personal health records using an IPFS-based optimization algorithm,” Journal of Circuits, Systems and Computers, vol. 31, no. 1, p. 2250010, 2022.
[7]
Y. Zhang, P. Masci, P. Jones, and H. Thimbleby, “Research: user interface software errors in medical devices: study of U.S. recall data,” Biomedical Instrumentation & Technology, vol. 53, no. 3, pp. 182–194, 2019.
[8]
M. Kondo, C.-P. Bezemer, Y. Kamei, A. E. Hassan, and O. Mizuno, “The impact of feature reduction techniques on defect prediction models,” Empirical Software Engineering, vol. 24, no. 4, pp. 1925–1963, 2019.
[9]
L. H. Son, N. Pritam, M. Khari, R. Kumar, P. Phuong, and P. Thong, “Empirical study of software defect prediction: a systematic mapping,” Symmetry, vol. 11, no. 2, p. 212, 2019.
[10]
M. B. R. Pandit and N. Varma, “A deep introduction to ai based software defect prediction (SDP) and its current challenges,” in TENCON 2019-2019 IEEE Region 10 Conference (TENCON), pp. 284–290, Kochi, India, 2019.
[11]
C. L. Prabha and N. Shivakumar, “Software defect prediction using machine learning techniques,” in 2020 4th International Conference on Trends in Electronics and Informatics (ICOEI) (48184), pp. 728–733, Tirunelveli, India, 2020.
[12]
R. Malhotra and K. Khan, “A study on software defect prediction using feature extraction techniques,” in 2020 8th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions)(ICRITO), pp. 1139–1144, Noida, India, 2020.
[13]
Y. Qiao, J. Qian, S. Jiang, W. Zhenhua, and G. Zhang, “An empirical study on the effectiveness of feature selection for crossproject defect prediction,” IEEE Access, vol. 7, pp. 35710–35718, 2019.
[14]
H. Turabieh, M. Mafarja, and X. Li, “Iterated feature selection algorithms with layered recurrent neural network for software fault prediction,” Expert Systems with Applications, vol. 122, pp. 27–42, 2019.
[15]
P. Kumar, G. P. Gupta, and R. Tripathi, “Toward design of an intelligent cyber attack detection system using hybrid feature reduced approach for IoT networks,” Arabian Journal for Science and Engineering, vol. 46, no. 4, pp. 3749–3778, 2021.
[16]
I. Tumar, Y. Hassouneh, H. Turabieh, and T. Thaher, “Enhanced binary moth flame optimization as a feature selection algorithm to predict software fault prediction,” IEEE Access, vol. 8, pp. 8041–8055, 2020.
[17]
J. A. Wang and Z. Zou, “A clusterbased hybrid feature selection method for defect prediction,” in 2019 IEEE 19th International Conference on Software Quality, Reliability and Security (QRS), pp. 1–9, Sofia, Bulgaria, 2019.
[18]
C. Ni, X. Chen, W. Fangfang, Y. Shen, and G. Qing, “An empirical study on Pareto based multi-objective feature selection for software defect prediction,” Journal of Systems and Software, vol. 152, pp. 215–238, 2019.
[19]
Y. Hassouneh, H. Turabieh, T. Thaher, I. Tumar, H. Chantar, and J. Too, “Boosted whale optimization algorithm with natural selection operators for software fault prediction,” IEEE Access, vol. 9, pp. 14239–14258, 2021.
[20]
T. Thaher and N. Arman, “Efficient multi-swarm binary Harris Hawks optimization as a feature selection approach for software fault prediction,” in 2020 11th International Conference on Information and Communication Systems (ICICS), pp. 249–254, Irbid, Jordan, 2020.
[21]
T. Thaher, M. Mafarja, B. Abdalhaq, and H. Chantar, “Wrapper-based feature selection for imbalanced data using binary queuing search algorithm,” in 2019 2nd international conference on new trends in computing sciences (ICTCS), pp. 1–6, Amman, Jordan, 2019.
[22]
X. U. Xiaolong, C. H. E. N. Wen, and W. A. N. G. Xinheng, “RFC: a feature selection algorithm for software defect prediction,” Journal of Systems Engineering and Electronics, vol. 32, no. 2, pp. 389–398, 2021.
[23]
A. Iqbal and S. Aftab, “A classification framework for software defect prediction using multi-filter feature selection technique and MLP,” International Journal of Modern Education & Computer Science, vol. 12, no. 1, pp. 18–25, 2020.
[24]
B. Mumtaz, S. Kanwal, S. Alamri, and F. Khan, “Feature selection using artificial immune network: an approach for software defect prediction,” Intelligent Automation and Soft Computing, vol. 29, no. 3, pp. 669–684, 2021.
[25]
R. Malhotra, A. Budhiraja, A. K. Singh, and I. Ghoshal, “A novel feature selection approach based on binary particle swarm optimization and ensemble learning for heterogeneous defect prediction,” in 2021 3rd Asia Pacific Information Technology Conference, pp. 115–121, Bangkok Thailand, 2021.
[26]
A. R. Javed, M. U. Sarwar, M. O. Beg, M. Asim, T. Baker, and H. Tawfik, “A collaborative healthcare framework for shared healthcare plan with ambient intelligence,” Human-centric Computing and Information Sciences, vol. 10, no. 1, pp. 1–21, 2020.
[27]
M. Fayyaz, A. A. Farhan, and A. R. Javed, “Thermal comfort model for HVAC buildings using machine learning,” Arabian Journal for Science and Engineering, vol. 47, no. 2, pp. 2045–2060, 2022.
[28]
K. Saleem, M. Saleem, R. Zeeshan, A. R. Javed, M. Alazab, T. R. Gadekallu, and A. Suleman, “Situation-aware BDI reasoning to detect early symptoms of covid 19 using smartwatch,” IEEE Sensors Journal, p. 1, 2022.
[29]
A. Amanat, M. Rizwan, A. R. Javed, M. Abdelhaq, R. Alsaqour, S. Pandya, and M. Uddin, “Deep learning for depression detection from textual data,” Electronics, vol. 11, no. 5, p. 676, 2022.
[30]
M. Rizwan, A. Shabbir, A. R. Javed, M. Shabbir, T. Baker, and D. al-Jumeily Obe, “Brain tumor and glioma grade classification using Gaussian convolutional neural network,” Access, vol. 10, pp. 29731–29740, 2022.
[31]
A. R. Javed, “Automated cognitive health assessment in smart homes using machine learning,” Sustainable Cities and Society, vol. 65, article 102572, 2021.
[32]
T. M. Ghazal, M. Anam, M. K. Hasan, M. Hussain, M. S. Farooq, H. M. A. Ali, M. Ahmad, and T. R. Soomro, “Hep-pred: hepatitis c staging prediction using fine Gaussian SVM,” Computers, Materials & Continua, vol. 69, no. 1, pp. 191–203, 2021.
[33]
K. Shailaja, B. Seetharamulu, and M. A. Jabbar, “Machine learning in healthcare: a review,” in 2018 Second international conference on electronics, communication and aerospace technology (ICECA), pp. 910–914, Coimbatore, India, 2018.
[34]
T. M. Ghazal, S. Abbas, S. Munir, M. A. Khan, M. Ahmad, G. F. Issa, S. B. Zahra, M. A. Khan, and M. K. Hasan, “Alzheimer disease detection empowered with transfer learning,” Computers, Materials & Continua, vol. 70, no. 3, pp. 5005–5019, 2022.
[35]
G. D. Magoulas and A. Prentza, “Machine learning in medical applications,” in Advanced Course on Artificial Intelligence, pp. 300–307, Springer, 2001.
[36]
T. Han, N. Stone-Weiss, J. Huang, A. Goel, and A. Kumar, “Machine learning as a tool to design glasses with controlled dissolution for healthcare applications,” Acta Biomaterialia, vol. 107, pp. 286–298, 2020.
[37]
M. K. Hasan, S. Islam, I. Memon, A. F. Ismail, S. Abdullah, A. K. Budati, and N. S. Nafi, “A novel resource oriented DMA framework for internet of medical things devices in 5g network,” IEEE Transactions on Industrial Informatics, p. 1, 2022.
[38]
M. K. Hasan, S. Islam, R. Sulaiman, S. Khan, A.-H. A. Hashim, S. Habib, M. Islam, S. Alyahya, M. M. Ahmed, S. Kamil, and M. A. Hassan, “Lightweight encryption technique to enhance medical image security on internet of medical things applications,” Access, vol. 9, pp. 47731–47742, 2021.
[39]
R. Malhotra, N. Nishant, S. Gurha, and V. Rathi, “Application of particle swarm optimization for software defect prediction using object oriented metrics,” in 2021 11th International Conference on Cloud Computing, Data Science & Engineering (Confluence), pp. 88–93, Noida, India, 2021.
[40]
J. Li, H. Kang, G. Sun, T. Feng, W. Li, W. Zhang, and B. Ji, “IBDA: improved binary dragonfly algorithm with evolutionary population dynamics and adaptive crossover for feature selection,” IEEE Access, vol. 8, pp. 108032–108051, 2020.
[41]
H. Alsawalqah, N. Hijazi, M. Eshtay, H. Faris, A. A. Radaideh, I. Aljarah, and Y. Alshamaileh, “Software defect prediction using heterogeneous ensemble classification based on segmented patterns,” Applied Sciences, vol. 10, no. 5, p. 1745, 2020.
[42]
A. O. Balogun, A. O. Bajeh, V. A. Orie, and W. A. Yusuf-Asaju, “Software defect prediction using ensemble learning: an ANP based evaluation method,” FUOYE Journal of Engineering and Technology, vol. 3, no. 2, 2018.
[43]
F. Matloob, S. Aftab, and A. Iqbal, “A framework for software defect prediction using feature selection and ensemble learning techniques,” International Journal of Modern Education & Computer Science, vol. 11, no. 12, pp. 14–20, 2019.
[44]
R. Malhotra and J. Jain, “Handling imbalanced data using ensemble learning in software defect prediction,” in 2020 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence), pp. 300–304, Noida, India, 2020.
[45]
A. Alsaeedi and M. Z. Khan, “Software defect prediction using supervised machine learning and ensemble techniques: a comparative study,” Journal of Software Engineering and Applications, vol. 12, no. 5, pp. 85–100, 2019.
[46]
T. Dokeroglu, A. Deniz, and H. E. Kiziloz, “A robust multiobjective Harris' Hawks Optimization algorithm for the binary classification problem,” Knowledge-Based Systems, vol. 227, article ???, 2021.
[47]
H. He, X. Zhang, Q. Wang, J. Ren, J. Liu, X. Zhao, and Y. Cheng, “Ensemble multiboost based on ripper classifier for prediction of imbalanced software defect data,” IEEE Access, vol. 7, pp. 110333–110343, 2019.
[48]
A. A. Heidari, S. Mirjalili, H. Faris, I. Aljarah, M. Mafarja, and H. Chen, “Harris Hawks optimization: algorithm and applications,” Future Generation Computer Systems, vol. 97, pp. 849–872, 2019.
[49]
M. Abdel-Basset, W. Ding, and D. El-Shahat, “A hybrid Harris Hawks optimization algorithm with simulated annealing for feature selection,” Artificial Intelligence Review, vol. 54, no. 1, pp. 593–637, 2021.
[50]
Y. N. Soe, P. I. Santosa, and R. Hartanto, “Software defect prediction using random forest algorithm,” in 2018 12th South East Asian Technical University Consortium (SEATUC), pp. 1–5, Yogyakarta, Indonesia, 2018.
[51]
R. Li, L. Zhou, S. Zhang, H. Liu, X. Huang, and Z. Sun, “Software defect prediction based on ensemble learning,” in Proceedings of the 2019 2nd International conference on data science and information technology, pp. 1–6, Seoul Republic of Korea, 2019.
[52]
M. A. Mabayoje, A. O. Balogun, A. O. Bajeh, and B. A. Musa, “Software Defect Prediction: Effect of Feature Selection and Ensemble Methods,” FUW Trends in Science & Technology Journal, vol. 3, no. 2, pp. 518–522, 2018.
[53]
M. A. I. Aquil and W. H. W. Ishak, “Predicting software defects using machine learning techniques,” International Journal, vol. 9, no. 4, pp. 6609–6616, 2020.

Cited By

View all
  • (2023)A Systematic Literature Review on Robust Swarm Intelligence Algorithms in Search-Based Software EngineeringComplexity10.1155/2023/45775812023Online publication date: 1-Jan-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Wireless Communications & Mobile Computing
Wireless Communications & Mobile Computing  Volume 2022, Issue
2022
25330 pages
This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Publisher

John Wiley and Sons Ltd.

United Kingdom

Publication History

Published: 01 January 2022

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 28 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2023)A Systematic Literature Review on Robust Swarm Intelligence Algorithms in Search-Based Software EngineeringComplexity10.1155/2023/45775812023Online publication date: 1-Jan-2023

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media