Abstract
Heart disease diagnosis is found to be a challenging issue which can offer a computerized estimate about the level of heart disease so that supplementary action can be made easy. Thus, heart disease diagnosis has expected massive attention worldwide among the healthcare environment. Optimization algorithms played a significant role in heart disease diagnosis with good efficiency. The objective of this paper is to propose an optimization function on the basis of support vector machine (SVM). This objective function is used in the genetic algorithm (GA) for selecting the more significant features to get heart disease. The experimental results of the GA–SVM are compared with the various existing feature selection algorithms such as Relief, CFS, Filtered subset, Info gain, Consistency subset, Chi squared, One attribute based, Filtered attribute, Gain ratio, and GA. The receiver operating characteristic analysis is performed to evaluate the good performance of SVM classifier. The proposed framework is demonstrated in the MATLAB environment with a dataset collected from Cleveland heart disease database.
Similar content being viewed by others
References
Tsipouras, M.G., et al.: Automated diagnosis of coronary artery disease based on data mining and fuzzy modeling. J. IEEE Trans. Inform. Technol. Biomed. 12(4), 447–458 (2008)
Kusiak, A., Caldarone, C.A., et al.: Hypo plastic left heart syndrome knowledge discovery with a data mining approach. J. Comput. Biol. Med. 36(1), 21–40 (2006)
Huang, M.-J., et al.: Integrating data mining with case-based reasoning for chronic diseases prognosis and diagnosis. J. Expert Syst. Appl. 32, 856–867 (2007)
Nahar, J., et al.: Association rule mining to detect factors which contribute to heart disease in males and females. J. Expert Syst. Appl. 40, 1086–1093 (2013)
Padma, T., Mir, S.A., Shantharajah, S.P.: Intelligent decision support system for an integrated pest management in apple orchard. In: Intelligent Decision Support Systems for Sustainable Computing, Springer, pp. 225–245 (2017)
Das, Resul, Turkoglu, Ibrahim, et al.: Effective diagnosis of heart disease through neural networks ensembles. J. Expert Syst. Appl. 36, 7675–7680 (2009)
Das, Resul, Turkoglu, Ibrahim, et al.: Diagnosis of valvular heart disease through neural networks ensembles. J. Comput. Methods Progr. Biomed. 93, 185–191 (2009)
Gokulnath, C., Priyan, M. K., Balan, E. V., Prabha, K. R., Jeyanthi, R.: Preservation of privacy in data mining by using PCA based perturbation technique. In: Smart Technologies and Management for Computing, Communication, Controls, Energy and Materials (ICSTM), 2015 International Conference on (pp. 202–206). IEEE (2015)
Babaoglu, et al.: Assessment of exercise stress testing with artificial neural network in determining coronary artery disease and predicting lesion localization. J. Expert Syst. Appl. 36, 2562–2566 (2009)
Rajeswari, K., et al.: Feature selection in ischemic heart disease identification using feed forward neural networks. Int. Symp. Robot. Intell. Sens. 41, 1818–1823 (2012)
Park, Y.-J., et al.: Cost-sensitive case-based reasoning using a genetic algorithm: application to medical diagnosis. J. Artif. Intell. Med. 51, 133–145 (2011)
Nahar, J., et al.: Computational intelligence for heart disease diagnosis: a medical knowledge driven approach. J. Expert Syst Appl. 40, 96–104 (2013)
Polat, K., Güneş, S.: A hybrid approach to medical decision support systems: combining feature selection, fuzzy weighted pre-processing and AIRS. J. Comput. Methods Progr. Biomed. 88, 164–174 (2007)
Polat, K., et al.: Automatic detection of heart disease using an artificial immune recognition system (AIRS) with fuzzy resource allocation mechanism and k-nn (nearest neighbor) based weighting preprocessing. J. Expert Syst. Appl. 32, 625–631 (2007)
Uma, S., Shantharajah, S.P., Rani, C.: Passive Incidental alertness-based link visualization for secure data transmission in manet. J. Appl. Secur. Res. 12(2), 304–322 (2017)
Kahramanli, H., Allahverdi, N.: Design of a hybrid system for the diabetes and heart diseases. J. Expert Syst. Appl. 35, 82–89 (2008)
Khatibi, V., Montazer, G.A.: A fuzzy-evidential hybrid inference engine for coronary heart disease risk assessment. J. Expert Syst. Appl. 37, 8536–8542 (2010)
Priyan, M.K., Devi, G.U.: Energy efficient node selection algorithm based on node performance index and random waypoint mobility model in internet of vehicles. Clust. Comput. 9, 1–15 (2017)
Varatharajan, R., Manogaran, G., Priyan, M.K., Sundarasekar, R.: Wearable sensor devices for early detection of Alzheimer disease using dynamic time warping algorithm. Clust. Comput. 35, 1–10 (2017)
Anooj, P.K.: Clinical decision support system: Risk level prediction of heart disease using weighted fuzzy rules. J. Comput. Inform. Sci. 24, 27–40 (2012)
Nawi, N.M. et al.: The development of improved back-propagation neural networks algorithm for predicting patients with heart disease. In: Proceedings of the First International Conference ICICA, vol. 6377, pp. 317–324 (2010)
Varatharajan, R., Manogaran, G., Priyan, M.K.: A big data classification approach using LDA with an enhanced SVM method for ECG signals in cloud computing. Multimed. Tools Appl. (2017). https://doi.org/10.1007/s11042-017-5318-1
Paredesa, S., et al.: Long term cardiovascular risk models’ combination. J. Comput. Methods Progr. Biomed. 101, 231–242 (2011)
Shilaskar, S., et al.: Feature selection for medical diagnosis: evaluation for cardiovascular diseases. J. Expert Syst. Appl 40, 4146–4153 (2013)
Pu, L.N., et al.: Investigation on cardiovascular risk prediction using genetic information. J. IEEE Trans. Inform. Technol. Biomed. 16(5), 795–808 (2012)
Manogaran, G., Vijayakumar, V., Varatharajan, R., Kumar, P.M., Sundarasekar, R., Hsu, C.H.: Machine learning based big data processing framework for cancer diagnosis using hidden markov model and gm clustering. Wirel. Person. Commun. 22, 1–18 (2017)
UCI Machine Learning Repository: Heart Disease Data Set.: Archive.ics.uci.edu. http://archive.ics.uci.edu/ml/datasets/Heart+Disease (2017). Accessed 22 Oct 2017
Balan, E.V., Priyan, M.K., Gokulnath, C., Devi, G.U.: Fuzzy based intrusion detection systems in MANET. Proc. Comput. Sci. 50, 109–114 (2015)
Babaoglu, I., et al.: A comparison of feature selection models utilizing binary particle swarm optimization and genetic algorithm in determining coronary artery disease using support vector machine. J. Expert Syst. Appl. 37(4), 3177–3183 (2010)
Ordonez, C.: Association rule discover with the train and test approach for the heart disease prediction. IEEE Trans. Inform. Technol. Biomed. 10(2), 334–343 (2006)
Tan, K.C., et al.: A hybrid evolutionary algorithm for attribute selection in data mining. J. Expert Syst. Appl. 36, 8616–8630 (2009)
Polat, K., Gunes, S.: A new feature selection method on classification of medical datasets: Kernel F-score feature selection. J. Expert Syst. Appl. 36, 10367–10373 (2009)
Luukka, P., Lampinen, J.: A classification method based on principal component analysis and differential evolution algorithm applied for prediction diagnosis from clinical EMR heart data sets. J. Comput. Intell. Optim. Adapt. Learn. Optim. 7, 263–283 (2010)
Yan, H., et al.: Selecting critical clinical features for heart diseases diagnosis with a real-coded genetic algorithm. J. Appl. Soft Comput. 8, 1105–1111 (2008)
Ozcift, A., Gulten, A.: Classifier ensemble construction with rotation forest to improve medical diagnosis performance of machine learning algorithms. J. Comput. Methods Progr. Biomed. 104, 443–451 (2011)
Varatharajan, R., Manogaran, G., Priyan, M.K., Balaş, V.E., Barna, C.: Visual analysis of geospatial habitat suitability model based on inverse distance weighting with paired comparison analysis. Multimed. Tools Appl. 24, 1–21 (2017)
Gonçalves, L.B., et al.: Inverted hierarchical neuro-fuzzy BSP system: a novel neuro-fuzzy model for pattern classification and rule extraction in databases. J. IEEE Trans. Syst. Man Cybernet. 36(2), 236–248 (2006)
Austin, P.C., et al.: Using methods from the data-mining and machine-learning literature for disease classification and prediction: a case study examining classification of heart failure subtypes”. J. Clin. Epidemiol. 66, 398–407 (2013)
Turan, R.G., et al.: improved functional activity of bone marrow derived circulating progenitor cells after intra coronary freshly isolated bone marrow cells transplantation in patients with ischemic heart disease. J. Stem Cell Rev. Rep. 7, 646–656 (2011)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Gokulnath, C.B., Shantharajah, S.P. An optimized feature selection based on genetic approach and support vector machine for heart disease. Cluster Comput 22 (Suppl 6), 14777–14787 (2019). https://doi.org/10.1007/s10586-018-2416-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-018-2416-4