Abstract
In recent decades, artificial neural networks (ANNs) have been extensively applied in different areas such as engineering, medicine, business, education, manufacturing and so on. Nowadays, ANNs are as a hot research in medicine especially in the fields of medical disease diagnosis. To have a high efficiency in ANN, selection of an appropriate architecture and learning algorithm is very important. ANN learning is a complex task and an efficient learning algorithm has a significant role to enhance ANN performance. In this paper, a new meta-heuristic algorithm, centripetal accelerated particle swarm optimization (CAPSO), is applied to evolve the ANN learning and accuracy. The algorithm is based on an improved scheme of particle swarm algorithm and Newton’s laws of motion. The hybrid learning of CAPSO and multi-layer perceptron (MLP) network, CAPSO-MLP, is used to classify the data of nine standard medical datasets of Hepatitis, Heart Disease, Pima Indian Diabetes, Wisconsin Prognostic Breast Cancer, Parkinson’s disease, Echocardiogram, Liver Disorders, Laryngeal 1 and Acute Inflammations. The performance of CAPSO-MLP is compared with those of PSO, gravitational search algorithm and imperialist competitive algorithm on MLP. The efficiency of methods are evaluated based on mean square error, accuracy, sensitivity, specificity, area under the receiver operating characteristics curve and statistical tests of \(t\)-test and Wilcoxon’s signed ranks test. The results indicate that CAPSO-MLP provides more effective performance than the others for medical disease diagnosis especially in term of unseen data (testing data) and datasets with high missing data values.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Abbod MF, Cheng KY, Cui XR, Huang SJ, Han YY, Shieh JS (2011) Ensembled neural networks for brain death prediction for patients with severe head injury. Biomed Signal Process Control 6:414–421
Ahmadi MA, Ahmadi MR, Shadizadeh SR (2013) Evolving artificial neural network and imperialist competitive algorithm for prediction permeability of the reservoir. Neural Comput Appl 23:563
Anderson JA (2006) An introduction to neural networks. Prentice-Hall of India Private Limited
Ari S, Saha G (2009) In search of an optimization technique for artificial neural network to classify abnormal heart sounds. Appl Soft Comput 9:330–340
Ashizawa K, Ishida T, MacMahon H, Vyborny CJ, Katsuragawa S, Doi K (1999) Artificial neural networks in chest radiography: application to the differential diagnosis of interstitial lung disease. Acad Radiol 6:2–9
Atashpaz-Gargari E, Lucas C (2007) Imperialist competitive algorithm: an algorithm for optimization inspired by imperialistic competition. In: Proceedings of IEEE congress on evolutionary computation, pp 4661–4667
Awad M, Motai Y (2008) Dynamic classification for video stream using support vector machine. Appl Soft Comput 8:1314–1325
Bache K, Lichman M (2013) UCI machine learning repository. School of Information and Computer Science, University of California, Irvine. http://archive.ics.uci.edu/ml
Beheshti Z, Shamsuddin SM (2013a) CAPSO: centripetal accelerated particle swarm optimization. Inf Sci. doi:10.1016/j.ins.2013.08.015
Beheshti Z, Shamsuddin SM (2013b) A review of population-based meta-heuristic algorithm. Int J Adv Soft Comput Appl 5(1):1–35
Bennett KP, Mangasarian OL (1992) Robust linear programming discrimination of two linearly inseparable sets. Optim Method Softw 1:23–34
Bishop CM (1995) Neural networks for pattern recognition. Oxford University Press, Oxford
Breiman L, Friedman JH, Olshen RA, Stone C (1984) Classification and regression trees. Wadsworth
Brent RP (1991) Fast training algorithms for multi-layer neural nets. IEEE Trans Neural Netw 2:346–354
Box GE-P, Hunter JS, Hunter WG (2005) Statistics for experiments: design, innovation, and discovery, 2nd edn. Wiley, New York
Chau KW (2007) Application of a PSO-based neural network in analysis of outcomes of construction claims. Autom Constr 16:642– 646
Chen MY, Chen KK, Chiang HK, Huang HS, Huang MJ (2007) Comparing extended classifier system and genetic programming for financial forecasting: an empirical study. Soft Comput. 11:1173–1183
Coppini G, Miniati M, Paterni M, Monti S, Ferdeghini EM (2007) Computer-aided diagnosis of emphysema in COPD patients: neural-network-based analysis of lung shape in digital chest radiographs. Med Eng Phys 29:76–86
Cristianini N, Shawe-Taylor J (2000) An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press, Cambridge
Delen D, Walker G, Kadam A (2005) Predicting breast cancer survivability: a comparison of three data mining methods. Artif Intell Med 34:113–127
Dorigo M, Maniezzo V, Colorni A (1996) The ant system: optimization by a colony of cooperating agents. IEEE Trans Syst Man Cybern Part C Appl Rev 26:29–41
Ekici S (2012) Support vector machines for classification and locating faults on transmission lines. Appl Soft Comput 12:1650–1658
Elveren E, Yumuşak N (2011) Tuberculosis disease diagnosis using artificial neural network trained with genetic algorithm. J Med Syst 35:329–332
Er O, Sertkaya C, Temurtas F, Tanrikulu AC (2009) A comparative study on chronic obstructive pulmonary and pneumonia diseases diagnosis using neural networks and artificial immune system. J Med Syst 33:485–492
Fana CY, Changb PC, Linb JJ, Hsiehb JC (2011) A hybrid model combining case-based reasoning and fuzzy decision tree for medical data classification. Appl Soft Comput 11:632–644
Farmer JD, Packard NH, Perelson AS (1986) The immune system, adaptation and machine learning. Phys D 2:187–204
Fausett LV (1994) Fundamentals of neural networks: architectures, algorithms, and applications. Prentice-Hall, New Jersey
Fawcett T (2006) An introduction to ROC analysis. Pattern Recognit Lett 27:861–874
Folland R, Hines E, Dutta R, Boilot P, Morgan D (2004) Comparison of neural network predictors in the classification of tracheal-bronchial breath sounds by respiratory auscultation. Artif Intell Med 31:211–220
Gao WF, Liu SY, Huang LL (2012) Particle swarm optimization with chaotic opposition based population initialization and stochastic search technique. Commun Nonlinear Sci Numer Simul 17:4316–4327
García S, Fernández A, Herrera F (2009) Enhancing the effectiveness and interpretability of decision tree and rule induction classifiers with evolutionary training set selection over imbalanced problems. Appl Soft Comput 9:1304–1314
Gori M, Tesi A (1992) On the problem of local minima in backpropagation. IEEE Trans Pattern Anal Mach Intell 14:76–85
Gurney KN (1997) An introduction to neural networks. Routledge
Handels H, Roß TH, Kreusch J, Wolff HH, Poppl SJ (1999) Feature selection for optimized skin tumor recognition using genetic algorithms. Artif Intell Med 16:283–297
Hecht-Nielsen R (1987) Kolmogorov’s mapping neural network existence theorem. In: Proceedings of IEEE international conference on neural networks, pp 11–14
Heckerling PS, Gerber BS, Tape TG, Wigton RS (2004) Use of genetic algorithms for neural networks to predict community-acquired pneumonia. Artif Intell Med 30:71–84
Hornik K, Stinchcombe M, White H (1989) Multilayer feedforward networks are universal approximators. Neural Netw 2:359–366
Hrstka O, Kučerová A (2004) Improvements of real coded genetic algorithms based on differential operators preventing premature convergence. Adv Eng Softw 35:237–246
Huang ML, Hung YH, Chen WY (2010) Neural network classifier with entropy based feature selection on breast cancer diagnosis. J Med Syst 34:865–873
Jerez JM, Molina I, García-Laencina PJ, Alba E, Ribelles N, Martín M, Franco L (2010) Missing data imputation using statistical and machine learning methods in a real breast cancer problem. Artif Intell Med 50:105–115
Karabatak M, Ince MC (2009) An expert system for detection of breast cancer based on association rules and neural network. Expert Syst Appl 36:3465–3469
Karaboga D (2005) An idea based on honey bee swarm for numerical optimization. Technical Report, TR06, Erciyes University
Kayaer K, Yildirim T (2003) Medical diagnosis on Pima Indian diabetes using general regression neural networks. In: Proceedings of international conference on artificial neural networks and neural information, pp 181–184
Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Proceedings of IEEE international conference on neural networks, pp 1942–1948
Kisi O, Ozturk O (2007) Adaptive neuro-fuzzy computing technique for evapotranspiration estimation. J Irrig Drainage Eng ASCE 133:368–379
Kuncheva LI (2005) Real medical data sets. Tech. Rep. School of Informatics: University of Wales, Bangor. http://www.bangor.ac.uk/~mas00a/activities/real_data.htm
Leung Y, Gao Y, Xu ZB (1997) Degree of population diversity—a perspective on premature convergence in genetic algorithms and its Markov chain analysis. IEEE Trans Neural Netw 8:1165–1176
Liang JJ, Qin AK, Suganthan PN, Baskar S (2006) Comprehensive learning particle swarm optimizer for global optimization of multimodal functions. IEEE Trans Evol Comput 10:281–295
Mitra V, Wang CJ (2008) Content based audio classification: a neural network approach. Soft Comput 12:639–646
Moslemipour G, Lee TS, Rilling D (2012) A review of intelligent approaches for designing dynamic and robust layouts in flexible manufacturing systems. Int J Adv Manuf Technol 60:11–27
Oreski S, Oreski D, Oreski G (2012) Hybrid system with genetic algorithm and artificial neural networks and its application to retail credit risk assessment. Expert Syst Appl 39:12605–12617
Ozkan C, Kisi O, Akay B (2011) Neural networks with artificial bee colony algorithm for modeling daily reference evapotranspiration. Irrig Sci 29:431–441
Park J, Sandberg I (1991) Universal approximation using radial-basis function networks. Neural Comput 3:246–257
Prechelt L (1995) Some notes on neural learning algorithm benchmarking. Neurocomputing 9:343–347
Qasem SN, Shamsuddin SM (2011) Radial basis function network based on time variant multi-objective particle swarm optimization for medical diseases diagnosis. Appl Soft Comput 11:1427–1438
Quinlan JR (1993) C4.5: Programs for Machine Learning. Morgan Kaufmann
Rashedi E, Nezamabadi-pour H, Saryazdi S (2009) GSA: a gravitational search algorithm. Inf Sci 179:2232–2248
Reynolds AP, Iglesia B (2009) A multi-objective GRASP for partial classification. Soft Comput 13:227–243
Rumelhart DE, McClelland JL (1986) Parallel distributed processing: explorations in the microstructure of cognition. MIT, Cambridge
Samanta B, Al-Balushi KR, Al-Araimi SA (2003) Artificial neural networks and support vector machines with genetic algorithm for bearing fault detection. Eng Appl Artif Intell 16:657–665
Saxena A, Saad A (2007) Evolving an artificial neural network classifier for condition monitoring of rotating mechanical systems. Appl Soft Comput 7:441–454
Schutz B (2003) Gravity from the ground up. Cambridge University Press, Cambridge
Setiono R, Hui LCK (1995) Use of a quasinewton method in a feedforward neural network construction algorithm. IEEE Trans Neural Netw 6:740–747
Shi Y, Eberhart R (1998) A modified particle swarm optimizer. In: Proceedings of IEEE international conference on evolutionary computation, pp 69–73
Sivagaminathan RK, Ramakrishnan S (2007) A hybrid approach for feature subset selection using neural networks and ant colony optimization. Expert Syst Appl 33:49–60
Socha K, Blum C (2007) An ant colony optimization algorithm for continuous optimization: application to feed-forward neural network training. Neural Comput Appl 16:235–247
Sulaiman SI, Abdul-Rahman TK, Musirin I, Shaari S (2012) An artificial immune-based hybrid multi-layer feedforward neural network for predicting grid-connected photovoltaic system output. Energy Proc 14:260–264
Tang KS, Man KF, Kwong S, He Q (1996) Genetic algorithms and their applications. IEEE Signal Process Mag 13:22–37
Tayefeh-Mahmoudi M, Taghiyareh F, Forouzideh N, Lucas C (2013) Evolving artificial neural network structure using grammar encoding and colonial competitive algorithm. Neural Comput Appl 22:1–16
Temurtas F (2007) A comparative study on thyroid disease diagnosis using neural networks. Expert Syst Appl 36:944–949
Tsoulos I, Gavrilis D, Glavas E (2008) Neural network construction and training using grammatical evolution. Neurocomputing 72:269–277
Wilcoxon F (1945) Individual comparisons by ranking methods. Biometrics Bull 1:80–83
Yao X (1999) Evolving artificial neural networks. Proc IEEE 87:1423–1447
Zhang C, Shao H, Li Y (2000) Particle swarm optimization for evolving artificial neural network. In: Proceedings of IEEE international conference on systems, man, and sybernetics, pp 2487–2490
Ziver AK, Pain CC, Carter JN, de Oliveira CRE, Goddard AJ, Overton RS (2004) Genetic algorithms and artificial neural networks for loading pattern optimization of advanced gas-cooled reactors. Ann Nucl Energy 13:431–457
Acknowledgments
This research is supported by Research University Grant (03H72), Universiti Teknologi Malaysia. The authors would like to thank Soft Computing Research Group (SCRG), Universiti Teknologi Malaysia (UTM), Johor Bahru Malaysia, for supporting this study. Also, they give kind respect and special thanks to Dr. Zhaleh Beheshti, for proof reading the manuscript and providing valuable comments.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by D. Liu.
Rights and permissions
About this article
Cite this article
Beheshti, Z., Shamsuddin, S.M.H., Beheshti, E. et al. Enhancement of artificial neural network learning using centripetal accelerated particle swarm optimization for medical diseases diagnosis. Soft Comput 18, 2253–2270 (2014). https://doi.org/10.1007/s00500-013-1198-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-013-1198-0