Abstract
Stroke is one of the leading causes of death and serious disability, affecting millions of people worldwide, and it could reach epidemic proportions in the next few decades. Timely detection and prompt decision making play a major role in reducing the chances of brain death, paralysis, and other adverse outcomes. Machine learning algorithms are a popular choice for the diagnosis, analysis, and prediction of this disease, but data quality remains an issue because the data are collected from cross-institutional sources. The present study focuses on improving the quality of stroke data through rigorous pre-processing. It uses a multimodal stroke dataset from the public Kaggle repository. Missing values in the dataset are replaced with attribute means, and the LabelEncoder technique is applied to achieve homogeneity. The dataset was observed to be imbalanced, which means that results obtained on it could be biased and might not reflect the actual accuracy. To overcome this imbalance, a resampling technique was used: with oversampling, some data points in the minority class are replicated to increase its cardinality and rebalance the dataset. The transformed and oversampled data are then normalized using the StandardScaler technique. The antlion optimization (ALO) algorithm is applied to the deep neural network (DNN) model to select optimal hyperparameters in minimal time. The proposed model consumed only 38.13% of the training time, which was also a positive aspect. The experimental results demonstrated the superiority of the proposed model.
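The pre-processing steps named in the abstract (mean imputation, label encoding, oversampling by replication, standardization) can be illustrated with a minimal sketch. This is not the authors' implementation: it uses only the Python standard library in place of scikit-learn's LabelEncoder and StandardScaler, the column names and toy values are hypothetical, and the oversampling shown is plain random replication of minority-class rows, which is an assumption about the variant used.

```python
import random
from statistics import mean, pstdev


def impute_mean(column):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in column if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in column]


def label_encode(column):
    """Map each category to an integer code, assigned in sorted order."""
    mapping = {label: i for i, label in enumerate(sorted(set(column)))}
    return [mapping[v] for v in column]


def oversample(rows, labels, seed=0):
    """Replicate randomly chosen rows of smaller classes until all classes
    have as many rows as the largest one. Output is grouped by class."""
    rng = random.Random(seed)
    by_class = {}
    for row, y in zip(rows, labels):
        by_class.setdefault(y, []).append(row)
    target = max(len(members) for members in by_class.values())
    out_rows, out_labels = [], []
    for y, members in by_class.items():
        extra = [rng.choice(members) for _ in range(target - len(members))]
        for row in members + extra:
            out_rows.append(row)
            out_labels.append(y)
    return out_rows, out_labels


def standardize(rows):
    """Scale every column to zero mean and unit variance (population std)."""
    scaled_columns = []
    for col in zip(*rows):
        mu, sigma = mean(col), pstdev(col)
        sigma = sigma or 1.0  # constant column: avoid division by zero
        scaled_columns.append([(v - mu) / sigma for v in col])
    return [list(r) for r in zip(*scaled_columns)]


# Hypothetical toy data: one numeric column with a gap, one categorical
# column, and an imbalanced binary label (five negatives, one positive).
bmi = [22.0, None, 31.0, 27.0, 25.0, 30.0]
smoking = ["never", "smokes", "never", "formerly", "never", "smokes"]
stroke = [0, 0, 0, 0, 1, 0]

rows = [list(r) for r in zip(impute_mean(bmi), label_encode(smoking))]
rows_bal, labels_bal = oversample(rows, stroke)
features = standardize(rows_bal)
```

The order of operations follows the abstract: imputation and encoding first, then oversampling, then scaling on the rebalanced data. In a real experiment the oversampling and scaling statistics would normally be computed on the training split only, so that replicated rows and test-set statistics do not leak into evaluation; the abstract does not say how the authors handled this.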
Acknowledgments
The work of Saqib Hakak is supported by the University of Northern British Columbia under FUND 15021 ORG 4460.
Cite this article
G, T.R., Bhattacharya, S., Maddikunta, P.K.R. et al. Antlion re-sampling based deep neural network model for classification of imbalanced multimodal stroke dataset. Multimed Tools Appl 81, 41429–41453 (2022). https://doi.org/10.1007/s11042-020-09988-y
DOI: https://doi.org/10.1007/s11042-020-09988-y