Abstract
Big Data (BD) and cloud computing (CC) are two widely used technologies and a focus of study in several industries. Among these, healthcare sources generate a tremendous amount of data daily. Traditional processing techniques cannot handle this data because it is huge; besides being large, it is also dynamic and diverse. BD technologies store, process, and analyze such large data sets. However, as the volume of data increases, it becomes easier for third parties to hack it. The data is therefore regularly stored in the cloud in encrypted form to protect it from intruders. This paper proposes a secure and efficient medical BD management and classification scheme using an optimal MapReduce (MR) and deep learning framework in a cloud environment. The system comprises four phases: authentication of patients, BD management in the cloud, secure data transfer, and BD classification. First, a patient who wants to upload files to the cloud and access cloud resources is registered with the TC. A hash value is generated using the Whirlpool Hashing (WH) algorithm, and the user credentials are stored in a Blockchain (BC) to protect the network from unauthorized access. Once authentication is successful, the patient can access any data in the cloud. Before a file is uploaded to the cloud, preprocessing is carried out using missing-value imputation, numerical conversion, and normalization, which improves data quality. The preprocessed data is fed into the MR framework, which uses Kernelized K-Means (KKM) clustering and the Enhanced Butterfly Optimization Algorithm (EBOA) to manage the data efficiently. The map-reduced data is then encrypted using Whirlpool Hashing-based Enhanced Rivest-Shamir-Adleman (WHERSA) so that it can be uploaded to the cloud securely. Finally, BD classification is performed using an Enhanced Ant Colony Weight Optimization-based Deep Belief Network (EACWODBN) for disease prediction. The experimental outcomes show that the proposed system outperforms state-of-the-art methods while offering effective and secure data management.
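The preprocessing step named in the abstract (missing-value imputation, numerical conversion, and normalization) can be illustrated with a minimal sketch. The abstract does not state which specific techniques are used, so mean imputation, integer label encoding, and min-max scaling are assumptions chosen for illustration, and the function names, column names, and sample records below are hypothetical rather than taken from the paper.

```python
from statistics import mean


def impute_missing(rows, columns):
    """Replace None in the given numeric columns with the column mean (assumed strategy)."""
    filled = [dict(r) for r in rows]
    for col in columns:
        observed = [r[col] for r in rows if r[col] is not None]
        col_mean = mean(observed)
        for r in filled:
            if r[col] is None:
                r[col] = col_mean
    return filled


def encode_categorical(rows, column):
    """Map each distinct value in `column` to an integer code, assigned in sorted order."""
    codes = {v: i for i, v in enumerate(sorted({r[column] for r in rows}))}
    return [{**r, column: codes[r[column]]} for r in rows]


def min_max_normalize(rows, columns):
    """Rescale each column to [0, 1]; a constant column maps to 0.0."""
    out = [dict(r) for r in rows]
    for col in columns:
        lo = min(r[col] for r in rows)
        hi = max(r[col] for r in rows)
        span = hi - lo
        for r in out:
            r[col] = (r[col] - lo) / span if span else 0.0
    return out


def preprocess(rows, numeric_cols, categorical_cols):
    """Run imputation, numerical conversion, then normalization, in that order."""
    rows = impute_missing(rows, numeric_cols)
    for col in categorical_cols:
        rows = encode_categorical(rows, col)
    return min_max_normalize(rows, numeric_cols + categorical_cols)


# Hypothetical patient records; None marks a missing measurement.
records = [
    {"age": 40, "chol": 200, "sex": "F"},
    {"age": 60, "chol": None, "sex": "M"},
    {"age": 50, "chol": 300, "sex": "F"},
]
clean = preprocess(records, ["age", "chol"], ["sex"])
```

In this sketch the missing `chol` value is filled with the mean of the observed values before scaling, so every record reaches the later stages with a complete, numeric, unit-range feature vector.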
Data availability
The dataset used in the present work is publicly available at https://www.kaggle.com/datasets/johnsmith88/heart-disease-dataset
Funding
Not applicable.
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interest
The authors declare that there are no conflicts of interest regarding the publication of this paper.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Rajeshkumar, K., Dhanasekaran, S. & Vasudevan, V. Efficient and secure medical big data management system using optimal map-reduce framework and deep learning. Multimed Tools Appl 83, 47111–47138 (2024). https://doi.org/10.1007/s11042-023-17381-8
DOI: https://doi.org/10.1007/s11042-023-17381-8